Commit Graph

  • c6a11298ac Fix missing linebreak in locals dump output JustAnotherArchivist 2019-05-16 22:22:21 +00:00
  • 02cbf6ddf6 Dump locals to a temporary file in case of an exception JustAnotherArchivist 2019-05-16 18:29:30 +00:00
  • 3817aa59d4 Add support for extracting links from tweets (including cards) JustAnotherArchivist 2019-05-16 16:42:52 +00:00
  • 46a51008f8 Fix Instagram signature calculation JustAnotherArchivist 2019-05-16 16:19:51 +00:00
  • f91979eb32 Add --max-position option to twitter-search scraper as a workaround for pagination stopping early (#37) JustAnotherArchivist 2019-05-10 17:30:15 +00:00
  • 85fff319bc Disable Twitter's spelling correction JustAnotherArchivist 2019-05-10 16:43:59 +00:00
  • 6b145526b7 Update README with new modules JustAnotherArchivist 2019-04-21 23:10:32 +02:00
  • abf31764b1 Version 0.2.0 v0.2.0 JustAnotherArchivist 2019-04-21 23:03:21 +02:00
  • 64693f74bb Update Instagram query hash JustAnotherArchivist 2019-04-19 01:47:38 +02:00
  • a7d08ed51c Remove leftover debugging print JustAnotherArchivist 2019-04-19 01:40:29 +02:00
  • f48ca7726e Add support for Gab JustAnotherArchivist 2019-04-19 00:40:43 +02:00
  • 78c295f7e0 Add support for VKontakte (fixes #13) JustAnotherArchivist 2019-04-18 18:39:21 +02:00
  • a5aca1a14f Add support for Instagram hashtags (fixes #29) JustAnotherArchivist 2019-04-18 16:14:54 +02:00
  • 96f7d871c1 Ignore Scraper subclasses which don't set a name JustAnotherArchivist 2019-04-18 16:14:26 +02:00
  • b5dfd37949 Support unix timestamps in --since JustAnotherArchivist 2019-04-18 16:01:35 +02:00
  • b511397791 Add --since option to return only results newer than a certain date (fixes #19) JustAnotherArchivist 2019-04-18 15:12:29 +02:00
  • 536fcb3303 Return proper items from scrapers including clean URLs (fixes #9 and #10) JustAnotherArchivist 2019-04-18 14:44:21 +02:00
  • f8d812f799 Include permalink.php, events, and notes (fixes #32) JustAnotherArchivist 2019-04-18 04:22:47 +02:00
  • c2cebd9166 Accept-Language header to get an English response unconditionally JustAnotherArchivist 2019-04-18 03:58:37 +02:00
  • 73bc99596f Treat Twitter responses without a Content-Type header as invalid (fixes #21) JustAnotherArchivist 2019-04-18 02:24:35 +02:00
  • 8458c12218 Rewrite link extraction on Facebook (fixes #17) JustAnotherArchivist 2019-04-18 02:09:42 +02:00
  • b59c7e8d8f Merge pull request #28 from peterk/master JustAnotherArchivist 2019-03-11 13:32:07 +01:00
  • 3ceb849d98 Adds socks proxy support (via requests) Peter Krantz 2019-01-10 22:54:42 +01:00
  • f5ee1f7ac5 Merge pull request #26 from ludios/avoid-twitter-bans JustAnotherArchivist 2018-12-25 02:19:17 +01:00
  • 1984110f78 twitter: randomize user agent to avoid Twitter's (IP, UA)-keyed bans Ivan Kozik 2018-10-11 13:02:58 +00:00
  • c5a5dcb92c snscrape is now on PyPI JustAnotherArchivist 2018-10-09 17:26:03 +02:00
  • cfb1c9a2aa Version 0.1.3 v0.1.3 JustAnotherArchivist 2018-10-01 03:26:22 +02:00
  • d0d3c8b2a6 Better log output for temporary failures (fixes #2) JustAnotherArchivist 2018-10-01 03:24:29 +02:00
  • 4d0350e541 Disable "quality filter" on Twitter (fixes #3) JustAnotherArchivist 2018-10-01 02:51:33 +02:00
  • d17aa15bcb Version 0.1.2 v0.1.2 JustAnotherArchivist 2018-09-11 12:44:07 +02:00
  • d1ef280d6e Fix snscrape.modules not getting installed JustAnotherArchivist 2018-09-11 12:43:10 +02:00
  • 2823272e0b Version 0.1.1 v0.1.1 JustAnotherArchivist 2018-09-11 12:30:35 +02:00
  • 540f557002 Fix typo in setup.py preventing installation JustAnotherArchivist 2018-09-11 12:30:21 +02:00
  • 5fc60fe978 Version 0.1 v0.1 JustAnotherArchivist 2018-09-10 22:15:11 +02:00
  • cf36e8be97 Add README, LICENSE, and metadata JustAnotherArchivist 2018-09-10 22:15:03 +02:00
  • 0350ab0692 Fix Facebook scraper returning strings instead of Items JustAnotherArchivist 2018-09-10 19:38:43 +02:00
  • 6b6ae3d33b Rename from socialmediascraper to snscrape JustAnotherArchivist 2018-08-21 22:54:14 +02:00
  • 9fb3ac6013 Add support for Google+ user profiles JustAnotherArchivist 2018-05-18 23:53:27 +02:00
  • 897f5bebe6 Add support for POST requests JustAnotherArchivist 2018-05-18 16:24:14 +02:00
  • e28a2cdb4b Fix Instagram again JustAnotherArchivist 2018-04-26 15:06:28 +02:00
  • 5a084af85c Fix Instagram JustAnotherArchivist 2018-04-25 22:11:10 +02:00
  • 14831d4137 Add support for Facebook user profiles JustAnotherArchivist 2018-04-17 18:46:04 +02:00
  • 6d54655a7f Add support for Instagram user profiles JustAnotherArchivist 2018-03-08 15:15:32 +01:00
  • 3ab69a1a0f Merge Twitter user and hashtag into one, and add support for generic Twitter search scrapes JustAnotherArchivist 2018-03-02 18:55:36 +01:00
  • d03c82d413 Support nested inheritance from socialmediascraper.base.Scraper JustAnotherArchivist 2018-03-02 18:50:32 +01:00
  • 02473876d7 Add milliseconds to the log timestamps JustAnotherArchivist 2018-03-02 18:50:32 +01:00
  • e3190ee541 Add support for Twitter hashtags JustAnotherArchivist 2018-02-09 14:36:26 +01:00
  • 606b81e066 Use a session for proper cookie handling, and add exponential backoff in case of errors JustAnotherArchivist 2018-02-08 14:22:12 +01:00
  • d085018a5f Split up into modules JustAnotherArchivist 2018-02-08 14:00:00 +01:00
  • 1ae006b268 Initial commit JustAnotherArchivist 2018-02-08 13:05:00 +01:00