Commit Graph

  • 5490947657 Add packaging to Poetry. erinhmclark 2024-12-31 14:09:38 +00:00
  • fd9a6c26ed Create Poetry environment. erinhmclark 2024-12-31 11:46:53 +00:00
  • 3546d4ad79 Fix 'download_syndication' method for tweet archiving (now requires a token) Patrick Robertson 2025-01-12 12:50:23 +01:00
  • c932fb7416 Improved logging when an invalid/deleted tweet is attempted to be downloaded Patrick Robertson 2025-01-12 12:00:45 +01:00
  • f29950905c Merge branch 'main' into small_issues Patrick Robertson 2025-01-12 11:47:55 +01:00
  • 8e99d62c97 Merge pull request #165 from bellingcat/fix/snscrape Patrick Robertson 2025-01-09 11:06:14 +01:00
  • 9dc4eb35de Switch to pytest and use vcr for request storing Patrick Robertson 2025-01-08 11:25:13 +01:00
  • 8c044c15f0 Add base test class for archivers with boilerplate code Patrick Robertson 2025-01-07 19:43:20 +01:00
  • ab9335bb7a Merge branch 'main' into feat/unittest Patrick Robertson 2025-01-08 10:35:45 +01:00
  • add83c9650 Remove snscrape from twitter_archiver Patrick Robertson 2025-01-07 19:40:19 +01:00
  • a697f0a212 adds an unauthenticated Bluesky archiver (#160) Miguel Sozinho Ramalho 2025-01-07 10:28:07 +00:00
  • bffa3a6254 Merge pull request #159 from bellingcat/print_pdf Patrick Robertson 2025-01-06 18:13:38 +01:00
  • ef471f41e1 adds better debug for wayback failures (#161) Miguel Sozinho Ramalho 2025-01-06 16:49:11 +00:00
  • 928518cda7 Allow setting cookies for yt-dl (#158) Patrick Robertson 2025-01-06 17:19:53 +01:00
  • 1bd017000e Add Github CI test workflow Patrick Robertson 2024-12-31 15:20:33 +01:00
  • 33e967ce4b Update pipfile for: Patrick Robertson 2024-12-31 15:20:11 +01:00
  • 30d423c8e6 Setup a basic framework for unit tests Patrick Robertson 2024-12-31 11:51:43 +01:00
  • 0c803f15a5 Fix showing preview images in the .html file when using local storage Patrick Robertson 2024-12-31 09:29:31 +01:00
  • a46f9997ea Better logging when there's a timestamp parse error Patrick Robertson 2024-12-31 09:28:08 +01:00
  • 83da9ae089 adds pdf preview support for html formatter msramalho 2024-12-23 18:19:26 +00:00
  • 82c00d491d Option to provide cookies for use by ytdl, fixes #150 youtube-cookies Patrick Robertson 2024-12-18 12:55:31 +03:00
  • 663c8ad93a Add 'print_pdf' option to the screenshot enricher. Fixes #132 Patrick Robertson 2024-12-18 13:37:44 +03:00
  • e49550163f adds proxy_server option to wacz msramalho 2024-10-06 10:45:34 +06:00
  • e6f5981afc numpy version downgrade msramalho 2024-10-06 10:10:04 +06:00
  • c62bf1a34d yt-dlp version bump msramalho 2024-10-05 17:43:07 +06:00
  • b166d57e61 v0.12.0 bump v0.12.0 msramalho 2024-08-21 13:34:34 +01:00
  • 11c3288267 closes #146 msramalho 2024-08-21 13:33:58 +01:00
  • 004143a58a version bump v0.11.6 v0.11.6 msramalho 2024-07-18 11:27:39 +01:00
  • 686f0027c4 adds new entries to example orchestration file msramalho 2024-07-18 11:27:15 +01:00
  • b03cf32c73 Bump authlib from 1.3.0 to 1.3.1 (#144) dependabot[bot] 2024-07-18 11:26:22 +01:00
  • dc9e64397e bumping yt-dlp msramalho 2024-07-18 11:23:09 +01:00
  • c7bc5e2988 cleanup msramalho 2024-05-15 11:04:29 +01:00
  • 1e375bd740 version bump v0.11.5 msramalho 2024-05-14 16:42:15 +01:00
  • f8824691dd refactors free twitter archiver strategies (#142) v0.11.4 Miguel Sozinho Ramalho 2024-05-14 16:23:33 +01:00
  • 012cc36609 removes deprecated datetime method msramalho 2024-05-14 15:54:50 +01:00
  • 7cfe1e39cc #135 fix cleanup of telethon session files (#139) v0.11.3 Miguel Sozinho Ramalho 2024-04-16 12:45:45 +01:00
  • a455728673 version bump fix/135 msramalho 2024-04-16 12:44:42 +01:00
  • 8d4357a22c closes #135 msramalho 2024-04-16 12:44:32 +01:00
  • cf8691bad7 Add yt-dlp based archiving for TwitterArchiver (#138) v0.11.2 Jett Chen 2024-04-16 02:54:55 +08:00
  • f603400d0d Add direct Atlos integration (#137) v0.11.1 R. Miles McCain 2024-04-15 22:25:17 +04:00
  • eb37f0b45b version bump msramalho 2024-04-15 19:02:54 +01:00
  • 75497f5773 minor bug fix when using an archiver_enricher in enrichers only msramalho 2024-04-15 19:02:40 +01:00
  • 623e555713 dependencies updates msramalho 2024-04-15 19:02:20 +01:00
  • 9c7824de57 browsertrix docker updates msramalho 2024-04-15 19:01:55 +01:00
  • f4827770e6 adds instagram no stories as success, and fix for telethon-based archivers. v0.10.1 msramalho 2024-03-05 14:49:10 +00:00
  • 601572d76e strip url v0.10.0 msramalho 2024-02-29 11:54:01 +00:00
  • d21e79a272 general security updates msramalho 2024-02-29 11:40:30 +00:00
  • ccf5f857ef adds configurable limits to instagram/youtube v0.9.11 msramalho 2024-02-25 15:14:17 +00:00
  • 7de317d1b5 avoiding exception v0.9.10 msramalho 2024-02-23 15:54:33 +00:00
  • 70075a1e5e improving insta archiver v0.9.9 msramalho 2024-02-23 15:37:28 +00:00
  • 5b9bc4919a version bump v0.9.8 msramalho 2024-02-23 14:08:23 +00:00
  • f0158ffd9c adds tagged posts and better parsing msramalho 2024-02-23 14:08:17 +00:00
  • bfb35a43a9 adds more details from yt-dlp msramalho 2024-02-23 14:08:05 +00:00
  • ef5b39c4f1 dind exception v0.9.7 msramalho 2024-02-22 18:05:56 +00:00
  • 24ceafcb64 missing forward slash v0.9.6 msramalho 2024-02-22 17:47:13 +00:00
  • 9fd4bb56a8 new attempt at dind wacz v0.9.5 msramalho 2024-02-22 17:24:27 +00:00
  • 5324d562ba cleanup wacz patch v0.9.4 msramalho 2024-02-21 18:14:30 +00:00
  • 5bf0a0206d version update v0.9.3 msramalho 2024-02-21 17:26:07 +00:00
  • 4941823565 fix growing volume size in wacz_enricher msramalho 2024-02-21 17:25:55 +00:00
  • 27310c2911 fixes issue with api requests v0.9.2 msramalho 2024-02-21 12:25:05 +00:00
  • eb973ba42d v0.9.1 fixes to bad parsing in ssl certificates v0.9.1 msramalho 2024-02-20 19:31:19 +00:00
  • 7a21ae96af V0.9.0 - closes several open issues: new enrichers and bug fixes (#133) v0.9.0 Miguel Sozinho Ramalho 2024-02-20 18:05:29 +00:00
  • 5c49124ac6 Merge branch 'main' of https://github.com/bellingcat/auto-archiver msramalho 2024-02-13 15:44:53 +00:00
  • b9d71d0b3f Change submit-archive from basic to bearer auth (#128) Kai 2024-02-06 05:24:15 -10:00
  • b9b831ce03 v8.0.1 v0.8.1 msramalho 2024-02-01 15:08:55 +00:00
  • 2a773a25e8 better handling of telethon data display msramalho 2024-02-01 15:08:23 +00:00
  • 719645fc2d minor improvement to html_template msramalho 2024-02-01 15:03:00 +00:00
  • 71fcf5a089 fix: Correct the path of service account in google drive settings (#123) Chu-An, Huang 2024-02-01 23:02:04 +08:00
  • 590d3fe824 Fix typo in readme (#121) Tomas Apodaca 2024-01-24 13:17:31 -08:00
  • e6b6b83007 0.8.0 new features and dependency updates (#119) v0.8.0 Miguel Sozinho Ramalho 2023-12-20 14:13:22 +00:00
  • 499832d146 fix datetime parsing v0.7.10 msramalho 2023-12-13 18:41:48 +00:00
  • fa1163532b patching now optional value v0.7.9 msramalho 2023-12-13 13:55:31 +00:00
  • 96f6ea8f09 v0.7.8 v0.7.8 msramalho 2023-12-13 13:03:39 +00:00
  • ff17dfd0aa enables option to toggle db api writes (#118) Miguel Sozinho Ramalho 2023-12-13 12:54:47 +00:00
  • 345e03e916 enables option to toggle db api writes v0.7.7 msramalho 2023-12-13 12:54:12 +00:00
  • 0a3053bbc7 version update v0.7.6 msramalho 2023-12-13 11:29:13 +00:00
  • e69660be82 chooses most complete result from api (#117) Miguel Sozinho Ramalho 2023-12-13 11:28:27 +00:00
  • a786d4bb0e chooses most complete result from api (#116) v0.7.5 Miguel Sozinho Ramalho 2023-12-13 11:26:46 +00:00
  • 128d4136e3 fixes empty api search results (#115) v0.7.4 Miguel Sozinho Ramalho 2023-12-13 10:51:25 +00:00
  • 98fb574d89 fixing older db entries formats (#114) v0.7.3 Miguel Sozinho Ramalho 2023-12-12 22:47:54 +00:00
  • 6f36e92e02 enables api_db cache queries if configured with new option (#113) v0.7.2 Miguel Sozinho Ramalho 2023-12-12 19:20:26 +00:00
  • 3e56ef137d reduce s3 duplicating while keeping random urls via hash (#112) Miguel Sozinho Ramalho 2023-12-12 19:12:03 +00:00
  • 9ee323a654 Set _mimetype for final media of html formatter (#111) Jett Chen 2023-12-11 19:47:04 +08:00
  • 9eb39943c7 Extract text in wacz_enricher (#110) Kai 2023-12-05 23:24:12 +01:00
  • 8624e9f177 version update 0.7.1 v0.7.1 msramalho 2023-11-13 11:58:43 +01:00
  • 381940f5a8 Fix Selenium headless invokation (#106) Galen Reich 2023-11-13 11:56:35 +01:00
  • 1382f8b795 version bump and release without commit v0.6.13 msramalho 2023-09-22 10:18:58 +01:00
  • fac8364762 Updated gd.py to work with shared folders (#102) Dave Mateer 2023-09-22 10:17:54 +01:00
  • 0feeb0bd24 Bump version to v0.6.12 for release v0.6.12 msramalho 2023-09-20 10:18:44 +01:00
  • ddb9dc87d7 unfortunately needed twitter->x msramalho 2023-09-20 10:17:31 +01:00
  • e8935b9a80 Bump version to v0.6.11 for release v0.6.11 msramalho 2023-09-15 19:53:07 +01:00
  • b157f9a6b1 renaming variable msramalho 2023-09-15 19:52:47 +01:00
  • ea38a604bb fixes #96 by not assigning to self.prop msramalho 2023-09-15 19:35:35 +01:00
  • 53494c961e Bump version to v0.6.10 for release v0.6.10 msramalho 2023-09-14 17:50:08 +01:00
  • f7839a99cc Add configs for path to write and read wacz archives (#93) Kai 2023-09-14 18:49:37 +02:00
  • 7a2119e6e9 Bump version to v0.6.9 for release v0.6.9 msramalho 2023-09-12 20:08:00 +01:00
  • 3ae25e51e7 adds flexibile setup for wacz in docker (#94) Miguel Sozinho Ramalho 2023-09-12 20:07:21 +01:00
  • 9584193d69 Bump version to v0.6.8 for release v0.6.8 msramalho 2023-09-08 15:10:02 +01:00
  • 0dd45d90f1 fix: docker+wacz troubles msramalho 2023-09-08 15:09:50 +01:00
  • edcb2da74a Bump version to v0.6.7 for release v0.6.7 msramalho 2023-09-06 17:07:14 +01:00