Commit Graph

  • 9bc6dd5c3c Add set_content into generic_extractor.py. erinhmclark 2025-02-25 20:07:00 +00:00
  • cf1219f798 Add text content into gsheet. erinhmclark 2025-02-25 20:06:44 +00:00
  • 4dcb77c29f Merge branch 'main' into timestamping_rewrite Patrick Robertson 2025-02-25 17:10:55 +00:00
  • 1ad158c016 Merge pull request #211 from bellingcat/docs_improvements Patrick Robertson 2025-02-25 14:13:13 +00:00
  • 1df5129268 Small typos. erinhmclark 2025-02-25 14:08:38 +00:00
  • 73b434aafc Tests for test_vk_extractor.py. erinhmclark 2025-02-25 14:08:28 +00:00
  • 2d276cb9c4 Fix tmp test file. erinhmclark 2025-02-25 14:08:14 +00:00
  • 898faf6fe4 Further WIP - currently working on verify_signed Patrick Robertson 2025-02-25 12:08:08 +00:00
  • 6987a4827e Set poetry packages - remove tsp_client and update cryptography Patrick Robertson 2025-02-25 11:57:20 +00:00
  • f8e846d59a Create facebook dropin - working for images + text. CAVEAT: only gets the first ~100 chars of the post at the moment Patrick Robertson 2025-02-25 11:44:35 +00:00
  • 2d4f1b5b79 Added Bgutils PO token provider. erinhmclark 2025-02-25 10:49:57 +00:00
  • d10c7fbe55 Better documentation based on the discord feedbackgst Patrick Robertson 2025-02-24 22:42:42 +00:00
  • ca1ed418aa Throw an error for invalid __manifest__ syntax + fix: allow default values of False/None Patrick Robertson 2025-02-24 21:46:24 +00:00
  • 01bf88a695 Merge branch 'main' into timestamping_rewrite Patrick Robertson 2025-02-24 12:03:14 +00:00
  • c5127f5fd1 Allow flexible extractor_args in generic_extractor.py. erinhmclark 2025-02-24 11:40:44 +00:00
  • 158d448cbc Add yt-dlp-get-pot and yt-dlp-getpot-wpc requirements. erinhmclark 2025-02-24 11:40:39 +00:00
  • 73a2e2d752 Fix tests for moving orchestration to secrets/orchestration.yaml Patrick Robertson 2025-02-21 19:05:39 +00:00
  • 1c17629ac6 Tweaks Patrick Robertson 2025-02-21 18:54:27 +00:00
  • 7562938151 Proof of concept for settings page Patrick Robertson 2025-02-21 18:04:48 +00:00
  • 091a19e25c Further docs improvements/tidy ups Patrick Robertson 2025-02-21 16:52:30 +00:00
  • 77212e8e3f Finishing touches to the how-tos Patrick Robertson 2025-02-20 15:45:48 +00:00
  • 9661e90a05 Allow disabling logging in auto_archiver with logging: enabled: false Patrick Robertson 2025-02-20 15:45:32 +00:00
  • 0bec71d203 Finish how to on authentication Patrick Robertson 2025-02-20 15:33:50 +00:00
  • 4174285898 Fix unit tests Patrick Robertson 2025-02-20 13:13:01 +00:00
  • eda359a1ef Fix json loader - it should go in 'validators' not 'utils' Patrick Robertson 2025-02-20 13:10:23 +00:00
  • 40488e0869 Use 'Auto Archiver' naming for consistency. Patrick Robertson 2025-02-20 11:50:29 +00:00
  • 061f29c885 How-to on updating config file to version 0.13+ Patrick Robertson 2025-02-20 11:43:23 +00:00
  • cbea551876 Better display name for wayback machine to emphasise it's typically used as an enricher Patrick Robertson 2025-02-20 11:33:00 +00:00
  • b978484a89 Rename wacz_enricher to wacz_extractor_enricher. Fixes #205 Patrick Robertson 2025-02-20 11:32:53 +00:00
  • 49b6c32058 Fix the 'full' mode which creates a complete config file Patrick Robertson 2025-02-20 11:29:36 +00:00
  • 4b51ec9ad5 Remove dangling import Patrick Robertson 2025-02-20 11:20:16 +00:00
  • 7734a551fa Move 'assert_valid_url' out into utils, don't use assert but raise Patrick Robertson 2025-02-20 10:57:30 +00:00
  • 77b2b099c6 Replace exit() with raise exceptions. Better for code implementations Patrick Robertson 2025-02-19 15:32:45 +00:00
  • 40b8359348 Implementation test with 2 x orchestrators with different configs Patrick Robertson 2025-02-19 15:30:37 +00:00
  • 5ccea8e44a Absolute paths in README for Github/PyPi/Dockerhub etc. Patrick Robertson 2025-02-19 15:30:13 +00:00
  • 7dde8d609d Merge main Patrick Robertson 2025-02-20 10:29:57 +00:00
  • 6ea943b680 Fix link Patrick Robertson 2025-02-20 10:27:24 +00:00
  • 5211c5de18 Merge pull request #210 from bellingcat/logger_fix v0.13.4 Patrick Robertson 2025-02-19 15:11:42 +00:00
  • 6cdefaa751 Merge pull request #194 from bellingcat/tests/add_module_tests Erin Clark 2025-02-19 13:51:43 +00:00
  • 04507577b6 Version bump Patrick Robertson 2025-02-19 13:36:50 +00:00
  • 47a634fc63 Add WACZ, Wayback and local storage tests. erinhmclark 2025-02-19 13:14:08 +00:00
  • a9802dd004 Remove the global _LAZY_LOADED_MODULES and allow each instance of ArchivingOrchestrator to load its own modules Patrick Robertson 2025-02-19 12:25:35 +00:00
  • a8ffb19325 Fix auth key name for cookies_from_browser. erinhmclark 2025-02-19 10:40:54 +00:00
  • 222a94563f WIP: Docs tidyups+add howto on logging and authentication Patrick Robertson 2025-02-19 10:29:05 +00:00
  • eb60b271b9 Fix issue #200 Patrick Robertson 2025-02-19 10:28:35 +00:00
  • ddf2e76624 Include Atlos Storage __init__.py for module recognition. erinhmclark 2025-02-19 09:24:34 +00:00
  • 10a5ad62b8 Include Atlos tests, metadata fixture. erinhmclark 2025-02-19 09:18:41 +00:00
  • f0fd9bf445 Updates tests to use pytest-mock. erinhmclark 2025-02-18 23:32:03 +00:00
  • 657fbd357d Merge branch 'main' into tests/add_module_tests erinhmclark 2025-02-18 19:47:47 +00:00
  • 7b88df72cb Update test_metadata_enricher.py erinhmclark 2025-02-18 19:46:57 +00:00
  • 3c543a3a6a Various fixes for issues with new architecture (#208) v0.13.3 Patrick Robertson 2025-02-18 19:10:09 +00:00
  • ce5a200d1f Added tests, updated instagram_tbot_extractor.py raise failure. erinhmclark 2025-02-18 12:59:10 +00:00
  • f4c623b11b Merge branch 'main' into tests/add_module_tests erinhmclark 2025-02-17 09:03:04 +00:00
  • 6d43bc7d4d Fix generator programmatic setup (#197) Patrick Robertson 2025-02-15 18:36:44 +01:00
  • 9297697ef5 makes orchestrator.run return the results to allow for code integration (#196) v0.13.2 Miguel Sozinho Ramalho 2025-02-15 12:41:26 +00:00
  • 8ed3ef2f33 Merge branch 'main' into tests/add_module_tests erinhmclark 2025-02-14 12:47:40 +00:00
  • 5614af3f63 removes fixed oscrypto dependency, it blocked pypi publishing (#195) v0.13.1 Miguel Sozinho Ramalho 2025-02-14 10:51:56 +00:00
  • 71b41dd901 Remove accidental path, yet again. erinhmclark 2025-02-14 10:05:32 +00:00
  • b0756a6a34 Remove accidental full path. erinhmclark 2025-02-14 09:57:44 +00:00
  • 319c1e8f92 Add more tests. erinhmclark 2025-02-14 09:48:37 +00:00
  • 3fce593aad Merge branch 'main' into tests/add_module_tests erinhmclark 2025-02-12 19:33:29 +00:00
  • cbe98c729d Enricher tests erinhmclark 2025-02-12 19:32:40 +00:00
  • 27f9287b65 markdown fixes v0.13.0 Miguel Sozinho Ramalho 2025-02-12 17:37:36 +00:00
  • d9d936c2ca Thumbnail enricher fix seconds to minutes. erinhmclark 2025-02-12 12:22:27 +00:00
  • d849678137 Merge pull request #193 from bellingcat/links Patrick Robertson 2025-02-12 13:11:59 +01:00
  • da267f20d7 Update screenshot refs erinhmclark 2025-02-12 11:54:40 +00:00
  • 70f155dfce add more of the USPs to the readme Patrick Robertson 2025-02-12 11:48:51 +00:00
  • 86254bdd4e Fix link in how to Patrick Robertson 2025-02-12 11:48:01 +00:00
  • 17f13db56c Make that code block a shell Patrick Robertson 2025-02-12 11:45:09 +00:00
  • d776be8a81 Fix links to docs Patrick Robertson 2025-02-12 11:41:54 +00:00
  • 460a71649c Merge pull request #190 from bellingcat/docs_update Patrick Robertson 2025-02-12 12:38:04 +01:00
  • a0c4a82825 Improved docstrings for base modules Patrick Robertson 2025-02-12 11:32:13 +00:00
  • 8054ea96b3 Document double dash between command line args and feeder urls on command line Patrick Robertson 2025-02-12 11:20:15 +00:00
  • de79e17128 better wording in link Patrick Robertson 2025-02-12 11:19:21 +00:00
  • d28d99daa6 Docs tidy ups and re-organising Patrick Robertson 2025-02-12 11:16:17 +00:00
  • 5b481f72ab Tidy ups to manifests for docs Patrick Robertson 2025-02-11 20:03:53 +00:00
  • 4c119b4db8 Add missing manifest for atlos_storage Patrick Robertson 2025-02-11 20:03:45 +00:00
  • d8f47ff9e4 Add multi-type modules to all headings on TOC Patrick Robertson 2025-02-11 19:46:57 +00:00
  • 1ee7981c6e Add YAML config to the module docs Patrick Robertson 2025-02-11 19:42:03 +00:00
  • 22fa9ba456 further tweaks Patrick Robertson 2025-02-11 18:52:28 +00:00
  • 1b976f4c09 Remove unused atlos util functions Patrick Robertson 2025-02-11 18:49:54 +00:00
  • 756f46012b Remove empty file Patrick Robertson 2025-02-11 18:47:54 +00:00
  • 1d69053dd5 Upgrade certifi Patrick Robertson 2025-02-11 18:41:37 +00:00
  • 40a95f7348 Add likify deps Patrick Robertson 2025-02-11 18:40:44 +00:00
  • a307d09e67 Readthedocs now requires all packages for running the pre-build scripts Patrick Robertson 2025-02-11 18:38:41 +00:00
  • 2c87474186 Change path for scripts Patrick Robertson 2025-02-11 18:37:03 +00:00
  • e65a99078f Slight tweaks of toctrees + reordering Patrick Robertson 2025-02-11 18:28:21 +00:00
  • e43dda2817 Merge pull request #185 from bellingcat/load_modules Patrick Robertson 2025-02-11 19:21:46 +01:00
  • 3787577a96 Screenshot enricher depends on geckodriver not chromedriver Patrick Robertson 2025-02-11 18:18:52 +00:00
  • d0c379a3ba WIP - timestamping enricher Patrick Robertson 2025-02-11 18:18:19 +00:00
  • ea728a7a97 TODO on facebook dropin not working Patrick Robertson 2025-02-11 15:55:19 +00:00
  • 3163cb793a Fix timestamping enricher for new module structure (temp paths) Patrick Robertson 2025-02-11 15:26:40 +00:00
  • 91f1ebf7b3 fix temp for yandex new shortlink msramalho 2025-02-11 15:23:16 +00:00
  • c720541de2 merge conflicts msramalho 2025-02-11 15:22:06 +00:00
  • 7bb4d68a22 Merge branch 'load_modules' into timestamping_rewrite Patrick Robertson 2025-02-11 15:21:31 +00:00
  • e507fc81d2 improves mimetype guessing, previously file.sub.something would not have an extension msramalho 2025-02-11 15:02:49 +00:00
  • 5478ed3860 bsky fix media fetching msramalho 2025-02-11 15:02:00 +00:00
  • 47d1dc9d47 typing warnings fixed msramalho 2025-02-11 15:01:37 +00:00
  • 2d87935042 Start on opentimestamps enricher Patrick Robertson 2025-02-11 14:54:46 +00:00
  • 62154ddfef Further tweaks and fixes Patrick Robertson 2025-02-11 14:37:29 +00:00