Commit Graph

  • 65e222e177 fixing typo in documentation pytest -> poetry mgaughan 2025-07-22 17:20:59 -04:00
  • f2eb9ef784 correcting to double-dash in the poetry install documentation mgaughan 2025-07-21 17:55:48 -04:00
  • 2081c16555 embed retry into timestamping msramalho 2025-07-10 14:49:53 +01:00
  • d3efd7121c avoid empty metadata comments msramalho 2025-07-06 14:05:17 +01:00
  • 9d3cd5774b an improved approach for #295 msramalho 2025-07-06 14:04:01 +01:00
  • 80d61e8b85 Merge pull request #341 from bellingcat/dev v1.1.2 Miguel Sozinho Ramalho 2025-07-05 20:28:00 +01:00
  • d36cdbfa87 fixing pypaperclip see issue #339 msramalho 2025-07-05 19:07:23 +01:00
  • c1506ee1cf some wayback errors are expected and should be warnings msramalho 2025-07-05 18:31:39 +01:00
  • 3a34a49822 adds antibot tiktok logic for photos closes #295 msramalho 2025-07-05 18:31:12 +01:00
  • 37c6d97275 new auth wall check logic and escaped CSS selector in selenium msramalho 2025-07-05 18:30:31 +01:00
  • 7234eda85f expands Sheets API retries for really large spreadsheets msramalho 2025-07-05 18:29:33 +01:00
  • a8c1ef3912 generic_extractor config to use proxy only when needed to avoid overzealousness msramalho 2025-07-05 16:54:58 +01:00
  • 52ed8196a5 updates dependencies msramalho 2025-07-05 16:03:47 +01:00
  • 2051e8e491 adds further exponential backoff for Sheets API worksheet enumeration msramalho 2025-07-05 16:02:07 +01:00
  • 21255db86a stops using service that is not up for timestamping msramalho 2025-07-05 16:00:46 +01:00
  • eae0da08b3 fix issue with two runs of anitbot extractor msramalho 2025-07-05 16:00:03 +01:00
  • 0d1447117c updates docs to reflect new general approach extractor msramalho 2025-07-05 15:56:13 +01:00
  • 0f56a5aae5 Merge pull request #331 from bellingcat/dev v1.1.1 Miguel Sozinho Ramalho 2025-06-30 02:36:25 +01:00
  • 649412053e exclude non-ready code msramalho 2025-06-30 02:27:21 +01:00
  • c2c9718f73 make python api tests work on gh when no env is set msramalho 2025-06-30 02:20:51 +01:00
  • 30ea8a0ba4 bumps dependencies msramalho 2025-06-30 02:20:09 +01:00
  • 73c8dc583f closes #333 msramalho 2025-06-30 01:52:22 +01:00
  • b2648fa3cd follow docs advice on exponential backoff of SheetsAPI msramalho 2025-06-30 01:47:12 +01:00
  • 4ad71b3589 adds retry to worksheet read for slow worksheets msramalho 2025-06-30 01:42:34 +01:00
  • 7c9475cde2 allow for human readable console logs, but defaults to JSON on file logs. msramalho 2025-06-30 00:53:10 +01:00
  • afd9090a4c concludes logging standardization refactor msramalho 2025-06-26 17:20:04 +01:00
  • ad29cb4447 adds post_data to metadata for instagram msramalho 2025-06-26 15:48:10 +01:00
  • ce4d7ac649 WIP refactor logging msramalho 2025-06-21 15:54:51 +01:00
  • ade7feb5a0 version bump msramalho 2025-06-18 17:38:17 +01:00
  • 12b457706b closes #166 adds story URL feature to telethon extractor msramalho 2025-06-18 17:37:44 +01:00
  • 592dc30415 closes #330 msramalho 2025-06-18 16:40:55 +01:00
  • 4a36e6f6b0 fix tests msramalho 2025-06-18 13:50:21 +01:00
  • d46eeee9b6 docs improved msramalho 2025-06-18 13:35:51 +01:00
  • 302e6f4258 logs improved msramalho 2025-06-18 13:35:43 +01:00
  • e803c5d0e3 Merge branch 'main' into dev Miguel Sozinho Ramalho 2025-06-18 13:35:21 +01:00
  • e1d0314a9e Merge branch 'dev' of https://github.com/bellingcat/auto-archiver into dev msramalho 2025-06-18 13:26:48 +01:00
  • 5d5119e053 Merge pull request #329 from bellingcat/dev Miguel Sozinho Ramalho 2025-06-18 00:31:09 +01:00
  • d6c90d87f1 installs ffmpeg in readthedocs msramalho 2025-06-18 00:29:36 +01:00
  • 212bf67ab1 installs ffmpeg in readthedocs msramalho 2025-06-18 00:29:36 +01:00
  • 6abe2edb13 Merge pull request #328 from bellingcat/dev Miguel Sozinho Ramalho 2025-06-18 00:22:39 +01:00
  • 03c0cf09ae fix issue with grid in scripts/config_editor @mui lib upgrade msramalho 2025-06-18 00:20:31 +01:00
  • 0db77c7e68 Merge pull request #326 from bellingcat/dependabot/npm_and_yarn/scripts/settings/actions-27795ad889 Miguel Sozinho Ramalho 2025-06-18 00:12:51 +01:00
  • cd6607943d Bump @types/react dependabot[bot] 2025-06-17 22:58:23 +00:00
  • 3869ea73d7 Merge pull request #312 from bellingcat/dev v1.1.0 v1.1.0 Miguel Sozinho Ramalho 2025-06-17 23:57:22 +01:00
  • 918cb220be minor indentation issue msramalho 2025-06-17 23:51:10 +01:00
  • 76fd329fe5 twitter tests fix msramalho 2025-06-17 23:51:03 +01:00
  • a3ae9ebbb3 log level updates msramalho 2025-06-17 20:36:33 +01:00
  • 23b781c866 new check for edge case msramalho 2025-06-17 20:36:22 +01:00
  • 2aec240128 thumbnail enricher always run probe by default msramalho 2025-06-17 20:28:20 +01:00
  • c5a2fd45f9 log levels updated msramalho 2025-06-17 20:04:40 +01:00
  • 216226e7cc browsertrix version bump msramalho 2025-06-17 19:22:20 +01:00
  • ad168785e7 retry for Google API 503s msramalho 2025-06-17 19:22:09 +01:00
  • 74a1561c3d logging and clean up msramalho 2025-06-17 19:21:40 +01:00
  • 55d9ffaacd typo msramalho 2025-06-17 18:51:21 +01:00
  • f19fb575a7 logging updates msramalho 2025-06-17 18:50:54 +01:00
  • f53b2075ba fixes gdrive error msramalho 2025-06-17 18:45:55 +01:00
  • d20486c02a Merge pull request #320 from djhmateer/v1-dm-changes Miguel Sozinho Ramalho 2025-06-17 16:13:37 +01:00
  • 6085a66c58 revert metadata json renaming msramalho 2025-06-17 16:10:24 +01:00
  • 33cca734d9 original_url changes still constitute empty result msramalho 2025-06-17 16:06:25 +01:00
  • 2f1a07abbf renaming and code improvements to json_e richer msramalho 2025-06-17 16:06:04 +01:00
  • 664ee8d037 fixes bugs and limited configuration of multi-level logs msramalho 2025-06-17 14:10:46 +01:00
  • 1b260788de do not add commit comments to code msramalho 2025-06-17 13:18:12 +01:00
  • f0b876e67c removes dev specific instructions msramalho 2025-06-17 13:16:36 +01:00
  • 8067da0f60 custom user to its own file msramalho 2025-06-17 13:15:13 +01:00
  • 6f949738a3 Merge branch 'dev' into v1-dm-changes Miguel Sozinho Ramalho 2025-06-17 13:05:34 +01:00
  • 1b6d85884b complements authentication changes msramalho 2025-06-17 12:54:43 +01:00
  • 7ab804d163 dependencies update msramalho 2025-06-17 12:50:35 +01:00
  • b3adc5603a metadata.json hardcode in storage. add new metadata_json_enricher. log level change in orchestrator Dave Mateer 2025-06-17 09:51:19 +01:00
  • ba3f1a52e8 Logging each_level_in_separate_file feature Dave Mateer 2025-06-16 16:15:54 +01:00
  • a60d800b31 Changed log level for media Dave Mateer 2025-06-16 15:07:39 +01:00
  • f2e80758a7 typo on authentication docs. Updated install docs. Dave Mateer 2025-06-16 14:59:55 +01:00
  • f07fdbc500 Custom local version comment in toml file Dave Mateer 2025-06-16 14:54:15 +01:00
  • b236f2510d Updates to installation docs Dave Mateer 2025-06-16 14:40:40 +01:00
  • 529d8b60bf Gitgnore to include launch.json and installtion docs to include build script. Dave Mateer 2025-06-16 14:37:21 +01:00
  • cd6a2b6031 generic_extractor download tests adaptations msramalho 2025-06-11 20:05:35 +01:00
  • dfb361e3a0 reset generic_extractor description in result msramalho 2025-06-11 19:55:54 +01:00
  • 3d31c7605b Merge pull request #319 from bellingcat/feat/linkedin-antibot Miguel Sozinho Ramalho 2025-06-11 19:42:38 +01:00
  • d7a48e465b fix copypasta msramalho 2025-06-11 18:04:49 +01:00
  • aaa9ead39d adds documentation for dropins msramalho 2025-06-11 17:58:53 +01:00
  • f5be7a50c1 Testing Linkedin Dropin for Antibot msramalho 2025-06-11 16:52:03 +01:00
  • 2adcf231f7 new LinkedIn Dropin for Antibot msramalho 2025-06-11 16:51:52 +01:00
  • cd19181d8f minor improvements msramalho 2025-06-11 16:51:42 +01:00
  • b60469767a more flexibility to antibot dropins media finding process msramalho 2025-06-11 16:51:22 +01:00
  • d60d02c16e improves download_from_url msramalho 2025-06-11 16:50:31 +01:00
  • e567bba6f9 improves docs for how-to and migrations msramalho 2025-06-11 13:37:03 +01:00
  • 3cf51dd874 adds tracker remove feature and tests msramalho 2025-06-11 11:56:42 +01:00
  • 69ddb72146 separate reddit tests msramalho 2025-06-11 11:27:11 +01:00
  • 1039e9631f new reddit tests with .env.test msramalho 2025-06-11 11:22:23 +01:00
  • 79f42c3c41 Merge pull request #318 from bellingcat/feat/antibot-reddit Miguel Sozinho Ramalho 2025-06-10 18:39:34 +01:00
  • 8314833ae8 removes exclude_media_extensions option msramalho 2025-06-10 18:34:33 +01:00
  • 6279610a43 updates docs msramalho 2025-06-10 18:28:45 +01:00
  • fc89d96517 escape sequence msramalho 2025-06-10 18:04:33 +01:00
  • 54fda9cad4 antibot in docker uses a different user_data_dir msramalho 2025-06-10 18:04:27 +01:00
  • 71636233cb adds migration information and VkDropin info. msramalho 2025-06-10 17:07:10 +01:00
  • fdbe96f2e4 vk and reddit should work without credentials but log the error msramalho 2025-06-10 16:44:14 +01:00
  • 22bd8727df python dependencies bump msramalho 2025-06-10 16:43:55 +01:00
  • 499c272260 dependabot switch to monthly msramalho 2025-06-10 16:37:52 +01:00
  • f232bc45b8 Merge pull request #315 from bellingcat/dependabot/docker/webrecorder/browsertrix-crawler-1.6.2 Miguel Sozinho Ramalho 2025-06-10 16:34:30 +01:00
  • 4270e06728 npm update on scripts/settings msramalho 2025-06-10 16:33:47 +01:00
  • ca00aa302d version bump breaking msramalho 2025-06-10 16:31:32 +01:00