Commit Graph

1079 Commits

Author SHA1 Message Date
Patrick Robertson
733aef0b08 Merge pull request #246 from bellingcat/webdriver-cookies
Better checking of cookies to add to webdriver + generic extractor tweaks
2025-03-14 13:35:58 +00:00
erinhmclark
562d06916e Revert pre commit 2025-03-14 13:08:57 +00:00
Patrick Robertson
a8e5585e6c github format 2025-03-14 12:52:01 +00:00
Patrick Robertson
abaeec0cc6 Add ruff check 2025-03-14 12:48:06 +00:00
Patrick Robertson
19715c8ec2 Merge branch 'main' into webdriver-cookies 2025-03-14 12:44:48 +00:00
Erin Clark
a01a873f37 Merge pull request #244 from bellingcat/linting_etc
Add Linting and Formatting with Ruff.

- Add pre-commit hook for formatting (not lint fixing, this should be done manually)
- Move Makefile to project root and add commands for linting, test, docs, Docker
- Add GH Action for ruff to check linting and formatting
- Make suggested changes from current ruff configs
2025-03-14 12:27:04 +00:00
erinhmclark
72f48f0147 Fix merge conflicts. 2025-03-14 12:11:24 +00:00
erinhmclark
846474a4e2 Merge branch 'main' into linting_etc 2025-03-14 10:50:13 +00:00
Patrick Robertson
f504d2e304 Merge branch 'main' into webdriver-cookies 2025-03-14 09:37:12 +00:00
Patrick Robertson
5f7a8b1ac0 Merge pull request #249 from bellingcat/tikwm_dropin
Move tikwm extractor into a drop-in for the generic extractor
2025-03-14 09:28:37 +00:00
erinhmclark
4af3cd7b2a Revert ruff to separate commands. 2025-03-13 21:47:09 +00:00
erinhmclark
ad2784c5de Update style_guide.md 2025-03-13 20:52:21 +00:00
erinhmclark
c7c24fbaf2 Update style_guide.md to clarify pre-commit setup, add Docker commands to Makefile and merge ruff actions. 2025-03-13 20:26:29 +00:00
msramalho
4d67dce4c8 minor log fix 2025-03-13 19:24:05 +00:00
Patrick Robertson
f6b13327f0 Tweaks and additional debug logging 2025-03-13 17:41:41 +00:00
Patrick Robertson
589c834047 Fix parsing ytdlp args - we should first run them through the parse_options method 2025-03-13 17:41:40 +00:00
Patrick Robertson
0efeaaabb1 Revert to using time.sleep and .click() - since we only want to be waiting the first time (for the page to load) 2025-03-13 17:41:16 +00:00
Patrick Robertson
2e25e59fa6 Fix unit tests - make caplog checks more robust, having added a new logger/debug call 2025-03-13 16:07:49 +00:00
Patrick Robertson
10ceb7aa15 Move tikwm extractor into a drop-in for the generic extractor 2025-03-13 15:59:42 +00:00
erinhmclark
0bef78b0b4 Remove autouse property of mock_sleep. 2025-03-13 15:23:35 +00:00
erinhmclark
16012df30b Revert exception check in test. 2025-03-13 13:57:04 +00:00
erinhmclark
8673bc5979 Fix unused imports and include rule. 2025-03-13 13:55:31 +00:00
erinhmclark
e76551ba22 Add documentation, pre-commit hook, more make commands and 2025-03-13 13:21:32 +00:00
erinhmclark
6e52a534e7 More fixes from Bugbear suggestions 2025-03-12 16:07:05 +00:00
erinhmclark
753c3c6214 Linting tests 2025-03-12 14:27:45 +00:00
erinhmclark
79f576be1d Run fix on tests. 2025-03-12 10:38:16 +00:00
erinhmclark
94aeee8313 Move Makefile to the root of the project and add commands for tests, linting and running docker. 2025-03-12 10:37:30 +00:00
erinhmclark
abc90b19d5 Update pyproject.toml 2025-03-12 10:35:56 +00:00
erinhmclark
8ca7698fa0 Move Makefile and fix import error with unused import. 2025-03-11 19:58:02 +00:00
erinhmclark
28c5396b74 Move ruff to dev dependencies. 2025-03-11 17:25:24 +00:00
Patrick Robertson
7a81ab617a Better checking of cookies to add to webdriver 2025-03-11 11:57:25 +00:00
erinhmclark
81aa343f21 Merge main. 2025-03-11 10:45:07 +00:00
erinhmclark
441f341139 Merge branch 'main' into linting_etc
# Conflicts:
#	src/auto_archiver/core/consts.py
#	src/auto_archiver/core/orchestrator.py
#	src/auto_archiver/core/storage.py
#	src/auto_archiver/modules/local_storage/local_storage.py
#	src/auto_archiver/modules/s3_storage/s3_storage.py
#	tests/storages/test_S3_storage.py
#	tests/storages/test_local_storage.py
#	tests/storages/test_storage_base.py
2025-03-11 10:39:47 +00:00
Patrick Robertson
e2442b2f6b Merge pull request #243 from bellingcat/fix-long-path-names
Unit tests for storage types + fix storage too long issues for local storage
2025-03-11 10:05:09 +00:00
erinhmclark
e7fa88f1c7 Implementing ruff suggestions. 2025-03-10 21:45:30 +00:00
erinhmclark
ca44a40b88 Ruff fix on src. 2025-03-10 19:03:45 +00:00
erinhmclark
85abe1837a Ruff format with defaults. 2025-03-10 18:44:54 +00:00
Miguel Sozinho Ramalho
3fcec57492 minor string fix 2025-03-10 17:17:59 +00:00
Patrick Robertson
2b91dc9514 Fix up unit tests 2025-03-10 16:51:16 +00:00
Patrick Robertson
a9c3477289 Improve docs on the path_generator and filename_generator config options 2025-03-10 16:43:14 +00:00
Patrick Robertson
770f4c8a3d Refactoring of storage code:
1. Fix some bugs in local_storage
2. Refactor unit tests to not set Media.key explicitly (unless it's well-known beforehand, which it isn't)
3. Limit length of URL for 'url' type path_generator
4. Throw an error if 'save_to' of local storage is too long
5. A few other tidyups
2025-03-10 16:39:48 +00:00
erinhmclark
cbb0414e5f Switch to ruff 2025-03-10 16:05:23 +00:00
erinhmclark
f4f2424eb5 Add black and flake8 2025-03-10 13:15:11 +00:00
Miguel Sozinho Ramalho
58bd38e292 Adds new extractor for tiktok via unofficial API (#237)
* minor update to defaults in api_db

* readme typo

* adds and tests new tikwm tiktok downloader

* addresses PR comments
2025-03-10 11:56:45 +00:00
Patrick Robertson
e89a8da3b4 Unit tests for storage types + fix storage too long issues for local storage 2025-03-10 11:30:15 +00:00
Erin Clark
ce46a8a7ac Merge pull request #240 from bellingcat/update_release
Update project version to: 0.13.5.

Update the release process docs and the latest version in pyproject.toml
v0.13.5
2025-03-07 18:21:17 +00:00
erinhmclark
7e10040bbd Update the release description to tag on release 2025-03-07 18:04:51 +00:00
erinhmclark
b386ae6287 Add poetry.lock and pyproject.toml paths to trigger tests. 2025-03-07 18:01:08 +00:00
erinhmclark
1a2d9de819 Update the release process docs and the latest version in pyproject.toml 2025-03-07 17:33:32 +00:00
Erin Clark
4c21795d5f Merge pull request #226 from bellingcat/merge_modules
Merge modules with multi-functionality:
- gsheet_feeder and gsheet_db are now one module, gsheet_feeder_db
- atlos_feeder, atlos_db and atlos_storage are now one module, atlos_feeder_db_storage.

This pull request also adds documentation and updates references.
2025-03-07 16:47:30 +00:00