40 Commits

Author SHA1 Message Date
Dave Mateer
529d8b60bf Gitignore to include launch.json and installation docs to include build script. 2025-06-16 14:37:21 +01:00
msramalho
cd19181d8f minor improvements 2025-06-11 16:51:42 +01:00
msramalho
1039e9631f new reddit tests with .env.test 2025-06-11 11:22:23 +01:00
msramalho
e2e6490b49 minimal changes 2025-06-07 18:15:21 +01:00
msramalho
05231445d9 removes unnecessary ignored files 2025-06-04 14:19:25 +01:00
msramalho
5cf640af8a experiments with seleniumbase 2025-04-28 11:08:00 +01:00
Patrick Robertson
c4a3a45bf7 Script to auto-generate a service account 2025-03-17 15:42:43 +00:00
Patrick Robertson
92c79ed994 Remove schema.json file from git - is auto-generated on release 2025-03-16 10:27:08 +00:00
Patrick Robertson
f54d6519a8 Fix sorting of steps in the outputted file 2025-03-04 11:51:26 +00:00
Patrick Robertson
7562938151 Proof of concept for settings page 2025-02-21 18:04:48 +00:00
Patrick Robertson
2650cd8fb2 Use a script to auto-generate documentation for the core modules from the manifest file 2025-02-10 22:51:04 +00:00
erinhmclark
170f8d18a6 Add instructions to README.md, include build directories in .gitignore and do a bit more tidying 2025-01-16 20:46:10 +00:00
erinhmclark
e9a7f435a3 Add package dist directory to .gitignore 2025-01-13 13:33:23 +00:00
R. Miles McCain
f603400d0d Add direct Atlos integration (#137)
* Add Atlos feeder
* Add Atlos db
* Add Atlos storage
* Fix Atlos storages
* Fix Atlos feeder
* Only include URLs in Atlos feeder once they're processed
* Remove print
* Add Atlos documentation to README
* Formatting fixes
* Don't archive existing material
* avoid KeyError in atlos_db
* version bump

---------

Co-authored-by: msramalho <19508417+msramalho@users.noreply.github.com>
2024-04-15 19:25:17 +01:00
msramalho
9bd8ea0994 cleanup 2023-01-21 19:44:46 +00:00
msramalho
b763fc4188 final naming cleanup + new feeders/dbs 2023-01-21 19:44:12 +00:00
msramalho
753039240f pyproject 2023-01-21 19:01:02 +00:00
msramalho
b3860cfec1 telethon join channels working 2022-12-14 14:01:39 +00:00
msramalho
6c80a5b82d session file logic 2022-10-18 17:35:59 +01:00
msramalho
93be1af93f adds instagram post/profile 2022-10-18 15:45:10 +01:00
msramalho
57464f1506 refactors for edges in browsertrix and s3 upload, adds timeout parameter 2022-10-17 14:07:31 +01:00
Ed Summers
c34fb9cf10 Add browsertrix profile config option
This commit adds a browsertrix profile option to the configuration. In
order to not require the passing of the browsertrix config to every
Archiver, the Archiver constructors (including the base) were modified to
accept a Storage and Config instance. Some of the constructors then pick
out the pieces they need from the Config, in addition to calling the
parent constructor. In order to avoid a circular import that this
created, the Config object now defines the default hash function to use,
rather than having it be a static property of the Archiver class.
2022-10-11 16:21:42 -04:00
msramalho
6124bc5f72 refactored and simplified obtaining credentials 2022-07-25 14:52:50 +01:00
Dave Mateer
363a8ef67a Added hash_algorithm to config to choose between SHA256 and SHA3_512 2022-07-18 13:15:48 +01:00
msramalho
7ab8d0e825 tmp folder randomly created in folder 2022-06-16 19:58:26 +02:00
msramalho
59afe7fd63 vk-archiver implemented 2022-06-15 16:38:18 +02:00
msramalho
dc60bb1558 json -> yaml 2022-06-14 21:18:18 +02:00
msramalho
f87acb6d1d refactor 2022-06-07 18:41:58 +02:00
msramalho
ea261635a2 cleanup 2022-05-25 10:32:26 +02:00
msramalho
f6bc45361a ignore custom configs 2022-05-10 20:48:40 +02:00
msramalho
b680700b22 ignoring config file 2022-05-03 20:32:23 +02:00
msramalho
0035603bfb telethon-poc 2022-03-15 18:45:53 +01:00
msramalho
3b9b42b854 minor code cleanup 2022-03-15 11:32:39 +01:00
Logan Williams
6a62c5798c Add Twitter non-video archiver 2022-02-25 13:55:43 +01:00
msramalho
3096725a2b Merge branch 'refactor-archivers' of https://github.com/bellingcat/auto-archiver into refactor-archivers 2022-02-23 16:12:47 +01:00
msramalho
f3ce226665 split into multiple files MVP 2022-02-21 14:19:09 +01:00
Logan Williams
51d448f0cb Refactor archivers to make it easier to add support for new types of URLs 2022-02-20 10:36:53 +01:00
Logan Williams
ebafd1a744 Update Pipfile 2021-06-01 09:19:12 +00:00
Logan Williams
a0180ccaaa pep8 format 2021-03-15 10:08:02 +01:00
Logan Williams
2af24085c9 Initial commit 2021-01-15 11:31:58 +01:00