Commit Graph

46 Commits

Author SHA1 Message Date
msramalho
39f27ec1bc reenable telethon 2022-05-10 20:23:13 +02:00
msramalho
e0276dfab1 additional cleanup 2022-05-09 18:19:38 +02:00
msramalho
0d65798308 wip: configurations and logic 2022-05-09 14:54:48 +02:00
msramalho
f592c7fcfe refactor to use config.py 2022-05-03 20:34:04 +02:00
msramalho
3bdeec1d2f fix deprecation warning for selenium 2022-03-30 11:05:31 +02:00
Logan Williams
398f296789 Fix Selenium driver issues with telegram links 2022-03-18 11:10:27 +01:00
Logan Williams
538bb05395 Merge branch 'main' of github.com:bellingcat/auto-archiver into main 2022-03-18 09:53:29 +01:00
Logan Williams
050b04e31d Add flag for storage privacy 2022-03-18 09:53:21 +01:00
msramalho
0035603bfb telethon-poc 2022-03-15 18:45:53 +01:00
Logan Williams
0304860bce Don't check status for empty URL rows 2022-03-14 11:10:51 +01:00
msramalho
f121c9dab7 enable tolower 2022-03-12 20:14:16 +01:00
msramalho
69483d432c adds logs 2022-03-12 20:04:08 +01:00
msramalho
486c3295b5 log 2022-03-12 19:54:10 +01:00
msramalho
6c5d6f521e implements fresh status retrieval if needed 2022-03-10 19:00:02 +01:00
msramalho
52333874c9 making column names configurable through the command line 2022-03-09 12:38:04 +01:00
msramalho
ff874fe0d3 simplifies access to google sheets, single get_values 2022-03-09 12:17:51 +01:00
msramalho
544e7578a6 removes duplicate code 2022-03-09 11:46:14 +01:00
Logan Williams
aa4b175dea Fix issue with timestamps being convereted to user format 2022-02-28 12:54:58 +01:00
Logan Williams
c6b159905b Switch to headless Firefox 2022-02-28 11:45:32 +01:00
Logan Williams
6ebce974f0 WIP: Make timezones more consistent in UTC 2022-02-28 08:42:59 +01:00
Logan Williams
63a2847ac9 Add header argument; set up webdriver 2022-02-25 16:09:35 +01:00
msramalho
4bbbdcc7fd minor update 2022-02-23 18:30:06 +01:00
msramalho
214d52d36f improved tmp folder management 2022-02-23 16:43:42 +01:00
msramalho
3cafc444fc creates tmp folder if not exists 2022-02-23 16:32:38 +01:00
msramalho
1d62009c4f creates utils module and moves gworkseet there 2022-02-23 16:24:59 +01:00
msramalho
9550cd509e making code more resilient to exceptions 2022-02-23 13:57:11 +01:00
msramalho
644aa0811c todo 2022-02-23 09:57:44 +01:00
msramalho
374852e740 cleanup 2022-02-23 09:57:04 +01:00
msramalho
2d145802b5 extracted worksheet operations 2022-02-23 09:54:03 +01:00
msramalho
e4603a9423 refactoring storage and bringing changes from origin 2022-02-22 16:03:35 +01:00
msramalho
f3ce226665 split into multiple files MVP 2022-02-21 14:19:09 +01:00
Logan Williams
51d448f0cb Refactor archivers to make it easier to add support for new types of URLs 2022-02-20 10:36:53 +01:00
Logan Williams
7e2e9f999d Fix merge conflicts with get_key, remove os module as a function input 2021-10-21 10:01:05 +00:00
Logan Williams
0492eee65e Generate storage keys in reusable way 2021-10-21 09:55:50 +00:00
James Arnall
b639a9b819 Minor refactoring to avoid redundant code 2021-09-08 14:20:23 -07:00
Logan Williams
2097e42df0 Dynamically adjust number of keyframes for contact sheet view. 2021-08-25 11:04:14 +00:00
Logan Williams
e3b400ca4e Add Facebook cookie option 2021-08-24 11:17:29 +02:00
Logan Williams
4472389ae5 Remove case-sensitivity to column titles 2021-08-24 10:43:22 +02:00
Logan Williams
af99f16f71 Use Internet Archive Save Now V2 API 2021-06-03 11:20:06 +02:00
Logan Williams
0b8f55de18 Check to see if Internet Archive request succeeded 2021-06-01 14:35:32 +02:00
Logan Williams
3e1e0e6a5f Add in progress status to Archive requests 2021-06-01 11:35:14 +02:00
Logan Williams
2540b54113 Allow redirects for Archive requets 2021-06-01 11:30:47 +02:00
Logan Williams
cbfc054203 Use service_account.json 2021-06-01 11:05:13 +02:00
Logan Williams
aaf75eb284 Add internet archive fallback 2021-06-01 11:00:40 +02:00
Logan Williams
866c4fa7fd Support (some) Telegram links 2021-05-03 14:16:09 +02:00
Logan Williams
3ba3f3cdf8 Add auto-auto archiver 2021-03-25 13:42:42 +01:00