Commit Graph

36 Commits

Author | SHA1 | Message | Date
msramalho | 94b37b02ba | telethon refactor for failures | 2022-05-10 20:23:44 +02:00
msramalho | 6bd6f88b46 | refactor | 2022-05-09 17:45:54 +02:00
msramalho | 0d65798308 | wip: configurations and logic | 2022-05-09 14:54:48 +02:00
msramalho | f00e31c23d | introduce config.py | 2022-05-03 20:33:54 +02:00
msramalho | ac9ed1a0d7 | extract wayback config | 2022-05-03 20:33:38 +02:00
msramalho | a7948ac768 | extract telegram config | 2022-05-03 20:33:19 +02:00
Dave Mateer | fec380e93d | Fixed wwww (4 w's) to www in youtubedl | 2022-04-27 10:18:10 +01:00
msramalho | 8358ab0bfc | assert post is not None | 2022-03-30 11:12:06 +02:00
msramalho | 3bdeec1d2f | fix deprecation warning for selenium | 2022-03-30 11:05:31 +02:00
msramalho | e5168fa07c | removing TODO | 2022-03-30 10:55:57 +02:00
msramalho | 576f1a8f68 | fix the UTF-8 issue for cyrilic | 2022-03-30 10:55:33 +02:00
Logan Williams | 398f296789 | Fix Selenium driver issues with telegram links | 2022-03-18 11:10:27 +01:00
Logan Williams | 538bb05395 | Merge branch 'main' of github.com:bellingcat/auto-archiver into main | 2022-03-18 09:53:29 +01:00
Logan Williams | d611aa1e14 | Some videos don't render a duration for some reason | 2022-03-18 09:44:17 +01:00
msramalho | 450065b6fb | removes print | 2022-03-16 19:56:18 +01:00
msramalho | 516db483d6 | telethon archiver working for 0,1,1+ media objects | 2022-03-16 19:51:02 +01:00
msramalho | c2ae382a4e | isloates html page generation logic so it can be reused | 2022-03-16 19:50:44 +01:00
msramalho | 0035603bfb | telethon-poc | 2022-03-15 18:45:53 +01:00
msramalho | 3b9b42b854 | minor code cleanup | 2022-03-15 11:32:39 +01:00
msramalho | 077c71f941 | fixes index out fo range bug | 2022-03-09 12:18:06 +01:00
Logan Williams | 82ca6792c4 | Fix issue with extracting time from Telegram media posts | 2022-03-02 14:45:36 +01:00
Logan Williams | aa4b175dea | Fix issue with timestamps being convereted to user format | 2022-02-28 12:54:58 +01:00
Logan Williams | 6ebce974f0 | WIP: Make timezones more consistent in UTC | 2022-02-28 08:42:59 +01:00
Logan Williams | 2d50703489 | Generate archivers for Telegram posts with images; move generation to function in base_archiver | 2022-02-28 08:41:45 +01:00
Logan Williams | 63a2847ac9 | Add header argument; set up webdriver | 2022-02-25 16:09:35 +01:00
Logan Williams | 09dc5b5b81 | Fix issue with query parameters by using urllib | 2022-02-25 15:29:56 +01:00
Logan Williams | 6a62c5798c | Add Twitter non-video archiver | 2022-02-25 13:55:43 +01:00
Logan Williams | 1eb17e4de5 | Add hash and screenshot methods; switch to more recent ytdl fork | 2022-02-25 13:54:40 +01:00
msramalho | 8bce84082a | minor updates | 2022-02-23 18:32:40 +01:00
msramalho | 214d52d36f | improved tmp folder management | 2022-02-23 16:43:42 +01:00
msramalho | 3cafc444fc | creates tmp folder if not exists | 2022-02-23 16:32:38 +01:00
msramalho | 9a264a7dfe | cleanup and docs | 2022-02-23 16:07:58 +01:00
msramalho | 9550cd509e | making code more resilient to exceptions | 2022-02-23 13:57:11 +01:00
msramalho | 2d145802b5 | extracted worksheet operations | 2022-02-23 09:54:03 +01:00
msramalho | e4603a9423 | refactoring storage and bringing changes from origin | 2022-02-22 16:03:35 +01:00
msramalho | f3ce226665 | split into multiple files MVP | 2022-02-21 14:19:09 +01:00