Commit Graph

  • ceefd87cf2 path msramalho 2023-02-07 23:00:00 +00:00
  • 67037ab291 Bump version to v0.2.6 for release v0.2.6 msramalho 2023-02-07 22:56:56 +00:00
  • 7061ddcf62 toml msramalho 2023-02-07 22:56:43 +00:00
  • d205846d1d Bump version to v0.2.5 for release v0.2.5 msramalho 2023-02-07 22:54:07 +00:00
  • 92089f5f1f simplify msramalho 2023-02-07 22:53:57 +00:00
  • c198257e23 Bump version to v0.2.4 for release v0.2.4 msramalho 2023-02-07 22:33:01 +00:00
  • 3913528275 pipenv msramalho 2023-02-07 22:32:50 +00:00
  • 5c9ca9da1d Bump version to v0.2.3 for release v0.2.3 msramalho 2023-02-07 22:28:53 +00:00
  • 277d15687c setup.py 3.10 msramalho 2023-02-07 22:28:44 +00:00
  • 3acb1b5f64 Bump version to v0.2.2 for release v0.2.2 msramalho 2023-02-07 22:20:53 +00:00
  • c570eaa64b pipenv workflow msramalho 2023-02-07 22:20:44 +00:00
  • 217ec40921 Bump version to v0.2.1 for release v0.2.1 msramalho 2023-02-07 22:15:43 +00:00
  • a5995e2262 pypi workflow msramalho 2023-02-07 22:15:27 +00:00
  • 9b4a41e654 Bump version to v0.2.0 for release v0.2.0 msramalho 2023-02-07 22:07:23 +00:00
  • 29680b0be5 gsheet_db bug fix on missing thumbnail msramalho 2023-02-07 21:59:41 +00:00
  • 51a3134065 adds gd_drive storage msramalho 2023-02-07 21:59:24 +00:00
  • 32a8db1223 disable bot_token msramalho 2023-02-02 14:01:08 +00:00
  • 4854929a1d thumbnail and bot token msramalho 2023-02-02 13:49:56 +00:00
  • e758bd076b test msramalho 2023-02-02 12:43:23 +00:00
  • 9bcca427a0 wacz in gsheets msramalho 2023-02-02 12:41:06 +00:00
  • 77a8c290f7 logs msramalho 2023-02-02 12:24:04 +00:00
  • 2f7b6dfc44 revert msramalho 2023-02-02 12:23:43 +00:00
  • ab4bce6602 test msramalho 2023-02-02 12:20:30 +00:00
  • 8b8845d607 bot_token msramalho 2023-02-02 12:15:57 +00:00
  • 80b4f207d9 logs msramalho 2023-02-02 12:11:46 +00:00
  • 9159f0abd5 logs msramalho 2023-02-02 12:05:23 +00:00
  • cf4be2f339 logs msramalho 2023-02-02 11:59:53 +00:00
  • d8a79b930b improve logs msramalho 2023-02-02 11:55:22 +00:00
  • 11eda6d03e staticmethod fix msramalho 2023-02-02 11:26:00 +00:00
  • 5b0593ce82 arg parse fix msramalho 2023-02-02 11:00:24 +00:00
  • 39bfde2026 thumbnails bug fix msramalho 2023-02-01 00:35:48 +00:00
  • d1e4dde3f6 fixing imports msramalho 2023-01-27 00:19:58 +00:00
  • ac000d5943 cleanup msramalho 2023-01-27 00:03:30 +00:00
  • f5b7c3a5ea mute formatter and docker msramalho 2023-01-26 23:38:58 +00:00
  • c261361ac8 try/catch enrichers msramalho 2023-01-26 23:03:51 +00:00
  • 2508bb8a1b cleanup + rearchivable logic msramalho 2023-01-26 23:01:34 +00:00
  • 9dd8afed8c minor improvements msramalho 2023-01-22 23:15:54 +00:00
  • 092ffdb6d8 replaywebpage msramalho 2023-01-22 00:48:09 +00:00
  • 746f6a333e further cleanup msramalho 2023-01-21 19:57:54 +00:00
  • 9bd8ea0994 cleanup msramalho 2023-01-21 19:44:46 +00:00
  • b763fc4188 final naming cleanup + new feeders/dbs msramalho 2023-01-21 19:44:12 +00:00
  • 753039240f pyproject msramalho 2023-01-21 19:01:02 +00:00
  • ea2c266fa2 clean up and wacz WIP msramalho 2023-01-19 00:27:11 +00:00
  • 9bbc13e9be vk and yt-dlp msramalho 2023-01-18 23:15:25 +00:00
  • 176ce7e8da vk cleanup msramalho 2023-01-18 21:37:29 +00:00
  • eb0859fbaf vk archiver msramalho 2023-01-18 21:34:40 +00:00
  • 085376f63f telegram archiver msramalho 2023-01-18 21:14:20 +00:00
  • 63d1abbe4b tiktok archiver though info is no longer working msramalho 2023-01-18 16:56:35 +00:00
  • 1def8bb03d instagram archiver msramalho 2023-01-18 16:16:23 +00:00
  • 725bab8240 twitter archivers msramalho 2023-01-18 00:15:18 +00:00
  • f1bc83818d template updates msramalho 2023-01-17 17:01:25 +00:00
  • 47dc788143 thumbnails enricher msramalho 2023-01-17 16:29:27 +00:00
  • 74e50eccf1 hash enricher and media refactor msramalho 2023-01-13 02:12:08 +00:00
  • 6ca46417fe local storage + multiple storage support msramalho 2023-01-12 02:09:39 +00:00
  • 0cb593fd21 wayback enricher ready msramalho 2023-01-11 00:03:47 +00:00
  • d4825196f1 html template working with jinja templates msramalho 2023-01-10 00:22:16 +00:00
  • aac16fa8c2 minor comments msramalho 2023-01-09 22:24:44 +00:00
  • 1cdc006b27 s3 storaging + WIP gsheets DB msramalho 2023-01-04 18:02:44 +00:00
  • bb512b36c9 gsheet feeder + db WIP msramalho 2023-01-04 16:37:36 +00:00
  • 96845305a3 media concept implemented msramalho 2022-12-14 19:01:20 +00:00
  • 9c056d001c merge logic started msramalho 2022-12-14 16:11:06 +00:00
  • 53ffa2d4ae telethon_archiver working for multiple media msramalho 2022-12-14 15:37:34 +00:00
  • b3860cfec1 telethon join channels working msramalho 2022-12-14 14:01:39 +00:00
  • 955891a411 WIP feeder msramalho 2022-12-10 12:03:46 +00:00
  • 9dc709d3b9 demo feeder logic working msramalho 2022-11-24 15:44:25 +00:00
  • 618e7ed0a3 subproperties in config msramalho 2022-11-24 11:53:21 +00:00
  • 65dd155c90 WIP refactor logic msramalho 2022-11-15 15:00:52 +00:00
  • 6a0ce5ced1 orchestrator design structure msramalho 2022-11-11 02:08:48 +00:00
  • 04263094ad WIP docker changes for cli and auto_archiver msramalho 2022-11-10 17:46:40 +00:00
  • 390b84eb22 dockerization complete msramalho 2022-11-08 15:55:33 +00:00
  • 81eadd4672 disable browsertrix on docker, see #66 msramalho 2022-11-08 14:22:13 +00:00
  • a8f7055696 reduces uncontrolled exceptions msramalho 2022-11-08 13:59:59 +00:00
  • 09f47383a3 dockerfile improvements msramalho 2022-11-08 13:59:35 +00:00
  • 629cd586db adds session_file for missing archivers msramalho 2022-11-08 13:59:09 +00:00
  • 889eb1d270 Merge branch 'dev' into dockerize msramalho 2022-11-02 17:01:00 +00:00
  • 50e03ba565 closes #65 with simpler solution msramalho 2022-11-02 16:59:44 +00:00
  • a9df992f66 WiP msramalho 2022-11-02 16:51:32 +00:00
  • c8fa077df7 docker initial files msramalho 2022-10-31 17:10:55 +00:00
  • 29e1872e87 fix: rm stopped containers only msramalho 2022-10-31 10:41:27 +00:00
  • 7a700acd8e hotfix for #65 msramalho 2022-10-31 10:35:01 +00:00
  • 22363cb8b9 adds information on browsertrix usage msramalho 2022-10-20 11:59:23 +01:00
  • ac4f1b6132 readme updates msramalho 2022-10-19 11:37:04 +01:00
  • 4d2b7b4040 reverse order of login attempts msramalho 2022-10-19 11:27:17 +01:00
  • 54c572258c fix tty msramalho 2022-10-18 17:46:40 +01:00
  • 6c80a5b82d session file logic msramalho 2022-10-18 17:35:59 +01:00
  • 63f53358d3 adds traceback msramalho 2022-10-18 16:38:12 +01:00
  • 3f121d800e catch bad instagram login msramalho 2022-10-18 16:36:27 +01:00
  • 93be1af93f adds instagram post/profile msramalho 2022-10-18 15:45:10 +01:00
  • f0f844a569 improves browsertrix configurations msramalho 2022-10-18 11:21:10 +01:00
  • df502f3bde updates yt-dlp msramalho 2022-10-18 11:20:53 +01:00
  • 26903190fd adds wacz link msramalho 2022-10-17 14:41:34 +01:00
  • 683f2d7500 Merge pull request #64 from bellingcat/dev Miguel Sozinho Ramalho 2022-10-17 14:40:15 +01:00
  • 23a4dc20c5 Merge pull request #63 from edsu/browsertrix-crawler Miguel Sozinho Ramalho 2022-10-17 14:39:34 +01:00
  • 57464f1506 refactors for edges in browsertrix and s3 upload, adds timeout parameter msramalho 2022-10-17 14:07:31 +01:00
  • dc0ca8bdd6 adds browsertrix to all archivers flows msramalho 2022-10-17 14:06:50 +01:00
  • 20ca50dc90 Clean up browsertrix-crawler files Ed Summers 2022-10-11 16:49:19 -04:00
  • c34fb9cf10 Add browsertrix profile config option Ed Summers 2022-10-11 16:14:25 -04:00
  • 82fcf74450 Merge pull request #62 from bellingcat/main Miguel Sozinho Ramalho 2022-10-06 08:24:51 +01:00
  • 3b87dffe6b Add browsertrix-crawler capture Ed Summers 2022-09-25 19:40:20 +00:00
  • 0bdd06f641 Update README.md Miguel Sozinho Ramalho 2022-09-22 15:58:41 +02:00