refactors for edges in browsertrix and s3 upload, adds timeout parameter

This commit is contained in:
msramalho
2022-10-17 14:07:31 +01:00
parent dc0ca8bdd6
commit 57464f1506
7 changed files with 30 additions and 18 deletions

View File

@@ -19,7 +19,7 @@ You also need:
4. [fonts-noto](https://fonts.google.com/noto) to deal with multiple unicode characters during selenium/geckodriver's screenshots: `sudo apt install fonts-noto -y`.
5. Internet Archive credentials can be retrieved from https://archive.org/account/s3.php.
6. If you would like to take archival WACZ snapshots using [browsertrix-crawler](https://github.com/webrecorder/browsertrix-crawler)
in addition to screenshots you will need to install Docker.
in addition to screenshots you will need to install [Docker](https://www.docker.com/).
### Configuration file
Configuration is done via a config.yaml file (see [example.config.yaml](example.config.yaml)) and some properties of that file can be overwritten via command line arguments. Here is the current result from running the `python auto_archive.py --help`: