Erin Clark 4c21795d5f Merge pull request #226 from bellingcat/merge_modules
Merge modules with multi-functionality:
- gsheet_feeder and gsheet_db are now one module, gsheet_feeder_db
- atlos_feeder, atlos_db and atlos_storage are now one module, atlos_feeder_db_storage.

This pull request also adds documentation and updates references.
2025-03-07 16:47:30 +00:00

Auto Archiver


Auto Archiver is a Python tool that automatically archives web content in a secure and verifiable way. It takes URLs from different sources (e.g. a CSV file, Google Sheets, the command line) and archives the content of each one. It can archive social media posts, videos, images and webpages. Content can be enriched, then saved either locally or remotely (S3 bucket, Google Drive). The status of the archiving process can be appended to a CSV report or, if using Google Sheets, written back to the original sheet.
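
The feed, archive and report flow described above can be pictured with a small sketch. This is not Auto Archiver's API: `read_urls`, `run_pipeline`, `write_report` and `fake_archive` are hypothetical names made up for illustration, and the "archiving" step is a stand-in that touches neither the network nor the disk.

```python
import csv
import io
from typing import Callable

# Hypothetical sketch only: none of these names come from Auto Archiver.


def read_urls(csv_text: str, column: str = "url") -> list[str]:
    """Feeder step: collect the non-empty values of one CSV column."""
    reader = csv.DictReader(io.StringIO(csv_text))
    return [
        (row.get(column) or "").strip()
        for row in reader
        if (row.get(column) or "").strip()
    ]


def run_pipeline(urls: list[str], archive: Callable[[str], str]) -> list[dict]:
    """Archive each URL and record a status row, even when archiving fails."""
    results = []
    for url in urls:
        try:
            location = archive(url)
            results.append({"url": url, "status": "archived", "location": location})
        except Exception as exc:
            results.append({"url": url, "status": f"failed: {exc}", "location": ""})
    return results


def write_report(results: list[dict]) -> str:
    """Report step: serialise the status rows as CSV text."""
    out = io.StringIO()
    writer = csv.DictWriter(
        out, fieldnames=["url", "status", "location"], lineterminator="\n"
    )
    writer.writeheader()
    writer.writerows(results)
    return out.getvalue()


def fake_archive(url: str) -> str:
    """Stand-in archiver: returns a made-up storage path, downloads nothing."""
    if not url.startswith(("http://", "https://")):
        raise ValueError("unsupported URL")
    return "local_archive/" + url.split("://", 1)[1].replace("/", "_")


sample = "url\nhttps://example.com/post/1\nnot-a-url\n"
results = run_pipeline(read_urls(sample), fake_archive)
report = write_report(results)
print(report)
```

Recording a status row for every URL, including failures, mirrors the idea of appending the outcome to a report rather than stopping at the first error. For real usage, refer to the Installation Guide and the tool's own configuration.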

Read the article about Auto Archiver on bellingcat.com.

Installation

View the Installation Guide for full instructions.

Advanced:

To get started quickly using Docker:

docker pull bellingcat/auto-archiver && docker run --rm -v secrets:/app/secrets bellingcat/auto-archiver --config secrets/orchestration.yaml

Or pip:

pip install auto-archiver && auto-archiver --help

Contributing

We welcome contributions to the Auto Archiver project! See the Contributing Guide for how to get involved!

Description
Automatically archive links to videos, images, and social media content from Google Sheets (and more).
Readme MIT 16 MiB