2026-03-12 11:47:20 +00:00
2026-02-23 11:27:12 +00:00
2022-10-31 17:10:55 +00:00
2025-02-12 11:41:54 +00:00
2026-03-02 15:37:55 +00:00
2026-03-02 15:37:55 +00:00
2021-06-24 16:14:32 +02:00
2026-03-10 14:33:25 +00:00
2026-03-10 14:33:25 +00:00

Auto Archiver

Documentation Status PyPI version Docker Image Version (latest by date) Core Test Status

Auto Archiver is a Python tool to automatically archive content on the web in a secure and verifiable way. It takes URLs from different sources (e.g. a CSV file, Google Sheets, command line etc.) and archives the content of each one. It can archive social media posts, videos, images and webpages. Content can be enriched, then saved either locally or remotely (S3 bucket, Google Drive). The status of the archiving process can be appended to a CSV report, or if using Google Sheets back to the original sheet.

Read the article about Auto Archiver on bellingcat.com.

One-Click Cloud Deploy

Deploy your own Auto Archiver instance to the cloud — no coding required:

Deploy on Railway

What you get: A web interface where you can paste URLs and archive them instantly. Optionally connect a Google Sheet for automated monitoring, S3 for cloud storage, and Telegram for archiving channels.

Only required setting: AUTH_PASSWORD — everything else is optional and can be configured later via the Railway dashboard.

📋 Environment variables reference
Variable Required Description
AUTH_PASSWORD Yes Password to access the web interface
GSHEET_URL No Google Sheet URL to monitor for new URLs use this template
GOOGLE_SERVICE_ACCOUNT_JSON No Google service account JSON (required with Sheets) follow these instructions
POLL_INTERVAL No Seconds between Sheet checks (default: 300)
S3_BUCKET No S3 bucket name for archived content, ideal for cloud hosting your archives but not mandatory, any S3-compatible storage works
S3_KEY / S3_SECRET No S3 credentials
S3_REGION No S3 region (default: us-east-1)
S3_ENDPOINT No S3 endpoint URL
TELEGRAM_API_ID / TELEGRAM_API_HASH No Telegram API credentials
TELEGRAM_BOT_TOKEN No Telegram bot token
ENABLE_SCREENSHOTS No Set to true for full-page screenshots
ENABLE_THUMBNAILS No Set to true for video thumbnails
ENABLE_CSV_DB No Set to true for CSV logging
LOG_LEVEL No DEBUG, INFO, WARNING, ERROR (default: INFO)

Traditional Installation

View the Installation Guide for full instructions

Advanced:

To get started quickly using Docker:

docker pull bellingcat/auto-archiver && docker run -it --rm -v secrets:/app/secrets bellingcat/auto-archiver --config secrets/orchestration.yaml

Or pip:

pip install auto-archiver && auto-archiver --help

Contributing

We welcome contributions to the Auto Archiver project! See the Contributing Guide for how to get involved!

Description
Automatically archive links to videos, images, and social media content from Google Sheets (and more).
Readme MIT 16 MiB
Languages
Python 92.9%
TypeScript 3.6%
HTML 2.1%
Shell 0.8%
Dockerfile 0.3%
Other 0.3%