Speed profiles control timeout, retries, batch size, and delays:
fast: 30s timeout, 7 retries, batch 10, 1s delay (~5x faster)
medium: 60s timeout, 5 retries, batch 5, 2s delay (default)
slow: 300s timeout, 3 retries, batch 5, 5s delay (safe)
Analysis showed 54% of batches hit 300s timeout on Olla bad routes,
wasting 7.7h on 155 batches. Fast mode reduces timeout waste from
300s to 30s per bad route — real embeds take ~18s on average.
Also reduced default batch delay from 5s to 2s in config.yaml.
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
28 persona workspace with document upload, OCR pipeline, and vector embedding
assignment via AnythingLLM API. Supports 5 clusters (intel, cyber, military,
humanities, engineering) with batch processing and resume capability.
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>