4 Commits

Author SHA1 Message Date
salvacybersec
0a07045e17 Add --speed fast/medium/slow profiles for embed operations
Speed profiles control timeout, retries, batch size, and delays:
  fast:   30s timeout, 7 retries, batch 10, 1s delay (~5x faster)
  medium: 60s timeout, 5 retries, batch 5, 2s delay (default)
  slow:   300s timeout, 3 retries, batch 5, 5s delay (safe)

Analysis showed 54% of batches hit 300s timeout on Olla bad routes,
wasting 7.7h on 155 batches. Fast mode reduces timeout waste from
300s to 30s per bad route — real embeds take ~18s on average.

Also reduced default batch delay from 5s to 2s in config.yaml.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
2026-04-07 10:30:50 +03:00
salvacybersec
e54ed045fe Map remaining 4 files: _oneshots + KonferansSlaytlari to Neo
Full library coverage complete — 0 unmapped content folders.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
2026-04-07 00:15:34 +03:00
salvacybersec
24f22b5b6c Expand config coverage from 73 to 86 mapped paths across 39K files
- Add SiyasetVeTeori (1262 files) to Tribune + Sage personas
- Add Marketing (12 files) to Herald persona
- Add 7 regional FOIA-CIA folders to Frodo + Scribe
- Add NATO/FOIA-NATO to Scribe + Warden
- Add MobilGuvenlik to Neo + Specter, KonferansSunumlari to Neo
- Add HHSGuvenlikEgitimi to Bastion
- Expand skip_extensions: add .djvu .mobi .azw3 .json .log .py .js
  .jsx .htm .png .gif .mp3 .flac .gov .org .db

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
2026-04-07 00:13:30 +03:00
salvacybersec
9e9b75e0b3 Initial commit: AnythingLLM persona RAG integration
28 persona workspace with document upload, OCR pipeline, and vector embedding
assignment via AnythingLLM API. Supports 5 clusters (intel, cyber, military,
humanities, engineering) with batch processing and resume capability.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
2026-04-06 23:07:44 +03:00