docs(01-04): complete scan engine plan

- SUMMARY.md with pipeline implementation details
- STATE.md updated with progress and decisions
- ROADMAP.md and REQUIREMENTS.md updated
salvacybersec
2026-04-05 12:22:49 +03:00
parent cea2e371cc
commit d0396bb384
4 changed files with 160 additions and 11 deletions

@@ -9,11 +9,11 @@ Requirements for initial release. Each maps to roadmap phases.
 ### Core Engine
-- [ ] **CORE-01**: Scanner engine detects API keys using keyword pre-filtering + regex matching pipeline
+- [x] **CORE-01**: Scanner engine detects API keys using keyword pre-filtering + regex matching pipeline
 - [x] **CORE-02**: Provider definitions loaded from YAML files embedded at compile time via Go embed
 - [x] **CORE-03**: Provider registry manages 108+ provider definitions with pattern, keyword, confidence, and verify metadata
-- [ ] **CORE-04**: Entropy analysis as secondary signal for low-confidence providers (generic key formats)
-- [ ] **CORE-05**: Worker pool parallelism with configurable worker count (default: CPU count)
+- [x] **CORE-04**: Entropy analysis as secondary signal for low-confidence providers (generic key formats)
+- [x] **CORE-05**: Worker pool parallelism with configurable worker count (default: CPU count)
 - [x] **CORE-06**: Aho-Corasick keyword pre-filter runs before regex for 10x performance on large files
 - [ ] **CORE-07**: mmap-based large file reading for memory efficiency