Commit Graph

19 Commits

Author SHA1 Message Date
salvacybersec
35dbbc71f1 feat(03-01): add 8 Tier 4 Chinese providers (DeepSeek, Zhipu, Moonshot, Qwen, Baidu, ByteDance, 01.AI, MiniMax)
- DeepSeek, Moonshot, Qwen use documented sk- prefix patterns
- Zhipu, Baidu, ByteDance use keyword-only detection (no documented key format)
- All dual-located in providers/ and pkg/providers/definitions/
2026-04-05 14:41:50 +03:00
salvacybersec
469ed0c0dd feat(03-06): add Salesforce, ServiceNow, SAP, Palantir Tier 9 providers
- Keyword-only detection; strong env var anchors
- Dual-located in providers/ and pkg/providers/definitions/
2026-04-05 14:41:42 +03:00
salvacybersec
7ad9588212 feat(03-02): add search and embeddings Tier 3 providers
- Perplexity (pplx- prefix, high confidence)
- You.com (keyword-only)
- Voyage AI (pa- prefix, medium confidence)
- Jina AI (jina_ prefix, high confidence)
- Unstructured.io (keyword-only)
- AssemblyAI (hex32, low confidence)
2026-04-05 14:41:33 +03:00
salvacybersec
a9ee75eb45 feat(03-03): add OpenRouter, LiteLLM, Cloudflare, Vercel, Portkey, Helicone gateways
- sk-or-v1- and sk-helicone- high-confidence prefix regex
- LiteLLM low-confidence sk- pattern with master key keyword
- Cloudflare, Vercel, Portkey keyword-anchored detection
2026-04-05 14:41:30 +03:00
salvacybersec
9f10357f91 feat(03-04): add GitHub Copilot, Cursor, Tabnine, Codeium, Sourcegraph providers
- GitHub Copilot with ghu_/gho_ token patterns
- Sourcegraph Cody with documented sgp_ high-confidence pattern
- Cursor, Tabnine, Codeium as keyword-only (no documented formats)
2026-04-05 14:41:27 +03:00
salvacybersec
58f302b67d test(02-05): add tier1/tier2 provider guardrail test
- TestTier1Count asserts exactly 12 Tier 1 providers loaded
- TestTier2Count asserts exactly 14 Tier 2 providers loaded
- TestAllPatternsCompile verifies every regex compiles under RE2
- TestAllProvidersHaveKeywords guards Aho-Corasick pre-filter
- TestTier1/Tier2ProviderNames lock in expected provider names

Locks Phase 2 coverage against silent regressions in Phase 3+.
Addresses PROV-01, PROV-02.
2026-04-05 14:15:00 +03:00
salvacybersec
d74200b5ef feat(02-01): add Google AI, Vertex AI, AWS Bedrock, xAI providers
- google-ai: AIzaSy pattern for Gemini
- vertex-ai: AIzaSy + Bearer verify on aiplatform endpoint
- aws-bedrock: ABSK long-token and AKIA medium patterns
- xai: xai- 80-char token pattern
- All dual-located in providers/ and pkg/providers/definitions/
2026-04-05 14:12:03 +03:00
salvacybersec
5b5a47d3cc feat(02-04): add SambaNova, OctoAI, Friendli provider YAMLs
- SambaNova with live verify endpoint (api.sambanova.ai/v1/models)
- OctoAI generic-format with keyword anchors
- Friendli with flp_ prefix pattern (medium confidence)
- Dual-located in providers/ and pkg/providers/definitions/
- Completes PROV-02: all 14 Tier 2 providers defined
2026-04-05 14:12:02 +03:00
salvacybersec
5e36f24a4f feat(02-03): add Together, Fireworks, Baseten, DeepInfra provider YAMLs
- Together AI: keyword-anchored, 64-hex generic pattern
- Fireworks AI: fw_ prefix (medium) + generic (low)
- Baseten: keyword + Api-Key header auth
- DeepInfra: keyword-anchored generic pattern
- Dual-located in providers/ and pkg/providers/definitions/
2026-04-05 14:11:59 +03:00
salvacybersec
adad602ec9 feat(02-02): add Mistral, Inflection, AI21 provider YAMLs
- 3 Tier 1 low-confidence providers with keyword anchoring
- Dual-located in providers/ and pkg/providers/definitions/
- Tier 1 total now at 12/12 providers
2026-04-05 14:11:51 +03:00
salvacybersec
622eabed74 feat(02-04): add Lepton, Modal, Cerebrium, Novita provider YAMLs
- Lepton AI generic-format with keyword anchors
- Modal dual token (token_id ak-, token_secret as-) medium confidence
- Cerebrium generic-format with keyword anchors
- NovitaAI with live verify endpoint (api.novita.ai/v3/openai/models)
- Dual-located in providers/ and pkg/providers/definitions/
2026-04-05 14:11:36 +03:00
salvacybersec
a1f0b2dd3e feat(02-03): add Groq, Replicate, Anyscale provider YAMLs
- Groq: gsk_ prefix, 52 chars (high confidence)
- Replicate: r8_ prefix, 37 chars (high confidence)
- Anyscale: esecret_ prefix (high confidence)
- Dual-located in providers/ and pkg/providers/definitions/
2026-04-05 14:11:27 +03:00
salvacybersec
bca842271e feat(02-02): add Azure OpenAI, Meta AI, Cohere provider YAMLs
- 3 Tier 1 medium/low-confidence providers with keyword anchoring
- Dual-located in providers/ and pkg/providers/definitions/
- Registry test passes
2026-04-05 14:11:19 +03:00
salvacybersec
c0d3add7e1 feat(02-01): upgrade OpenAI and Anthropic provider YAMLs
- OpenAI: add sk-svcacct- and legacy T3BlbkFJ patterns
- Anthropic: add api03 AA suffix and sk-ant-admin01- pattern
- Sync both to pkg/providers/definitions/ for go:embed
2026-04-05 14:11:12 +03:00
salvacybersec
1e3f112d79 merge: plan 01-02 provider registry 2026-04-05 00:14:05 +03:00
salvacybersec
a9859b3384 feat(01-02): embed loader, registry with Aho-Corasick, and filled test stubs
- loader.go with go:embed definitions/*.yaml for compile-time embedding
- registry.go with List(), Get(), Stats(), AC() methods
- Aho-Corasick automaton built from all provider keywords at NewRegistry()
- pkg/providers/definitions/ with 3 YAML files for embed
- All 5 provider tests pass: load, get, stats, AC, schema validation
2026-04-05 00:10:56 +03:00
salvacybersec
58259cb9d3 feat(01-01): create main.go, test scaffolding, and testdata fixtures
- main.go entry point (7 lines) delegates to cmd.Execute()
- cmd/root.go stub so go build ./... compiles (Plan 05 replaces)
- pkg/providers, pkg/storage, pkg/engine package stubs
- Test stubs with t.Skip() for providers, storage, engine packages
- testdata/samples: openai_key.txt, anthropic_key.txt, multiple_keys.txt, no_keys.txt
- go build ./... and go test ./... -short both exit 0
2026-04-05 00:04:42 +03:00
salvacybersec
4fcdc42c70 feat(01-02): provider YAML schema structs with validation and reference YAML files
- Provider, Pattern, VerifySpec, RegistryStats structs in schema.go
- UnmarshalYAML validates format_version >= 1 and last_verified non-empty
- Three reference YAML files: openai, anthropic, huggingface
2026-04-05 00:04:29 +03:00
salvacybersec
ebaf7d7c2d test(01-02): add failing tests for provider schema validation and registry 2026-04-05 00:03:55 +03:00