From 895c3360c98b613f1aff9975fefc8d88d337affe Mon Sep 17 00:00:00 2001 From: salvacybersec Date: Sun, 5 Apr 2026 14:12:50 +0300 Subject: [PATCH] docs(02-03): complete tier-2 inference platforms plan (first half) - 7 Tier 2 providers created (Groq, Replicate, Anyscale, Together, Fireworks, Baseten, DeepInfra) - PROV-02 marked complete --- .../02-tier-1-2-providers/02-03-SUMMARY.md | 97 +++++++++++++++++++ 1 file changed, 97 insertions(+) create mode 100644 .planning/phases/02-tier-1-2-providers/02-03-SUMMARY.md diff --git a/.planning/phases/02-tier-1-2-providers/02-03-SUMMARY.md b/.planning/phases/02-tier-1-2-providers/02-03-SUMMARY.md new file mode 100644 index 0000000..25badd9 --- /dev/null +++ b/.planning/phases/02-tier-1-2-providers/02-03-SUMMARY.md @@ -0,0 +1,97 @@ +--- +phase: 02-tier-1-2-providers +plan: 03 +subsystem: providers +tags: [providers, tier-2, inference-platforms, yaml] +requires: + - pkg/providers/schema.go + - pkg/providers/loader.go (embed.FS) +provides: + - Groq provider (gsk_ high-confidence) + - Replicate provider (r8_ high-confidence) + - Anyscale Endpoints provider (esecret_ high-confidence) + - Together AI provider (keyword + generic) + - Fireworks AI provider (fw_ + generic) + - Baseten provider (keyword + Api-Key) + - DeepInfra provider (keyword + generic) +affects: + - registry AC automaton (7 new keyword sets) +tech-stack: + added: [] + patterns: + - Dual-location YAML (providers/ + pkg/providers/definitions/) + - RE2 regex patterns only + - Keyword-anchored detection for generic-key providers +key-files: + created: + - providers/groq.yaml + - providers/replicate.yaml + - providers/anyscale.yaml + - providers/together.yaml + - providers/fireworks.yaml + - providers/baseten.yaml + - providers/deepinfra.yaml + - pkg/providers/definitions/groq.yaml + - pkg/providers/definitions/replicate.yaml + - pkg/providers/definitions/anyscale.yaml + - pkg/providers/definitions/together.yaml + - pkg/providers/definitions/fireworks.yaml + - pkg/providers/definitions/baseten.yaml + - pkg/providers/definitions/deepinfra.yaml + modified: [] +decisions: + - Together/Baseten/DeepInfra use low-confidence generic patterns — rely on keyword pre-filter for specificity + - Fireworks has two patterns (fw_ medium + generic low) to catch both historical and current key formats + - Baseten verify uses "Api-Key {KEY}" header (not Bearer) per provider docs +metrics: + duration: ~3min + completed: 2026-04-05 +requirements: [PROV-02] +--- + +# Phase 2 Plan 3: Tier 2 Inference Platforms (First Half) Summary + +Created 7 Tier 2 inference platform provider YAMLs (Groq, Replicate, Anyscale, Together AI, Fireworks AI, Baseten, DeepInfra) covering high-confidence prefix patterns and keyword-anchored fallbacks for generic-key providers, dual-located for Go embed. + +## What Was Built + +**High-confidence prefixed providers (3):** +- **Groq** — `gsk_[a-zA-Z0-9]{52}`, verify via `/openai/v1/models` +- **Replicate** — `r8_[0-9A-Za-z\-_]{37}`, verify via `/v1/predictions` +- **Anyscale Endpoints** — `esecret_[A-Za-z0-9_\-]{20,}`, verify via Anyscale models endpoint + +**Keyword-anchored providers (4):** +- **Together AI** — 64-hex pattern (low confidence), 5 context keywords including both .xyz and .ai hosts +- **Fireworks AI** — dual pattern: `fw_[A-Za-z0-9]{20,}` (medium) + 40+ alphanumeric (low) +- **Baseten** — generic 40+ alphanumeric, uses `Api-Key` auth header instead of Bearer +- **DeepInfra** — generic 32+ alphanumeric with keyword anchoring on `deepinfra`/`api.deepinfra.com` + +All 14 files (7 providers × 2 locations) are byte-identical across the dual locations required by Go embed. + +## Verification + +- `diff providers/$f.yaml pkg/providers/definitions/$f.yaml` — all 7 pairs byte-identical +- `go test ./pkg/providers/... -count=1` — PASS +- `grep -l 'tier: 2' providers/*.yaml | wc -l` — 13 (cumulative with plans 01/02) + +## Deviations from Plan + +None - plan executed exactly as written. + +## Commits + +- `a1f0b2d` — feat(02-03): add Groq, Replicate, Anyscale provider YAMLs +- `5e36f24` — feat(02-03): add Together, Fireworks, Baseten, DeepInfra provider YAMLs + +## Self-Check: PASSED + +- providers/groq.yaml — FOUND +- providers/replicate.yaml — FOUND +- providers/anyscale.yaml — FOUND +- providers/together.yaml — FOUND +- providers/fireworks.yaml — FOUND +- providers/baseten.yaml — FOUND +- providers/deepinfra.yaml — FOUND +- pkg/providers/definitions/{groq,replicate,anyscale,together,fireworks,baseten,deepinfra}.yaml — all FOUND +- Commit a1f0b2d — FOUND +- Commit 5e36f24 — FOUND