docs(02-04): complete tier-2 inference platforms plan
Adds 02-04-SUMMARY.md; updates STATE.md and ROADMAP.md with execution metrics. Completes PROV-02 (all 14 Tier 2 providers defined).
114
.planning/phases/02-tier-1-2-providers/02-04-SUMMARY.md
Normal file
@@ -0,0 +1,114 @@
---
phase: 02-tier-1-2-providers
plan: 04
subsystem: providers
tags: [providers, tier-2, inference-platforms, yaml]
requires:
- pkg/providers/schema.go
- pkg/providers/loader.go
- pkg/providers/registry.go
provides:
- providers/lepton.yaml
- providers/modal.yaml
- providers/cerebrium.yaml
- providers/novita.yaml
- providers/sambanova.yaml
- providers/octoai.yaml
- providers/friendli.yaml
affects:
- pkg/providers/definitions/*.yaml (embed FS)
tech_stack_added: []
patterns:
- Generic-format providers rely on strong keyword lists for Aho-Corasick pre-filtering
- Dual-location copy (providers/ + pkg/providers/definitions/) maintained
- Medium-confidence prefixed regex for Modal (ak-/as-) and Friendli (flp_)
- Low-confidence fallback regex anchored by keywords for generic providers
key_files_created:
- providers/lepton.yaml
- providers/modal.yaml
- providers/cerebrium.yaml
- providers/novita.yaml
- providers/sambanova.yaml
- providers/octoai.yaml
- providers/friendli.yaml
- pkg/providers/definitions/lepton.yaml
- pkg/providers/definitions/modal.yaml
- pkg/providers/definitions/cerebrium.yaml
- pkg/providers/definitions/novita.yaml
- pkg/providers/definitions/sambanova.yaml
- pkg/providers/definitions/octoai.yaml
- pkg/providers/definitions/friendli.yaml
key_files_modified: []
decisions:
- Modal defined with dual patterns (ak-/as-); token_id and token_secret both needed for auth
- NovitaAI and SambaNova given live verify endpoints (OpenAI-compatible)
- Lepton, Cerebrium, OctoAI, Friendli left with empty verify stubs; no documented public models endpoint at research time
- Friendli promoted to medium confidence on flp_ prefix pattern
metrics:
duration_seconds: 58
tasks_completed: 2
files_created: 14
completed: "2026-04-05"
---

# Phase 02 Plan 04: Tier 2 Inference Platforms (Generic Format) Summary

Final 7 Tier 2 provider YAMLs (Lepton, Modal, Cerebrium, Novita, SambaNova, OctoAI, Friendli) added with keyword-anchored generic-format detection, completing PROV-02 (14/14 Tier 2 providers).

## Objective

Create the remaining 7 Tier 2 inference platform provider YAMLs that use generic-format key layouts (no distinctive prefix). Each requires a strong keyword list for Aho-Corasick pre-filtering so the low-confidence regex patterns do not fire indiscriminately.

## Tasks Completed

| Task | Name                                   | Commit  | Files |
| ---- | -------------------------------------- | ------- | ----- |
| 1    | Lepton, Modal, Cerebrium, Novita YAMLs | 622eabe | 8     |
| 2    | SambaNova, OctoAI, Friendli YAMLs      | 5b5a47d | 6     |

## Implementation Notes

- **Modal** is the only dual-token provider in this batch: `MODAL_TOKEN_ID` (`ak-*`) and `MODAL_TOKEN_SECRET` (`as-*`) are both required for authentication. Both are emitted as medium-confidence patterns.
- **Friendli** uses the `flp_` prefix, which gives a medium-confidence primary pattern, plus a low-confidence fallback for older token formats.
- **NovitaAI** and **SambaNova** expose OpenAI-compatible `/v1/models` endpoints and therefore have live verify specs ready for the Phase 5 verification engine.
- **Lepton**, **Cerebrium**, **OctoAI**, and **Friendli** ship with empty verify stubs because their verification endpoints were not resolved during research; they will be filled in during Phase 5. **Modal** also ships with an empty verify stub, since its auth needs the token_id + token_secret pair (see Known Stubs).
- All 7 providers maintain tier: 2 and use lowercase keyword lists (lepton.ai, novita.ai, sambanova.ai, etc.) for AC matching.
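
As a rough illustration of the prefixed patterns in these notes, the sketch below pairs Modal's two tokens and matches a Friendli token. Only the `ak-`, `as-`, and `flp_` prefixes come from the notes; the character classes, minimum lengths, and the `FindModalPair` helper are assumptions for the example, not the regexes shipped in the YAML files.

```go
package main

import (
	"fmt"
	"regexp"
)

// Assumed token shapes: the prefixes are from the notes above, while the
// character classes and minimum lengths are illustrative guesses.
var (
	modalTokenID     = regexp.MustCompile(`\bak-[A-Za-z0-9]{10,}\b`)
	modalTokenSecret = regexp.MustCompile(`\bas-[A-Za-z0-9]{10,}\b`)
	friendliToken    = regexp.MustCompile(`\bflp_[A-Za-z0-9]{10,}\b`)
)

// ModalPair holds a token ID and a token secret found in the same input.
type ModalPair struct {
	ID, Secret string
}

// FindModalPair returns a pair only when both halves are present,
// because neither token authenticates on its own.
func FindModalPair(input string) (ModalPair, bool) {
	id := modalTokenID.FindString(input)
	secret := modalTokenSecret.FindString(input)
	if id == "" || secret == "" {
		return ModalPair{}, false
	}
	return ModalPair{ID: id, Secret: secret}, true
}

func main() {
	env := "MODAL_TOKEN_ID=ak-AbCdEf123456\nMODAL_TOKEN_SECRET=as-ZyXwVu654321"
	pair, ok := FindModalPair(env)
	fmt.Println(pair.ID, pair.Secret, ok)

	fmt.Println(friendliToken.FindString("FRIENDLI_TOKEN=flp_abcDEF1234567"))
}
```

Reporting the two Modal tokens as a pair is one possible design; a scanner could equally report each half as its own medium-confidence finding and leave pairing to the verification step.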

## Verification Results

- `go test ./pkg/providers/... -count=1`: PASS (after task 1 and task 2)
- Dual-location diffs empty for all 7 providers
- Schema validation (UnmarshalYAML) passes for all files (format_version=1, last_verified set, valid confidences)
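
The schema checks named in the last item could look roughly like the sketch below. It is not the contents of `pkg/providers/schema.go`: the `Definition` and `Pattern` types, the field names, and the `high` confidence level are assumptions, and YAML decoding is left out so that the example stays within the standard library.

```go
package main

import (
	"errors"
	"fmt"
)

// Pattern and Definition are hypothetical stand-ins for the real schema types.
type Pattern struct {
	Regex      string
	Confidence string
}

type Definition struct {
	Name          string
	FormatVersion int
	LastVerified  string
	Tier          int
	Patterns      []Pattern
}

// validConfidence lists the accepted levels. "low" and "medium" appear in
// this summary; "high" is assumed.
var validConfidence = map[string]bool{"low": true, "medium": true, "high": true}

// Validate applies the three checks from the verification results:
// format_version is 1, last_verified is set, and every confidence is valid.
func (d Definition) Validate() error {
	if d.FormatVersion != 1 {
		return fmt.Errorf("%s: unsupported format_version %d", d.Name, d.FormatVersion)
	}
	if d.LastVerified == "" {
		return errors.New(d.Name + ": last_verified must be set")
	}
	for _, p := range d.Patterns {
		if !validConfidence[p.Confidence] {
			return fmt.Errorf("%s: invalid confidence %q", d.Name, p.Confidence)
		}
	}
	return nil
}

func main() {
	d := Definition{
		Name:          "friendli",
		FormatVersion: 1,
		LastVerified:  "2026-04-05",
		Tier:          2,
		Patterns:      []Pattern{{Regex: `flp_[A-Za-z0-9]+`, Confidence: "medium"}},
	}
	fmt.Println(d.Validate())

	d.Patterns[0].Confidence = "certain"
	fmt.Println(d.Validate())
}
```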

## Deviations from Plan

None - plan executed exactly as written.

## Acceptance Criteria Status

- [x] 7 generic-format Tier 2 providers created
- [x] Dual-location verified (14 files, 7 in each location)
- [x] Registry loads cleanly (go test passes)
- [x] `grep -q 'MODAL_TOKEN_ID' providers/modal.yaml`
- [x] `grep -q 'cerebrium.ai' providers/cerebrium.yaml`
- [x] `grep -q 'api.novita.ai' providers/novita.yaml`
- [x] `grep -q 'lepton.ai' providers/lepton.yaml`
- [x] `grep -q 'sambanova' providers/sambanova.yaml`
- [x] `grep -q 'octo.ai' providers/octoai.yaml`
- [x] `grep -q 'FRIENDLI_TOKEN' providers/friendli.yaml`

## Known Stubs

The following providers have empty verify specs (method/url/status lists empty) and will require Phase 5 follow-up once verification endpoints are researched:

- providers/lepton.yaml (verify.url empty)
- providers/cerebrium.yaml (verify.url empty)
- providers/octoai.yaml (verify.url empty)
- providers/friendli.yaml (verify.url empty)
- providers/modal.yaml (verify.url empty; Modal auth is non-trivial and uses a token_id + token_secret pair)

These are intentional per plan instructions (action block explicitly uses empty verify stubs for providers without documented public endpoints). Verification will be completed in Phase 5 (verification engine).

## Self-Check: PASSED

All 14 provider YAML files verified on disk. Both task commits (622eabe, 5b5a47d) present in git log.