docs: update README, TODO, and deployment docs
README: Added hierarchical routing, classifier bucket mapping, two-level dispatch, model groups table, DeepSeek language note, deploy script, and updated model names to match current models.dev registry. TODO: Added 15 completed items covering model groups, routing, dispatch, and provider fixes from May 7 session. deployment.md: Added deploy.sh instructions.
This commit is contained in:
@@ -15,11 +15,24 @@
|
||||
- [x] Dashboard Analytics & Usage Summary (Fixed SQL robustness)
|
||||
- [x] WebSocket for real-time dashboard updates (Hub with client counting)
|
||||
- [x] Asynchronous Request Logging to SQLite
|
||||
- [x] Update documentation (README, deployment, architecture)
|
||||
- [x] Cost Tracking accuracy (Registry integration with `models.dev`)
|
||||
- [x] Model Listing endpoint (`/v1/models`) with provider filtering
|
||||
- [x] System Metrics endpoint (`/api/system/metrics` using `gopsutil`)
|
||||
- [x] Fixed dashboard 404s and 500s
|
||||
- [x] Model groups with heuristic and classifier routing strategies
|
||||
- [x] Hierarchical routing — groups can target other groups with cycle detection
|
||||
- [x] Classifier bucket mapping via complexity_threshold (1-10 scale -> N targets)
|
||||
- [x] Two-level dispatch — classifier router delegates to tier groups
|
||||
- [x] Model groups exposed in /v1/models endpoint (owned_by: gophergate)
|
||||
- [x] logic_level and primary_use metadata on model groups
|
||||
- [x] Model group CRUD dashboard page
|
||||
- [x] dispatcher, heavy-logic, standard-pro, fast-flow seed groups
|
||||
- [x] Provider selection moved after routing resolution (fixes group routing)
|
||||
- [x] Classifier selector model routed to correct provider (selectProvider)
|
||||
- [x] DeepSeek English system prompt injection (ensureEnglish)
|
||||
- [x] Deploy script (deploy.sh)
|
||||
- [x] Recent Activity pane shows resolved model + group annotation
|
||||
- [x] Model names aligned with models.dev registry
|
||||
|
||||
## Planned Resolutions (High Priority)
|
||||
|
||||
|
||||
Reference in New Issue
Block a user