GopherGate

Author	SHA1	Message	Date
hobokenchicken	aeffeb8c03	fix: remove tool call ID truncation and improve DeepSeek reasoning handling CI / Lint (push) Has been cancelled Details CI / Test (push) Has been cancelled Details CI / Build (push) Has been cancelled Details The 40-character truncation of tool call IDs in helper.go caused collisions when models (like deepseek-v4-flash) generated longer IDs, leading to "Duplicate value for 'tool_call_id'" errors. Removed the limit to allow full unique IDs. DeepSeek: updated reasoning_content injection to use an empty string instead of a space, better matching provider expectations for history. Improved API error reporting across all providers by capturing raw body content when response parsing fails or returns empty strings.	2026-05-11 03:12:38 +00:00
hobokenchicken	28b8271c1d	fix: inject English system prompt for DeepSeek provider CI / Lint (push) Has been cancelled Details CI / Test (push) Has been cancelled Details CI / Build (push) Has been cancelled Details DeepSeek models default to Chinese for some prompts. The ensureEnglish() function prepends 'Always respond in English' as a system message when no system prompt is already set. Applied to both ChatCompletion and ChatCompletionStream paths.	2026-05-07 14:03:39 -04:00
hobokenchicken	e5ef39f327	feat: add OpenAI Responses API support (POST /v1/responses) CI / Lint (push) Has been cancelled Details CI / Test (push) Has been cancelled Details CI / Build (push) Has been cancelled Details Add full Responses API endpoint alongside existing Chat Completions, with identical logging/tracking/cost pipeline. New: - internal/models/responses.go — request/response/stream types + ToUsage() bridge - internal/providers/openai_responses.go — OpenAI Responses/ResponsesStream Modified: - provider.go — Responses()+ResponsesStream() added to Provider interface - helpers.go — BuildOpenAIResponsesBody, parsers, SSE stream reader - circuit_breaker.go — CB wraps Responses, passthrough for stream - server.go — POST /v1/responses route + handleResponses handler - all non-OpenAI providers — stub methods with clear error messages Logging: ResponsesUsage.ToUsage() bridges to models.Usage, feeding same logRequest() -> DB insert -> dashboard WS -> client stats -> cost calc pipeline. No schema or logger changes needed.	2026-05-02 16:38:17 -04:00
hobokenchicken	eb67287b56	fix: raise provider HTTP timeouts from 30s to 10min CI / Lint (push) Has been cancelled Details CI / Test (push) Has been cancelled Details CI / Build (push) Has been cancelled Details 30-second resty client timeout was killing long streaming responses mid-generation. Models with large output windows (e.g. deepseek-v4-pro at 384K max_tokens) routinely exceed 30s. Raised all providers to 10 minutes (Ollama already at 15min, unchanged). Circuit breaker recovery timeout raised from 30s to 5min.	2026-04-30 10:17:45 -04:00
hobokenchicken	5ee539d95c	feat: add image generation for OpenAI DALL-E and Gemini Imagen CI / Lint (push) Has been cancelled Details CI / Test (push) Has been cancelled Details CI / Build (push) Has been cancelled Details New `/v1/images/generations` endpoint proxies DALL-E 2/3 (OpenAI) and Imagen 3 (Gemini). Same auth/logging as chat completions. - Add ImageGenerationRequest/Response models - Extend Provider interface with ImageGeneration() - OpenAI: forward to /v1/images/generations - Gemini: call /v1beta/models/{model}:predict, map OpenAI params - Circuit breaker wraps image gen like chat completions - Model routing: dall-e* -> openai, imagen/gemini -> gemini - Unsupported providers (deepseek/moonshot/grok/ollama) return error - Fix pre-existing CachedContentTokenCount bug in StreamGemini	2026-04-27 10:06:07 -04:00
hobokenchicken	14e26a4323	feat: capture Gemini cached content tokens in cost tracking CI / Lint (push) Has been cancelled Details CI / Test (push) Has been cancelled Details CI / Build (push) Has been cancelled Details - Add CachedContentTokenCount to UsageMetadata parsing for both streaming (helpers.go) and non-streaming (gemini.go) requests - CacheReadTokens now populated from Gemini cachedContentTokenCount - Add uint32Ptr helper for nil-safe uint32 pointer creation	2026-04-26 21:14:53 -04:00
hobokenchicken	db76858072	fix: import block syntax in split dashboard files CI / Lint (push) Has been cancelled Details CI / Test (push) Has been cancelled Details CI / Build (push) Has been cancelled Details - Add missing closing ) in clients.go, providers_admin.go, users.go, system.go - Add SetTimeout(30s) to OpenAI provider (was resty.New() with no timeout)	2026-04-26 14:55:29 -04:00
hobokenchicken	af2c5b95f7	feat: Phase 3 - architecture & maintainability CI / Lint (push) Has been cancelled Details CI / Test (push) Has been cancelled Details CI / Build (push) Has been cancelled Details - Split 1474-line dashboard.go into 5 domain files (clients, providers, users, system) - Unit tests for ModelRegistry.FindModel and CalculateCost - go mod tidy + verify (deps clean) - .gitignore excludes tool cache dirs (.pi-lens/, .opencode/)	2026-04-26 14:52:10 -04:00
hobokenchicken	1f574d8134	feat: Phase 2 - reliability & observability CI / Lint (push) Has been cancelled Details CI / Test (push) Has been cancelled Details CI / Build (push) Has been cancelled Details - Circuit breaker: proper thresholds (3 failures, 30s timeout) - HTTP timeouts: 30s on all providers (was no timeout) - Structured logging: slog replaces fmt.Printf throughout - Stream errors: propagated as SSE error events to client - Registry fetch: retry with backoff (3 attempts) - Registry reads in dashboard protected by RWMutex	2026-04-26 14:48:56 -04:00
hobokenchicken	8a8d8d1477	fix: Phase 1 - security & stability patches CI / Lint (push) Has been cancelled Details CI / Test (push) Has been cancelled Details CI / Build (push) Has been cancelled Details - AuthMiddleware now requires auth on /v1/* routes (returns 401) - WebSocket origin check configurable via WSAllowedOrigin - Removed debug fmt.Printf leaks (config, ollama, server) - Registry access protected by sync.RWMutex (race condition fix) - Session cleanup goroutine runs every 15 min - RevokeSession returns error instead of silent no-op	2026-04-26 14:45:22 -04:00
hobokenchicken	9b0aa4dbe8	fix: remove unused fmt import in circuit breaker provider CI / Lint (push) Has been cancelled Details CI / Test (push) Has been cancelled Details CI / Build (push) Has been cancelled Details	2026-04-09 12:19:14 -04:00
hobokenchicken	212ac14a1b	feat: implement circuit breaker, fix auth vulnerability CI / Lint (push) Has been cancelled Details CI / Test (push) Has been cancelled Details CI / Build (push) Has been cancelled Details	2026-04-09 12:17:18 -04:00
hobokenchicken	e12418cc4c	fix(gemini): ensure strict 1:1 pairing of model calls and function responses CI / Lint (push) Has been cancelled Details CI / Test (push) Has been cancelled Details CI / Build (push) Has been cancelled Details - Gemini requires function results to immediately follow the model message that called them - Implemented look-ahead grouping to pair assistant calls with their tool results - Standardized system and orphaned tool message handling for Gemini compatibility	2026-04-07 18:57:13 +00:00
hobokenchicken	be4ec3482a	fix(gemini): group adjacent tool messages and ensure correct role sequence CI / Lint (push) Has been cancelled Details CI / Test (push) Has been cancelled Details CI / Build (push) Has been cancelled Details - Group consecutive 'tool' messages into a single Gemini content message with multiple 'functionResponse' parts - Ensure assistant tool calls are properly mapped and sent - Maintain v1beta for preview and newer models - Added debug logging for API errors	2026-04-07 18:50:48 +00:00
hobokenchicken	e67aafdac1	fix(gemini): improve tool-calling support and handle function_call response CI / Lint (push) Has been cancelled Details CI / Test (push) Has been cancelled Details CI / Build (push) Has been cancelled Details - Support tool definitions in Gemini requests - Map tool role to 'function' in Gemini content - Ensure tool results are wrapped in JSON objects for Gemini compatibility - Parse FunctionCall from Gemini response and map to OpenAI-compatible ToolCalls - Correctly map finish_reason for tool calls	2026-04-07 18:37:57 +00:00
hobokenchicken	21e5204abd	fix(ollama): improve tool-calling support and restore gemma/llama context limits CI / Lint (push) Has been cancelled Details CI / Test (push) Has been cancelled Details CI / Build (push) Has been cancelled Details - Explicitly set tool_choice: auto when tools are present to aid gemma/llama models - Sync stop sequences into the options map for broader compatibility - Restore gemma/llama to the high-context (32k) optimization list	2026-04-07 14:24:23 +00:00
hobokenchicken	4095c68822	fix(ollama): improve model detection and ensure robust token/context limits CI / Lint (push) Has been cancelled Details CI / Test (push) Has been cancelled Details CI / Build (push) Has been cancelled Details - Use case-insensitive matching for model names and routing - Default max_tokens/num_predict to 8192 for all Ollama models to prevent truncation - Increase default context window and add more large-context model families - Ensure DeepSeek routing handles Ollama-hosted variants correctly	2026-04-07 14:05:21 +00:00
hobokenchicken	ef37dc5af0	fix(ollama): significantly increase context and prediction limits CI / Lint (push) Has been cancelled Details CI / Test (push) Has been cancelled Details CI / Build (push) Has been cancelled Details - Increase timeout to 15m - Set num_ctx to 32k for common models - Set default num_predict to 8192 for common models	2026-04-07 13:48:02 +00:00
hobokenchicken	fdbb068a6c	fix(ollama): map max_tokens to num_predict and increase context window CI / Lint (push) Has been cancelled Details CI / Test (push) Has been cancelled Details CI / Build (push) Has been cancelled Details - Map MaxTokens to num_predict in options map - Set default num_ctx to 8192 for common models (gemma, llama, etc.) - This ensures Ollama doesn't cut off responses early due to default limits	2026-04-07 13:44:17 +00:00
hobokenchicken	dbbf48cb14	fix(ollama): increase timeout and add default max_tokens for large models CI / Lint (push) Has been cancelled Details CI / Test (push) Has been cancelled Details CI / Build (push) Has been cancelled Details - Increase Ollama timeout to 5m for larger models (e.g. gemma4) - Set default max_tokens to 4096 for common Ollama models - Expand stream scanner buffer to 10MB to prevent truncation - Improve model routing and prefix stripping in server	2026-04-07 13:42:10 +00:00
hobokenchicken	1e13b0376b	feat(ollama): improve configuration and dashboard integration CI / Lint (push) Has been cancelled Details CI / Test (push) Has been cancelled Details CI / Build (push) Has been cancelled Details	2026-04-07 12:53:17 +00:00
hobokenchicken	1b5cd2815e	fix(ollama): improve error handling and add timeouts CI / Lint (push) Has been cancelled Details CI / Test (push) Has been cancelled Details CI / Build (push) Has been cancelled Details - Add timeouts (30s) and retries to resty client - Add debug logging for Ollama requests and responses - Import time package for timeout configuration - This should fix 500 errors and provide better error messages	2026-04-06 15:05:31 -04:00
hobokenchicken	2f6b7deb2c	feat(providers): add Ollama provider support CI / Lint (push) Has been cancelled Details CI / Test (push) Has been cancelled Details CI / Build (push) Has been cancelled Details - Implement OllamaProvider with OpenAI-compatible API integration - Add Ollama to provider initialization in server.go - Update config.go to handle Ollama (no API key required) - Configure .env with Ollama server at 172.20.1.222:11434 - Support models: glm-4.7-flash:latest, qwen3-coder:30b, gemma4:26b	2026-04-06 14:38:35 -04:00
hobokenchicken	9375448087	fix(moonshot): resolve 401 Unauthorized errors by trimming API keys and improving request compatibility CI / Lint (push) Has been cancelled Details CI / Test (push) Has been cancelled Details CI / Build (push) Has been cancelled Details	2026-03-26 17:09:27 +00:00
hobokenchicken	bd1d17cc4d	feat: add moonshot kimi k2.5 support CI / Lint (push) Has been cancelled Details CI / Test (push) Has been cancelled Details CI / Build (push) Has been cancelled Details	2026-03-25 09:27:46 -04:00
hobokenchicken	08cf5cc1d9	fix: improve cost tracking accuracy for modern models CI / Lint (push) Has been cancelled Details CI / Test (push) Has been cancelled Details CI / Build (push) Has been cancelled Details - Added support for reasoning tokens in cost calculations. - Fixed DeepSeek cache-write token mapping (PromptCacheMissTokens). - Improved CalculateCost debug logging to trace all pricing variables.	2026-03-19 14:14:54 -04:00
hobokenchicken	03dca998df	chore: rebrand project to GopherGate CI / Lint (push) Has been cancelled Details CI / Test (push) Has been cancelled Details CI / Build (push) Has been cancelled Details Updated all naming from LLM Proxy to GopherGate. Implemented new CSS-based branding and updated Go module/binary naming.	2026-03-19 13:37:05 -04:00
hobokenchicken	0f3c5b6eb4	feat: enhance usage and cost tracking accuracy CI / Lint (push) Has been cancelled Details CI / Test (push) Has been cancelled Details CI / Build (push) Has been cancelled Details Improved extraction of reasoning and cached tokens from OpenAI and DeepSeek responses (including streams). Ensured accurate cost calculation using registry metadata.	2026-03-19 11:56:26 -04:00
hobokenchicken	6b10d4249c	feat: migrate backend from rust to go CI / Check (push) Has been cancelled Details CI / Clippy (push) Has been cancelled Details CI / Formatting (push) Has been cancelled Details CI / Test (push) Has been cancelled Details CI / Release Build (push) Has been cancelled Details This commit replaces the Axum/Rust backend with a Gin/Go implementation. The original Rust code has been archived in the 'rust' branch.	2026-03-19 10:30:05 -04:00

29 Commits