hobokenchicken/kira

Author	SHA1	Message	Date
hobokenchicken	9d2ba052f4	fix(live2d): precise cat positioning and sizing - Extract layout constants matching Tailwind config (PAD, LEFT_W, GAP, RIGHT_W) - positionModels() helper computes exact pixel positions from layout - Kira: centered in center panel at 78% of available space - Mochi: 120px tall, centered in right sidebar, above status bar - Both models reposition on window resize	2026-06-05 13:42:21 -04:00
hobokenchicken	5f5127f4fa	refactor(live2d): single shared stage for both Kira and Mochi Live2DStage creates ONE full-viewport transparent canvas (z-0, pointer-events:none). Both Kira and Mochi cat models render on the same Pixi stage and WebGL context. KiraAvatar is now UI-only (no canvas), receives model ref from stage. PetZone is label-only. Eliminates all WebGL context conflict errors.	2026-06-05 13:34:51 -04:00
hobokenchicken	43a392e5f5	fix(pets): use preferWebGLVersion:1 for cat canvas to avoid context conflicts Cat gets its own canvas with WebGL1 context (Kira uses WebGL2 by default). Different GL versions don't share buffers, so no bindBuffer spam. Cat now renders in the PetZone section of the right sidebar where it belongs. Removed all shared-context onAppReady plumbing.	2026-06-05 13:25:10 -04:00
hobokenchicken	1f8bcf6b4f	fix(pets): cat renders on shared KiraAvatar canvas via onAppReady callback Single WebGL context, no bindBuffer spam. Cat model loads onto KiraAvatar's stage and positions itself at bottom-right corner. PetZone passes app prop through to Live2DCat.	2026-06-05 13:19:32 -04:00
hobokenchicken	37f8bf59a0	fix(webgl): use forceCanvas for Live2DCat to avoid dual WebGL context conflicts Reverts shared-context approach. Live2DCat gets its own canvas with forceCanvas:true (Canvas2D renderer), which avoids the WebGL bindBuffer spam entirely. Cleaned up onAppReady prop from KiraAvatar.	2026-06-05 13:08:18 -04:00
hobokenchicken	be1e51cc9a	fix(webgl): share single Pixi context between KiraAvatar and Live2DCat Eliminates WebGL bindBuffer/bindTexture spam from dual Application contexts. Cat model now loads onto KiraAvatar's shared stage via onAppReady callback.	2026-06-05 13:04:43 -04:00
hobokenchicken	017c81cffa	feat(pets): replace static cats with Live2D LittleCat model (black texture) - Copied LittleCat model files to frontend/public/live2d/models/little-cat/ - Using the black alternate texture as default - Created Live2DCat component that renders the model in a small canvas - PetZone now shows a single Live2D cat instead of two SVG cats	2026-06-05 12:55:24 -04:00
hobokenchicken	15199dfdee	feat(layout): move avatar to center hero position; timer+notes+chat to left sidebar	2026-06-05 12:44:17 -04:00
hobokenchicken	95f97fa897	fix(avatar): declarative canvas element in JSX; remove manual DOM append React was potentially clearing the canvas on re-render because we appended it manually to a div. Now using a <canvas ref={canvasRef}> element directly in JSX that React manages. Pixi app uses . Scale set to 82% of container.	2026-06-05 10:10:03 -04:00
hobokenchicken	e00dc37e68	fix(avatar): use Pixi resizeTo for native canvas sizing; remove all manual CSS/ResizeObserver Previous approach set CSS width:100% on a low-res canvas, causing the browser to stretch/pixelate the model. Now using Pixi's built-in resizeTo so the canvas internal resolution always matches the container. Model scaled to 90% of container with centered anchor.	2026-06-05 09:57:54 -04:00
hobokenchicken	3a6a1cd6c3	fix(avatar): reduce model margin to 45% to prevent clipping in narrow sidebar	2026-06-05 09:51:39 -04:00
hobokenchicken	13dbcdb7f5	fix(avatar): re-apply CSS 100% after Pixi resize(); use fitModel helper; 65% margin Pixi renderer.resize() overwrites canvas inline width/height styles, locking the canvas to the initial size and leaving empty space below. Now we re-apply width:100%;height:100% after every resize so the canvas always fills its container. Removed unused appRef.	2026-06-05 09:47:51 -04:00
hobokenchicken	f2ff91730b	fix(avatar): use ResizeObserver for accurate container sizing; force canvas CSS 100%; reduce margin to 68% Problem: flex layout wasn't ready on first paint, so clientWidth fell back to 400px. Canvas was 400px wide but parent was only 288px, causing the avatar to be clipped on the right. Fix: ResizeObserver measures real laid-out size before init. Canvas forced to width/height 100% via CSS so it never overflows. Model scaled to 68% with centered anchor. Resize handled dynamically.	2026-06-05 09:43:12 -04:00
hobokenchicken	dc2cb3bbb3	fix(avatar): reduce model scale to 72% (from 85%) and tighten anchor to prevent right-side clipping in narrow sidebar	2026-06-05 09:36:37 -04:00
hobokenchicken	dfd014ac82	feat(ui): complete layout redesign — three-panel desk layout Replaced the hero + scrollable grid with a fixed-height three-column workspace: - Left (fixed 288px): Kira avatar + compact chat + text input - Center (flex): Large focus timer + notes - Right (fixed 256px): Music, white noise, wardrobe, pets Thin top bar: scene selector dots + clock Thin bottom bar: status + connection indicator No cards, no scrollable grid, no wasted space. Clean, modern, everything visible at once. Avatar fills full sidebar height.	2026-06-05 09:33:42 -04:00
hobokenchicken	db23034e36	feat(ui): ditch all glass-card containers — flat, modern, card-free layout All 15+ glass-card instances removed across every component (Timer, Music, Notes, WhiteNoise, PetZone, Clock, ChatBubble, Wardrobe, Toolbar, KiraAvatar, BackgroundScene, WelcomeScreen, App text input + bottom bar). New design: widgets sit directly on the gradient background with only padding, no frosted-glass backgrounds, borders, or shadows. Cleaner, more modern look.	2026-06-05 09:26:51 -04:00
hobokenchicken	f5930d6190	fix(avatar): center Live2D model in card, overlay controls on canvas; scale model to 85% of container; remove card padding; clean template literals to avoid TS parsing issues	2026-06-05 09:16:16 -04:00
hobokenchicken	baaa89756f	feat(ui): center avatar as hero, ~1/3 viewport height; tools grid below - Avatar now centered in its own row above the tools grid (was crammed in column 1) - KiraAvatar container: min-height 33vh, canvas up to 500px wide - Tools reorganized into 4 columns below: Chat, Timer+Music, Notes+Noise, Clock+Pets+Wardrobe - WelcomeScreen restored to full (not compact) for first-time users	2026-06-05 09:03:32 -04:00
hobokenchicken	92250a668b	fix: restore full WebSocket message loop in main.py (was truncated to 77 lines, missing the entire try/except message handler) The REST STT revert commit (`0e74a16`) deleted lines 78-262 including the message loop, identify handler, audio handler, text handler, and disconnect cleanup. This caused the WS to accept then immediately close, triggering a reconnect loop. Refactored for clarity: transcribe_audio(), get_kira_response(), stream_tts() as standalone async helpers. Full pipeline restored.	2026-06-05 02:10:41 -04:00
hobokenchicken	86b1e9aa04	audit-followup: re-verify all builds/deploys clean; update AUDIT.md with full completion of the 9-item plan All items finished and 'clean' applied (legacy archived, docs updated, deprecations suppressed, nesting fixed, white noise added, etc.). Site at https://kira.kaylassafe.space should now have working voice (REST path), incremental TTS, live hearing display, notes, white noise, fixed timer, clean welcome, no legacy code pollution.	2026-06-04 16:06:28 -04:00
hobokenchicken	4641d74536	fix(welcome): make WelcomeScreen support isCompact prop to prevent full-screen CSS clash when rendering inside saved-ID wrapper card in App.tsx Per PLAN item 8. Saved users now get a clean compact welcome prompt without double min-h-screen divs.	2026-06-04 16:04:14 -04:00
hobokenchicken	eb5952adc6	fix(deprecations): remove dead ScriptProcessorNode PCM code (eliminates console warning); improve YouTube playerVars with origin/modestbranding (reduce postMessage spam); fix Timer stopwatch (now properly counts UP, clean display + interval) Per PLAN item 7.	2026-06-04 15:59:02 -04:00
hobokenchicken	1bfc8333e9	clean: archive legacy stt/tts/llm services; update ARCHITECTURE.md + README.md to current stack (REST gpt-4o-transcribe, nano, sage, Honcho, incremental TTS, white noise) Per PLAN item 6. Legacy files moved to archive/legacy-pipeline/.	2026-06-04 15:54:12 -04:00
hobokenchicken	59b72aa184	feat(white-noise): add Web Audio generated white/pink/brown/rain/cafe noise player Separate from lofi music per original spec. Toggleable, volume control, always available in focus column. Finishes item 5.	2026-06-04 15:49:18 -04:00
hobokenchicken	3f1497174d	feat(ui): integrate Notes component into main grid (was dead import) Per original spec and AUDIT/PLAN item 4. Notes now visible alongside Timer/Music.	2026-06-04 15:38:52 -04:00
hobokenchicken	771c00830a	feat(ui): display livePartial / transcript_delta in ChatBubble as 'Hearing:' indicator - REST STT now sends delta (full text) so UI lights up immediately with what was heard. - Works as 'live' for the final transcript (true partials would stream words if Realtime was available). - Per PLAN item 3.	2026-06-04 15:34:37 -04:00
hobokenchicken	77cbd91b93	fix(tts): play Opus chunks immediately as they arrive instead of buffering until speaking_end This makes the voice start playing the first words while the rest of the response is still generating (big win for perceived latency). Per PLAN item 2.	2026-06-04 15:28:40 -04:00
hobokenchicken	0e74a16b40	fix(stt): revert to reliable REST gpt-4o-transcribe + MediaRecorder full-blob (Realtime WS not accessible on key) - Backend: added transcribe_audio (gpt-4o-transcribe), switched audio handler to full blob -> REST -> LLM -> streaming TTS - Frontend: MediaRecorder (webm/opus) full recording sent on stop (one blob per utterance) - Removed dead WhisperStream callbacks and pending_transcript/lock - This unblocks voice per AUDIT item 1 (Option B fallback). Deltas will come in later item. - Also preps for deprecation fix (MediaRecorder is the good path).	2026-06-04 15:23:57 -04:00
hobokenchicken	188da1d52a	fix(stt): try gpt-4o-realtime-preview as base session model + gpt-realtime-whisper for input_audio_transcription (per OpenAI error guidance)	2026-06-04 15:16:08 -04:00
hobokenchicken	191b7ad9b5	fix(stt): correct Realtime WS model to gpt-realtime-whisper + enhance event handling for deltas/completed - URL now uses ?model=gpt-realtime-whisper (was invalid gpt-4o-mini-realtime-preview) - Cleaned session.update (removed modalities that may not apply) - Expanded _handle to catch input_audio_transcription.delta and .completed events - on_error now forwards transcription errors to frontend client - Per AUDIT + PLAN item 1	2026-06-04 15:14:26 -04:00
hobokenchicken	7502f201c7	feat: Realtime WebSocket STT via gpt-realtime-whisper Replaces REST-based transcription (gpt-4o-transcribe) with WebSocket streaming via gpt-realtime-whisper. Frontend captures PCM16 audio and streams it through the backend to a Realtime transcription session. - Server-side VAD detects utterance boundaries automatically - Word-level transcript deltas stream to the client in real-time - On utterance end, gpt-5.4-nano generates a response - TTS streams back via with_streaming_response - Total pipeline: PCM16 → Realtime WS → LLM → streaming TTS	2026-06-04 14:26:19 -04:00
hobokenchicken	25b12ee14f	fix: gpt-realtime-whisper requires Realtime API, not REST endpoint Swapped to gpt-4o-transcribe (/usr/bin/bash.006/min) — middle ground between speed and cost.	2026-06-04 14:23:41 -04:00
hobokenchicken	3128f69e48	fix: switch TTS voice from nova to sage	2026-06-04 14:22:18 -04:00
hobokenchicken	f98f87b7ee	fix: swap to gpt-realtime-whisper for STT Replaces gpt-4o-mini-transcribe (/usr/bin/bash.003/min) with gpt-realtime-whisper (/usr/bin/bash.017/min). Expected to reduce transcription latency from ~2.6s to ~1s due to the model's realtime optimization.	2026-06-04 14:20:40 -04:00
hobokenchicken	9cd183a83b	fix: streaming TTS via with_streaming_response Replaced synchronous TTS (waiting for full audio at 5.9s) with streaming TTS that sends audio chunks as they arrive. Backend now accumulates chunks in audioBufferRef and plays the complete stream on speaking_end. Reduces TTS latency from ~6s to ~1s first byte.	2026-06-04 14:17:54 -04:00
hobokenchicken	2cd5636ad6	debug: add per-step timing logs to identify latency bottleneck	2026-06-04 14:14:02 -04:00
hobokenchicken	7875b5d12a	fix: cache Honcho memory context per-session (not per-turn) The memory context was being rebuilt on every conversation turn via build_system_prompt(), which calls Honcho's dialectic reasoning API twice (get_user_context + get_kira_context). Each call takes 5-15s. Now the memory suffix is computed ONCE during identify and cached in a memory_suffix variable for the session duration. Per-turn latency drops from ~37s to ~3s. Also removed duplicated _pcm16_to_wav and cleaned up orphaned code.	2026-06-04 14:11:14 -04:00
hobokenchicken	c5cc4dd480	fix: replace PCM16 capture with MediaRecorder (Opus/webm) PCM16 capture via AudioContext was streaming raw audio continuously, causing massive accumulated buffers that took ~20s to transcribe. Replaced with MediaRecorder which records compressed Opus/webm and sends a single blob on release — much smaller, faster to transcribe. Also removed all unused PCM16/WAV helper functions from both frontend and backend.	2026-06-04 14:04:44 -04:00
hobokenchicken	537ddcd841	fix: play Opus TTS audio directly instead of WAV-converting it The backend sends Opus-encoded audio from OpenAI TTS (tts-1 with response_format=opus). The frontend was treating it as raw PCM16 and wrapping it in a WAV container, which corrupted the audio into static. Now plays the Opus data directly as audio/ogg.	2026-06-04 13:59:04 -04:00
hobokenchicken	a370f1ebff	fix: use max_completion_tokens for gpt-5.4-nano gpt-5.4-nano uses max_completion_tokens instead of max_tokens	2026-06-04 13:57:05 -04:00
hobokenchicken	a19ac46312	fix: wrap PCM16 in WAV container before STT API call Frontend captures PCM16 mono 24kHz audio. The transcription API expects a proper audio container format (wav, webm, etc.), not raw PCM16 data. Added _pcm16_to_wav() to wrap the raw bytes in a WAV header before sending to gpt-4o-mini-transcribe.	2026-06-04 13:55:05 -04:00
hobokenchicken	f2a5416408	feat: cheapest pipeline — gpt-4o-mini-transcribe + gpt-5.4-nano + TTS Simple 3-step chat completions pipeline at ~/usr/bin/bash.019/min total. Streams PCM16 audio from frontend, transcribes on release, generates response via gpt-5.4-nano, speaks via OpenAI TTS. Cost breakdown: gpt-4o-mini-transcribe: /usr/bin/bash.003/min gpt-5.4-nano: ~/usr/bin/bash.001/min OpenAI TTS (nova): /usr/bin/bash.015/min Total: ~/usr/bin/bash.019/min (~/usr/bin/bash.57/day at 30min)	2026-06-04 13:51:35 -04:00
hobokenchicken	66e799a655	fix: connect directly via websockets to bypass OpenAI-Beta header The openai library's beta.realtime.connect() hardcodes the obsolete 'OpenAI-Beta: realtime=v1' header which the GA API rejects. Connecting directly via the websockets library with only the Authorization header resolves the 'beta_api_shape_disabled' error.	2026-06-04 13:49:56 -04:00
hobokenchicken	274d04ea10	feat: hybrid pipeline — gpt-realtime-whisper + gpt-5.4-nano + TTS Hybrid approach gives streaming STT at ~/usr/bin/bash.017/min + cheap brain at ~/usr/bin/bash.001/min + TTS at ~/usr/bin/bash.015/min = ~/usr/bin/bash.033/min total. - gpt-realtime-whisper handles streaming transcription with VAD - gpt-5.4-nano handles response generation (chat completions) - OpenAI TTS (nova) for voice output - Server VAD detects utterance boundaries - Honcho memory context injected into system prompt - Removed old full Realtime relay service	2026-06-04 13:48:06 -04:00
hobokenchicken	1c15d42e06	fix: remove OpenAI-Beta header, use gpt-realtime-2 GA model The old OpenAI-Beta: realtime=v1 header is rejected by the GA API. Removing it via extra_headers override. Using gpt-realtime-2 which is the current production Realtime model.	2026-06-04 13:42:06 -04:00
hobokenchicken	e2332af8d0	feat: OpenAI Realtime API pipeline Replaced the 3-step sequential pipeline (Whisper STT → DeepSeek LLM → OpenAI TTS) with a single OpenAI Realtime API WebSocket using gpt-4o-mini-realtime-preview. - ~300-800ms latency vs 1-3s - Server VAD for automatic turn detection - Streaming audio chunks during playback - Interruptions: user can speak over Kira mid-response - Honcho memory still injected into session instructions - Frontend captures PCM16 mono 24kHz via AudioContext - Backend relays client ↔ OpenAI Realtime API - Supports both voice (PCM16) and text input	2026-06-04 13:32:39 -04:00
hobokenchicken	e64698b0ab	fix: graceful mic-unavailable handling over HTTP navigator.mediaDevices.getUserMedia() requires a secure context (HTTPS or localhost). When accessed over plain HTTP, the API is undefined. Now shows a friendly chat message instead of a cryptic TypeError in the console.	2026-06-04 12:12:07 -04:00
hobokenchicken	895fb9ac0b	fix: Live2D Ticker registration + outfit texture swap path - Registered pixi Ticker via (Live2DModel as any).registerTicker() to fix 'No Ticker registered' warning and animation issues - Fixed outfit texture swap: textures live on model.textures[] not model.internalModel.textures[]	2026-06-04 12:10:20 -04:00
hobokenchicken	3d3df64d7c	fix: missing mic toggle in Live2D view + YouTube autoplay KiraAvatar: Added Talk mic button to Live2D view (was only in AnimatedAvatar fallback). Includes listening-pulse animation. MusicPlayer: Replaced hidden YouTube iframe with proper IFrame Player API. Now starts on explicit user click (Start Lo-Fi button), complying with browser autoplay policies. Supports station switching and volume control after playback starts.	2026-06-04 12:06:16 -04:00
hobokenchicken	bee428ae0c	fix: outfit texture swap via internalModel.textures array model.internalModel.coreModel.setTexture() expects a raw WebGL texture, not a PixiJS Texture. Instead, set the new PixiJS Texture directly on the model's internalModel.textures[2] array. The render loop's bindTexture() call extracts the WebGL handle from the PixiJS BaseTexture and passes it to the Cubism core. This eliminates the cascade of try-catch fallbacks and the 'coreModel.setTexture is not a function' TypeError.	2026-06-04 12:02:48 -04:00

1 2

56 Commits