# AI Body Double — Project Scope Questions

## 1. Avatar Style

What visual direction for the character companion?

- **Option A: Cute 2D illustrated character** — Simple animated sprite (blink, sway, idle bounce, wave). Think a stylized flat-vector character. Fast to build, runs anywhere.
- **Option B: Live2D / Vtuber-style** — Full rigged character with lip-sync, gestures, head tracking. Looks incredible but requires custom art assets and is significantly more work.
- **Option C: Pixel art character** — Retro chibi sprite with simple animations. Cozy, low-fi aesthetic.
- **Option D: No character art yet** — Start with a clean UI dashboard, add the character later.

---

## 2. Voice (TTS)

What level of voice quality?

- **Option A: Cloud API (ElevenLabs)** — Best quality female voices, natural intonation, can do "girly-pop" vibe. ~$5/month.
- **Option B: Local Piper TTS** — Free, self-hosted, no recurring cost. Lower quality, more robotic, but plenty of female voice models available.
- **Option C: OpenAI TTS** — Good quality, pay-per-use (~$0.015/minute). Middle ground.
- **Option D: No voice yet** — Start text-only, add TTS later.

---

## 3. Platform

Where does this need to run?

- **Browser-based (web app)** — Accessible from any laptop/phone/tablet on the home network. Easiest to build and iterate.
- **Desktop app (Electron/Tauri)** — Native feel, offline capable. More work.
- **Mobile app** — Phone-native experience. Most work.
- **Both** — Browser for laptop, but also accessible on phone.

---

## 4. Start Scope

How should we slice the first working version?

- **Option A: MVP — Character + Lo-fi + Timer + Notes**
  Build the visual companion, lo-fi music player, pomodoro/focus timer, and a notes widget. Get the "presence" and toolset right first. Add voice interactivity in phase 2.
- **Option B: Full build — Everything including voice**
  Go straight for the full pipeline: STT (microphone input) -> LLM (processes what she says) -> TTS (talks back) + all tools. Longer first delivery but one complete system.
- **Option C: Somewhere in between**
  Build the dashboard + character + tools, plus TTS only (so the assistant talks to her but doesn't listen yet). Add microphone input in phase 2.

---

## 5. LLM Backend (Brain)

What powers the assistant's responses and personality?

- **Local (Ollama)** — Self-hosted, free, private. You've got an AMD 6650 XT that can accelerate inference. Runs models like Llama 3, Mistral, Qwen.
- **Cloud API** — Better personality/instruction following. OpenAI, Anthropic, or similar. Small cost per month.
- **Not needed at first** — Start with scripted responses and canned encouragement, add AI conversational ability later.

---

## 6. Music & Audio

How should the lo-fi / white noise work?

- **Streaming (YouTube/SoundCloud URLs)** — Curate playlists of lo-fi study beats. Free, endless variety.
- **Local audio files** — Download lo-fi tracks and ambient sounds. Works offline.
- **Generated** — Use AI music generation for custom tracks. Experimental.
- **Integrated web player** — Embed something like Spotify or YouTube Music.

---

## 7. Virtual Pet

What kind of pet?

- **Cat** — Fits the cozy/girly aesthetic. Classic.
- **Dog** — Energetic companion.
- **Fantasy creature** — A cute blob/slime/fairy/dragon.
- **Customizable** — Let her pick, or unlock different ones.

---

## 8. Background Scenes

What environments should be available?

- Cozy bedroom / study
- Coffee shop / café
- Garden / nature
- Rainy window
- Starry night / space
- Underwater / aquarium
- Minimalist / clean
- Seasonal (winter cabin, spring garden, autumn library)

---

## Project Name (optional)

What should we call this? Some ideas:

- **Buddy** (simple)
- **CozyFocus** / **CoPilot** (functional)
- **Luna** / **Mochi** / **Coco** (character name)
- Something else?

---

Once you answer these, I'll write up the full architecture plan and start building.