Switching resets the chat. Same engine, different hat.
Paste a free key from build.nvidia.com. Sent via the proxy Worker (avoids CORS). Stored in this browser only; for prototyping.
Runs entirely on your device, no key. Uses WebGPU (WebLLM) when available (desktop, newer Android, iPhone 15 Pro+/iOS 18+), else falls back to CPU (wllama). First reply downloads the model once.
Tap Beep. If you hear nothing, it's the iPhone silent switch (flip the side switch off). Native apps bypass it; Safari can't.
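For reference, the WebGPU-or-CPU choice described above could be made roughly like this. This is a minimal sketch, not the app's actual code: `Backend`, `GpuLike`, `chooseBackend`, and `detectBackend` are hypothetical names, and the sketch only decides which path to take; it does not load WebLLM or wllama.

```typescript
// Hypothetical sketch of the backend choice; all names are illustrative.
type Backend = "webgpu" | "cpu";

// Minimal structural view of navigator.gpu, so the sketch type-checks
// without browser or WebGPU typings.
interface GpuLike {
  requestAdapter(): Promise<unknown | null>;
}

// Pure decision: use WebGPU only when an adapter was actually obtained.
function chooseBackend(adapter: unknown | null | undefined): Backend {
  return adapter ? "webgpu" : "cpu";
}

// Probe the environment. navigator.gpu can exist while requestAdapter()
// still resolves to null, so both cases fall back to CPU.
async function detectBackend(nav: { gpu?: GpuLike }): Promise<Backend> {
  if (!nav.gpu) return "cpu";
  try {
    return chooseBackend(await nav.gpu.requestAdapter());
  } catch {
    // Treat any probing error as "no usable GPU".
    return "cpu";
  }
}
```

In a browser this would be called as `detectBackend(navigator as { gpu?: GpuLike })`, with the result deciding which engine to initialize before the first reply.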
Device = phone/OS voices (instant). Kokoro = on-device English. Google = cloud, 24 languages, plays reliably on iPhone (needs your free AI Studio key).
Robotic? On iPhone, download a better voice in Settings → Accessibility → Spoken Content → Voices → English (e.g. an Enhanced/Premium voice), then pick it here.
Free key at aistudio.google.com → Get API key. Stored in this browser only.
Voice model loads on first reply.