idle
Switching resets the chat. Same engine, different hat.
Paste a free key from build.nvidia.com. Sent via the proxy Worker (avoids CORS). Stored in this browser only; for prototyping.
Runs entirely on your device (no server, no WebGPU needed). First reply downloads the model once (~1.6 GB).
Tap Beep. If you hear nothing, it's the iPhone silent switch (flip the side switch off). Native apps bypass it; Safari can't.
Device voice uses your phone/OS voices: instant, reliable, multilingual. Kokoro is a branded English voice (heavier; best on desktop).
Voice model loads on first reply.