idle
Switching resets the chat. Same engine, different hat.
Paste a free key from build.nvidia.com. Sent via the proxy Worker (avoids CORS). Stored in this browser only β prototyping.
Runs entirely on your device (no server, no WebGPU needed). First reply downloads the model once (~1.6 GB).
Voice model loads on first reply.