Status: idle
(output will stream here)
Notes
- Everything runs 100% in your browser via WebGPU. No server calls.
- Click Load model once; first load compiles GPU shaders (cold start). Subsequent prompts will be faster.
- The GPU stats button shows adapter features/limits and WebGL renderer info. Browsers do not expose live VRAM usage for security/privacy reasons.