Why do you think wasm is harder to circumvent? The only way to instantiate a wasm module in the browser is through (drum roll) javascript. Install noscript if you're worried. The days of view source -> edit are basically over anyway due to every site's 1MB+ minified JS blobs.
> Why would I want to run an LLM in the browser, I could just run it natively for better performance?
Why would you try out a new app in a sandboxed browser, when instead you could give it complete access to your entire computer?
Sandboxing is about preventing code from accessing data it's not supposed to. Data like files or memory belonging to other tabs or other processes. Or data streams like your webcam or microphone. Data outside of its, well, sandbox.
So how are compute shaders accessing data they're not supposed to? How do you think they're escaping the sandbox?
It seems like you're just making up your own definitions now because you don't like the tech. What do think a sandbox is, exactly? And what do you think Chrome's GPU sandbox does, if it's not a sandbox?
> Why would I want to run an LLM in the browser, I could just run it natively for better performance?
Why would you try out a new app in a sandboxed browser, when instead you could give it complete access to your entire computer?