feat: browser wasm llm #2970
No description provided.

Comments
https://github.com/mlc-ai/web-llm or something like it?
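A minimal sketch of what plugging web-llm in could look like, based on the @mlc-ai/web-llm package's OpenAI-style interface (the model id is illustrative, not a commitment to a specific model):

```ts
// Browser-side sketch using @mlc-ai/web-llm; the model id is illustrative.
import { CreateMLCEngine } from "@mlc-ai/web-llm";

async function main() {
  // First run downloads the quantized weights and compiles the kernels.
  const engine = await CreateMLCEngine("Llama-3-8B-Instruct-q4f16_1-MLC", {
    initProgressCallback: (p) => console.log(p.text),
  });

  // OpenAI-style chat completion, running entirely in the browser.
  const reply = await engine.chat.completions.create({
    messages: [{ role: "user", content: "Hello from the browser!" }],
  });
  console.log(reply.choices[0]?.message.content);
}

main();
```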
This is useful if you can only do a limited amount of processing on the server and the client has more processing power. WASM approaches to running llama.cpp exist, as well as WebGPU approaches (which only work on certain systems and certain browser versions). There may be a WASM memory size issue with typical LLM sizes, though working around it may well be possible.
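Since WebGPU availability varies by system and browser version, a runtime check before choosing a backend could look like the sketch below. This is a hypothetical helper, not part of any library mentioned here; `navigator.gpu` is the standard WebGPU entry point:

```ts
// Feature-detect WebGPU and fall back to plain WASM otherwise.
// (navigator.gpu needs @webgpu/types to type-check; cast for brevity.)
async function pickBackend(): Promise<"webgpu" | "wasm"> {
  const gpu = (navigator as any).gpu;
  if (gpu) {
    // requestAdapter() resolves to null if no suitable GPU is available.
    const adapter = await gpu.requestAdapter();
    if (adapter) return "webgpu";
  }
  // 32-bit WASM linear memory tops out at 4 GiB, so full-size LLM weights
  // generally need quantization or weight streaming to fit.
  return "wasm";
}

pickBackend().then((backend) => console.log(`Using ${backend} backend`));
```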
@justinh-rahb yep, seems promising!
Yeah MLC-LLM is the ship! Would love to plug-n-play with it |