Self-hosting LLMs

@GreenSofaBed@lemmy.zip · 8 months ago

@Showroom7561@lemmy.ca · edit-2 8 months ago

You can run models locally right from Windows with Jan: https://jan.ai/

You’ll need a lot of RAM, but processing is decently fast, even on a basic laptop.

edit: holy hell. Grammar.

@dangling_cat@lemmy.blahaj.zone · 8 months ago

Tip: you can paste a Hugging Face link directly into the search box, and it will download the model automatically! It’s also pretty smart about memory: it loads the model into your VRAM first, then spills over into your RAM. If everything fits into VRAM, you get the fastest speed. But even running from RAM isn’t terribly bad; it’s still faster than you can read.
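
If you want a rough idea of where a model would land before downloading it, here is a minimal sketch of the VRAM-first, then RAM idea in Python. This is not how the app actually decides placement; `plan_placement` and its parameters are hypothetical names for illustration, and the sizes are numbers you supply yourself (real usage also needs headroom for context and the OS).

```python
def plan_placement(model_gb: float, vram_gb: float, ram_gb: float) -> dict:
    """Greedy VRAM-first split of a model across GPU memory and system RAM.

    Hypothetical helper for illustration only: it ignores context/KV-cache
    overhead and whatever memory the OS and other programs already use.
    """
    if min(model_gb, vram_gb, ram_gb) < 0:
        raise ValueError("sizes must be non-negative")

    # Fill VRAM first, since that is the fastest place to run from.
    in_vram = min(model_gb, vram_gb)
    # Whatever did not fit in VRAM goes to system RAM, up to its limit.
    in_ram = min(model_gb - in_vram, ram_gb)
    # Anything still left over does not fit on this machine at all.
    leftover = model_gb - in_vram - in_ram

    return {
        "vram_gb": in_vram,
        "ram_gb": in_ram,
        "fits": leftover <= 0,
        "fully_on_gpu": leftover <= 0 and in_ram == 0,
    }


if __name__ == "__main__":
    # Example: a 12 GB model file on a machine with 8 GB VRAM and 16 GB RAM.
    print(plan_placement(12.0, 8.0, 16.0))
```

If `fully_on_gpu` comes back true, you are in the fastest case; if only `fits` is true, expect the slower but still usable RAM case.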

@GreenSofaBed@lemmy.zip · 8 months ago

This is pretty cool!