I’m a person who tends to program stuff in Godot and also likes to look at clouds. Sometimes they look really spicy!
Please try the 4-bit quantisations of the models. They run a bunch faster while eating less RAM.
Generally you want to use 7B or 8B models on the CPU, since anything bigger will be hellishly sluggish.
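If you want a feel for why 4-bit helps so much, here's a rough back-of-the-envelope sketch. It only counts the weights (parameters × bits ÷ 8), so treat the numbers as a lower bound: real quantised files carry some extra scaling data, and the context cache needs RAM on top. The function name `weight_memory_gb` is just something I made up for this example.

```python
def weight_memory_gb(n_params: float, bits_per_weight: float) -> float:
    """Rough RAM needed for the weights alone, in GB (1 GB = 1e9 bytes).

    Assumption: every weight is stored at exactly `bits_per_weight` bits.
    Real quantisation formats add a bit of overhead on top of this.
    """
    return n_params * bits_per_weight / 8 / 1e9


if __name__ == "__main__":
    # Compare a 7B and an 8B model at 16-bit, 8-bit and 4-bit precision.
    for n_params in (7e9, 8e9):
        for bits in (16, 8, 4):
            gb = weight_memory_gb(n_params, bits)
            print(f"{n_params / 1e9:.0f}B model at {bits}-bit: ~{gb:.1f} GB")
```

So a 7B model drops from about 14 GB of weights at 16-bit to about 3.5 GB at 4-bit, which is the difference between not fitting and fitting comfortably on a lot of machines.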
But I wanna have a website others can access too. I tried using VPNs for cool stuff already (like controlling my lil raspberry robot from work with my phone), but I want this website to be available to everyone…
Should I just bite the bullet and rent some hosting service? Or is there still hope for me putting “set up home website server” on my resume?