How does Lemmy feel about "open source" machine learning, akin to the Fediverse vs Social Media?

brucethemoose@lemmy.world · edit-2 11 months ago

fruitycoder@sh.itjust.works · 10 months ago

None taken! I’ll check out AI Horde!

Are there any objectively measured metrics, or at least subjective review-based ones, for how a model does on a given problem set? I know the white papers tend to include them, and sometimes the git repos do too, but I don’t see that info when searching through ollama, for example.

I saw your other post about ollama alternatives, and the concurrency mentioned in one of the projects’ READMEs sounds promising.

brucethemoose@lemmy.world · edit-2 10 months ago

Honestly I would get away from ollama. I don’t like it for a number of reasons, including:

Suboptimal quants

Suboptimal settings

Limited model selection (as opposed to just browsing huggingface)

Sometimes suboptimal performance compared to kobold.cpp, especially if you are quantizing the cache, and doubly so if you are not on a Mac

Frankly, a lot of attention squatting/riding off llama.cpp’s development without contributing a ton back.

Rumblings of a closed source project.

I could go on and on, including some behavior I just didn’t like from the devs, but I think I’ll stop, as it’s really not that bad.
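For a rough sense of why cache quantization comes up at all, here is a back-of-the-envelope sketch of KV cache size. The model shape (32 layers, 8 KV heads, head dimension 128) and the 32k context are illustrative assumptions for a 7B-class model, not measurements from any tool mentioned here, and the per-element sizes assume llama.cpp-style q8_0 and q4_0 block layouts. The function name is hypothetical.

```python
# Rough KV cache size estimate. All model numbers used below are
# illustrative assumptions, not values read from any real runtime.

# Bytes per cached element, assuming llama.cpp-style block layouts:
# q8_0 packs 32 values into 34 bytes, q4_0 packs 32 values into 18 bytes.
BYTES_PER_ELEMENT = {"f16": 2.0, "q8_0": 34 / 32, "q4_0": 18 / 32}


def kv_cache_bytes(n_layers, n_kv_heads, head_dim, context_len, cache_type="f16"):
    """Estimate memory used by keys plus values across all layers."""
    # The factor of 2 accounts for storing both K and V tensors.
    elements = 2 * n_layers * n_kv_heads * head_dim * context_len
    return elements * BYTES_PER_ELEMENT[cache_type]


if __name__ == "__main__":
    # Assumed shape: 32 layers, 8 KV heads, head_dim 128, 32k context.
    for cache_type in BYTES_PER_ELEMENT:
        size = kv_cache_bytes(32, 8, 128, 32768, cache_type)
        print(f"{cache_type}: {size / 2**30:.3f} GiB")
```

This only counts the cache itself, so real memory use will be higher once weights and compute buffers are included.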

brucethemoose@lemmy.world · 10 months ago

Oh, and as for benchmarks, check the Hugging Face Open LLM Leaderboard. The new one.

But take it with a LARGE grain of salt. Some models game their scores in different ways.

There are more niche benchmarks floating around, such as RULER for long-context performance. Amazon ran a good array of models through it to test their Mistral finetune: https://huggingface.co/aws-prototyping/MegaBeam-Mistral-7B-512k