Local & Open Source AI · Posted by Raj Patel · 2mo ago

Phi-3 Small vs Gemma 2 vs Mistral Small: Edge AI Benchmarks

Benchmarked small models for edge deployment: a comparison of latency, accuracy, and memory usage.
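For anyone who wants to run a comparison like this themselves, here is a minimal sketch of a latency harness. It is not the setup used for the numbers in this thread. `measure_latency` and `fake_model` are hypothetical names, and `fake_model` is only a stand-in you would swap for a real call into whichever runtime you use. It covers latency only; accuracy and memory would need separate tooling.

```python
import statistics
import time
from typing import Callable, Dict, List


def measure_latency(generate: Callable[[str], str], prompts: List[str],
                    warmup: int = 1) -> Dict[str, float]:
    """Time each call to `generate` and summarize latency in milliseconds."""
    # Warm-up calls are not timed, so one-time costs (model load, cache fill)
    # do not skew the samples.
    for prompt in prompts[:warmup]:
        generate(prompt)

    samples_ms = []
    for prompt in prompts:
        start = time.perf_counter()
        generate(prompt)
        samples_ms.append((time.perf_counter() - start) * 1000.0)

    return {
        "runs": float(len(samples_ms)),
        "mean_ms": statistics.mean(samples_ms),
        "median_ms": statistics.median(samples_ms),
        "max_ms": max(samples_ms),
    }


def fake_model(prompt: str) -> str:
    # Hypothetical stand-in for a real model call (assumption, not a real API).
    time.sleep(0.01)
    return prompt.upper()


if __name__ == "__main__":
    print(measure_latency(fake_model, ["hello", "edge ai", "benchmark"]))
```

Running every model through the same prompt list and the same harness keeps the comparison fair. The median is usually a steadier number to report than the mean on small devices, where a single slow run can dominate.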

6 replies

2mo ago

lmao I literally ran into this same problem yesterday. I'm running Llama 3 locally and it's basically as good as the API for most tasks

2mo ago

I've been doing this for about 6 months now and the biggest lesson I've learned is that the GGUF format basically won the local model format war. It's not what I expected at all going in

2mo ago

ok real talk - quantized models have gotten insanely good. I barely notice the quality drop. I know that's not the popular opinion here but someone had to say it

2mo ago

quick question - has anyone benchmarked this against the open source alternatives?

2mo ago

yeah exactly. I run everything through LM Studio now, way simpler than the command line

1mo ago

has anyone actually tried this with Copilot? curious how it holds up