Local & Open Source AI · Posted by Raj Patel

Phi-3 Small vs Gemma 2 vs Mistral Small: Edge AI Benchmarks

10

I benchmarked small models for edge deployment, comparing latency, accuracy, and memory usage.
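For anyone who wants to reproduce the latency side of a comparison like this, here is a rough sketch of a timing harness. It is not the setup used for this post. The `benchmark` function and the `fake_model` stand-in are hypothetical names made up for illustration; you would swap `fake_model` for whatever call actually runs your local model. It only covers wall-clock latency, so accuracy and memory would need separate tooling.

```python
import statistics
import time
from typing import Callable, Dict, List


def benchmark(generate: Callable[[str], str], prompts: List[str], warmup: int = 1) -> Dict[str, float]:
    """Time a text-generation callable over a list of prompts (hypothetical helper)."""
    if not prompts:
        raise ValueError("need at least one prompt")

    # Warm-up calls so one-time costs (model load, cache fill) don't skew the numbers.
    for prompt in prompts[:warmup]:
        generate(prompt)

    latencies_ms = []
    for prompt in prompts:
        start = time.perf_counter()
        generate(prompt)
        latencies_ms.append((time.perf_counter() - start) * 1000.0)

    return {
        "runs": float(len(latencies_ms)),
        "mean_ms": statistics.mean(latencies_ms),
        "median_ms": statistics.median(latencies_ms),
        "max_ms": max(latencies_ms),
    }


def fake_model(prompt: str) -> str:
    # Assumption: stand-in for a real local model call, just sleeps briefly.
    time.sleep(0.001)
    return prompt.upper()


if __name__ == "__main__":
    print(benchmark(fake_model, ["hello", "summarize this", "translate that"]))
```

Running the same prompt list through each model with a harness like this at least keeps the latency numbers comparable across models on the same device.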

6 replies

38

lmao I literally ran into this same problem yesterday. I'm running Llama 3 locally and it's basically as good as the API for most tasks.

9

I've been doing this for about 6 months now and the biggest lesson I've learned is that the GGUF format basically won the local model format war. It's not what I expected at all going in.

10

ok real talk: quantized models have gotten insanely good. I barely notice the quality drop. I know that's not the popular opinion here, but someone had to say it.

7

quick question: has anyone benchmarked this against the open source alternatives?

28

yeah, exactly. I run everything through LM Studio now, way simpler than the command line.

3

has anyone actually tried this with Copilot? Curious how it holds up.