Benchmarks

Data-driven performance comparisons for small language models fine-tuned with distil labs.