Talk @ July 16 - Paris AI, ML and Computer Vision Meetup
https://www.meetup.com/paris-ai-machine-learning-and-computer-vision-meetup/events/308656827/
I talked about why massive LLMs often make real-world applications harder, not better: high latency, high cost, limited controllability, and surprisingly little actual domain knowledge. In contrast, small language models (SLMs) give us a new kind of superpower: we can inject knowledge, align behavior, and actually understand what’s going on under the hood.
I walked through our full SLM workflow, including how we use Spectrum, DistillKit, and MergeKit to build domain-adapted models that are fast, fine-tuned, and surprisingly capable. And of course, we couldn’t skip a peek at AFM-4.5B, our newest foundation model, designed from the ground up to be lean, competitive, and extensible. It’s already taking on models many times its size… and winning.
www.arcee.ai