slm
A list of posts tagged slm
Blogs
Notes
Responses
- Introducing Appleās On-Device and Server Foundation Models
- TinyAgent: Function Calling at the Edge
- Reproducing GPT-2 (124M) in llm.c in 90 minutes for $20
- Introducing Phi-3
- RecurrentGemma - Open weights language model from Google DeepMind, based on Griffin.
- Introducing Stable LM 2 1.6B
- Phi-2: The surprising power of small language models
- Distilling step-by-step: Outperforming larger language models with less training data and smaller model sizes