
Tuning Small LLMs for Fast, Tool-Using Agents: Qwen3-4B + Ollama + Strands (with Rationale)
A practical, opinionated recipe for making a 4B model feel snappy while still using tools reliably—now with the WHY behind each choice and a guide to what small tool-using agents can actually do.