Decoding AI Magazine

AI Evals & Observability

No Evals Dataset? Here's How to Build One from Scratch
Build evaluators to signal problems that users actually care about. Step-by-step guide.
12 hrs ago • Paul Iusztin
Stop Vibe Checking Your AI App
The holistic guide to integrating AI Evals: From optimization to production monitoring
Feb 10 • Paul Iusztin
Behind the Scenes of AI Observability in Production
What actually works after 6 months of trial and error
Feb 3 • Alejandro Aboy
Stop Launching AI Apps Without This Framework
A practical guide to building an eval-driven loop for your LLM app using synthetic data, before you have users.
Oct 30, 2025 • Hugo Bowne-Anderson
Escaping POC Purgatory: Evaluation-Driven Development for AI Systems
A new software development life cycle for LLMs
Oct 16, 2025 • Hugo Bowne-Anderson and Stefan Krawczyk
The 5-Star Lie: You Are Doing AI Evals Wrong
Why binary evals are better than Likert scales
Sep 20, 2025 • Hamel Husain
The Mirage of Generic AI Metrics
Why off-the-shelf evals sabotage your AI product
Sep 13, 2025 • Hamel Husain
© 2026 Paul Iusztin