8 Comments
Hugo Bowne-Anderson's avatar

Thanks for yet another wonderful collaboration, Paul!

I hope that it helps your audience build more reliable AI-powered software :)

Paul Iusztin's avatar

Great collab as always, Hugo! Hehe, I hope so as well. It's hard to get into this Eval-Driven mindset.

Daniel Popescu / ⧉ Pluralisk's avatar

This piece really resonated with me. The 'POC Purgatory' is such an accurate description; it’s a problem I've seen too often. Your emphasis on Evaluation-Driven Development is spot on. It truly feels like we're still figuring out the engineering principles for robust LLM apps. A fantastic read.

Hugo Bowne-Anderson's avatar

I'm so glad it resonated, Daniel, and thank you for your kind words!

Meenakshi NavamaniAvadaiappan's avatar

Thanks for the good 😊

Ricardo Heredia's avatar

Amazing collab between you guys!

It's really helpful to read about these kinds of approaches for setting best practices that help build meaningful products beyond demos.

Thanks for sharing!

Paul Iusztin's avatar

Agree! Evaluation-driven design is probably the future of software as we start integrating more AI into it.

Also glad you enjoy this “Hugo” month 🤟😂

Adam's avatar

"vibe something to start, then add more tests/use cases to hit, see if you hit them, if not, figure out why, fix those problems, keep running those tests/evals, repeat", simple!