
Josh Tobin: Test-driven development for large language model applications.

Josh Tobin is the founder and CEO of Gantry. Previously, Josh worked as a deep learning & robotics researcher at OpenAI and as a management consultant at McKinsey. He is also the creator of Full Stack Deep Learning, the first course focused on the emerging engineering discipline of production machine learning. Josh did his PhD in Computer Science at UC Berkeley advised by Pieter Abbeel.


Test-driven development for large language model applications.

Large language models are a powerful primitive for building applications quickly and easily. However, when it comes to robustness, reliability, and production readiness, they leave something to be desired.

If you've built applications with LLMs, you may have wondered: "Isn't it a bit generous to call this prompt engineering?", "How do I know if this thing is actually working?", or "Is it even possible to test these things?"

In this talk, we will present a more principled way to develop LLM applications using an approach that is analogous to test-driven development. We'll also show you how to get started with this approach in minutes using Gantry.

