Feb 2026
23 Mon
24 Tue
25 Wed
26 Thu
27 Fri 09:30 AM – 05:00 PM IST
28 Sat 09:30 AM – 05:00 PM IST
1 Sun
Submitted Jan 5, 2026
It’s easy to ship a magical agent demo. It’s much harder to ship an agent that works for real users: noisy inputs, partial context, flaky tools, ambiguous goals, and “tiny prompt changes” that break everything.
In this hands-on workshop, we’ll build a small but realistic agent in Python + DSPy, then turn it into something you can actually run in production: structured I/O, tool contracts, tracing, evals, and automatic optimisation.
You’ll leave with a concrete engineering workflow (an “agent improvement loop”) that you can take back to your team:
instrument → collect failures → convert to evals → optimise → ship via CI.
The techniques are framework-agnostic; we’ll use DSPy because it makes optimisation and modularity explicit in code.
Anatomy of a production agent (15 min)
Agent loop, tool contracts, ground-truth checks, stop conditions, failure modes.
Build the agent in DSPy (60 min)
Signatures + modules, tool wiring, structured outputs, error handling.
Observability & evals (45 min)
Tracing, failure buckets, creating an eval set from real-ish cases, measuring baseline.
Optimisation (45 min)
Few-shot baselines → DSPy optimiser run → compare metrics + inspect deltas.
Shipping the improvement loop (15 min)
Minimal CI pattern: run evals on PR, regressions gate merges, version prompts/programs.
The workshop skeleton and requirements can be found in this repo: https://github.com/unravel-team/real-agents-workshop
Kiran Kulkarni is the founder of Unravel.tech, where he helps teams build production-grade AI systems—agentic workflows, evaluation pipelines, and reliability/observability practices. He’s been a founding engineer and engineering leader across data + AI systems and loves turning “cool demos” into software that survives real users.
Utkarsh Dighe is a senior engineer at Unravel.tech, where he designs and builds pragmatic solutions using Agentic AI to tackle complex problems across domains. He takes an engineering-first approach, focusing on reliability, robustness, and scalability—ensuring systems don’t just work in theory, but hold up under real-world usage.
Hosted by
Supported by
Platinum sponsor
{{ gettext('Login to leave a comment') }}
{{ gettext('Post a comment…') }}{{ errorMsg }}
{{ gettext('No comments posted yet') }}