Agentic debugging with auto-heal in long-running workflows

Jul 2026

27 Mon

28 Tue

29 Wed

30 Thu

31 Fri 09:00 AM – 06:00 PM IST

1 Sat

2 Sun

All submissions

Previous Next

Preview video

Agentic debugging with auto-heal in long-running workflows

Submitted Jun 24, 2026

I am submitting for: Track 2 - Building & implementing AI tools & agents in production Type of session: 30 mins talk

Describe your session in 2 paragraphs

This talk shares practical lessons from building an agentic AI system that does deep technical investigations of production failures spanning several services, long-running jobs and data streaming pipelines. Post failure identification, how to apply ‘data fixes’ to mitigate failures, before a permanent fix is rolled out?

We’ll share what worked for us and (more importantly) what didn’t in designing systems that enable LLMs to reason over complex systems, combining structured data with dynamic tool use. We’ll talk about how to balance accuracy, extensibility and cost. After all, tokens aren’t free! 🙂

Mention 1-2 takeaways from your session

How agents can be used as peer engineers for debugging production incidents? (To free up human’s time for more complex/creative tasks)
How an agent can reduce MTTM and MTTR for your customers in ways humans can’t.

Previous Next

Comments

Jul 2026

27 Mon

28 Tue

29 Wed

30 Thu

31 Fri 09:00 AM – 06:00 PM IST

1 Sat

2 Sun

Hosted by

The Fifth Elephant

Jumpstart better data engineering and AI futures

Speak at The Fifth Elephant 2026 Annual Conference

Agentic debugging with auto-heal in long-running workflows

Describe your session in 2 paragraphs

Mention 1-2 takeaways from your session

Which audiences is your session going to beneficial for?

Add your bio - who you are; where you work

Link to draft slides - PDF/PPT - with comments access

Link to 2-min elevator pitch video

Comments