The Fifth Elephant OSAI meet-up - Hyderabad edition

Shruti Dhavalikar

@shrutidhavalikar

Evaluating Agentic Applications in the SDLC: Ensuring Reliability with OSAI

Submitted Sep 30, 2025


Abstract

Agentic applications, built using open-source large language models (LLMs) and frameworks, are redefining how we approach collaborative software development and intelligent workflows. Yet, their non-deterministic nature raises critical challenges for evaluation and testing. How do we ensure correctness, reliability, and consistency when working with inherently probabilistic systems?

This talk will showcase practical evaluation strategies for agentic applications within the software development lifecycle (SDLC), using a conversational agent built with open-source models as a running use case. We will break down the agent into its core components—query understanding, data orchestration, tool invocation, and response synthesis—and demonstrate methods to design deterministic evaluation frameworks around stochastic behaviours.
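
For instance, a step such as tool invocation can be pinned down with ordinary unit tests that assert on the structured decision the component emits, rather than on free-form model text. Below is a minimal sketch of that idea in Python; the names `invoke_agent_step` and `ToolCall` are hypothetical stand-ins, not the talk's actual framework.

```python
# A minimal sketch of a deterministic test around a stochastic
# component: we assert on the structured tool call the step emits,
# not on free-form model text. `invoke_agent_step` and `ToolCall`
# are hypothetical names; a real implementation would wrap an LLM
# call with temperature 0 and a constrained JSON output schema.
from dataclasses import dataclass


@dataclass
class ToolCall:
    name: str
    arguments: dict


def invoke_agent_step(query: str) -> ToolCall:
    # Stand-in for the agent's tool-invocation component.
    if "revenue" in query.lower():
        return ToolCall(name="run_sql", arguments={"table": "sales"})
    return ToolCall(name="web_search", arguments={"q": query})


def test_tool_selection_is_stable():
    # Repeat the call: the structured decision should be identical
    # across runs even when any free-text fields vary.
    calls = [invoke_agent_step("What was Q3 revenue?") for _ in range(5)]
    assert all(c.name == "run_sql" for c in calls)
    assert all(c.arguments["table"] == "sales" for c in calls)
```

Tests of this shape run deterministically under pytest even though the component they wrap is probabilistic, because the assertions target the structured output contract rather than the surface text.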


Key Takeaways

  • Practical methods to evaluate open-source agentic applications at the component and system level
  • Metrics beyond accuracy: goal completion, grounding, latency, and consistency (a rough sketch follows this list)
  • How to embed evaluation into the SDLC testing cycle, ensuring robustness from development to deployment
  • Lessons learned from real-world use cases of open-source agentic frameworks
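
To make the metrics takeaway concrete, here is a rough sketch of how such measures might be aggregated over repeated runs of the same query. The `AgentRun` record and the scoring rules are illustrative assumptions, not the talk's actual framework.

```python
# Illustrative metric aggregation over repeated runs of one query.
# The AgentRun record and the scoring rules below are assumptions
# made for this sketch.
import statistics
from dataclasses import dataclass


@dataclass
class AgentRun:
    goal_achieved: bool           # did the agent finish the user's task?
    cited_sources: list[str]      # sources the answer claims to use
    retrieved_sources: list[str]  # sources the tools actually fetched
    latency_s: float              # end-to-end wall-clock seconds
    answer: str


def grounding_rate(run: AgentRun) -> float:
    # Fraction of cited sources that were genuinely retrieved: a
    # crude proxy for whether the answer is grounded in evidence.
    if not run.cited_sources:
        return 0.0
    hits = sum(s in run.retrieved_sources for s in run.cited_sources)
    return hits / len(run.cited_sources)


def report(runs: list[AgentRun]) -> dict:
    answers = [r.answer for r in runs]
    return {
        "goal_completion": sum(r.goal_achieved for r in runs) / len(runs),
        "grounding": statistics.mean(grounding_rate(r) for r in runs),
        "mean_latency_s": statistics.mean(r.latency_s for r in runs),
        # Consistency: share of runs that produced the modal answer.
        "consistency": answers.count(statistics.mode(answers)) / len(answers),
    }
```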

Target Audience

  • Data scientists, AI/ML Engineers & Researchers
  • Architects working with AI and agentic use cases
  • Open Source AI model evaluators/explorers
  • AI enthusiasts exploring agentic applications

Prerequisites

  • Basic knowledge of Python
  • Basic understanding of SQL, APIs
  • Interest in building or maintaining production-grade AI applications (who isn't? :P)

Whether you’re building autonomous agents or conversational assistants, this session will equip you with the tools and frameworks to test open-source AI models with confidence in a world of unpredictability.


Speaker Bio

Shruti Dhavalikar is a Data Scientist at Sahaj Software with over six years of experience in building data-driven solutions. She specialises in transforming complex datasets into actionable business insights and has led end-to-end product cycles within Agile environments. Her work emphasises scalable and robust development practices across diverse technology stacks. In addition to her industry contributions, she engages in applied research aligned with real-world challenges and has presented and published her work at international conferences. Outside of work, she nurtures a keen interest in cosmology and space, and enjoys discovering new cuisines as an avid travelling foodie.
