Speak at The Fifth Elephant 2026 Annual Conference
Share you work with the community
Jul 2026
27 Mon
28 Tue
29 Wed
30 Thu
31 Fri 09:00 AM – 06:00 PM IST
1 Sat
2 Sun
Narayana Sastry
Submitted Jun 26, 2026
Research teams are bottlenecked on the number of experiments they can run. A team’s research output can be roughly formalized as (# of people) x (# of experiments) x (research taste). With fixed headcount, the question becomes: how do we reduce the marginal cost of running one more experiment, without lowering the quality of judgment behind what gets tested?
ML teams generate ideas at every stage of the lifecycle: feasibility, accuracy, latency, deployment, and business-impact evaluation. The constraint is not lack of ideas; it is the cost of testing them, and the lack of shared visibility into what teammates are already trying. Project Nexus addresses this by adapting Hugging Face’s ML-Intern into a 24x7, always-on, asynchronous ML agent that can autonomously execute experiments on team infrastructure. A researcher submits an idea in natural language, and Nexus reviews relevant literature, writes training and evaluation code, runs GPU jobs, iterates on failures, benchmarks results, pushes code, and documents the experiment where the team can reuse it.
In two weeks of production use by eight engineers, Nexus ran 27 distinct experiments across about 40 sessions. Turnaround per idea dropped from roughly two weeks to hours, often overnight; cost per substantive experiment fell to $7-$22 of compute. Of 27 ideas, 20 succeeded and 7 were parked with evidence. The wins translated into 3 patent pursuits, 3 production-model improvements, 6 product-capability expansions, and 8 new capabilities.
The talk is not just a demo. We explain how we adapted Hugging Face’s open-source ML-Intern agent for an enterprise environment, then focus on the harder production lessons: what broke in long autonomous runs, the reliability and observability fixes we had to add, and the active research questions that remain.
| Criteria | HF ML-Intern | NetApp Nexus |
|---|---|---|
| Infrastructure | HF Jobs, public APIs, HF repos | On-prem Multi-Node GPU, internal LLM service |
| Context | HF Hub + GitHub | Enterprise context (Code repo, document) + Web (Arxiv, HF MCP/CLI) |
| Collaboration | Single-user CLI | Multi-user system: one URL, single queue, and team-visible history |
| Governance | Public artifacts as-is | Enterprise-approved artifacts backed by AD |
| Memory | No persistent memory | Learns from past experiments and recommends next ideas |
Narayana Sastry is a Staff Data & Applied Scientist at NetApp on the Research and Data Science team. The team builds AI-powered governance and security products for compliance and cyber-resilience, spanning classical ML, Generative AI, and AI agents for real-world data governance and security challenges.
Sastry leads data governance initiatives, with interests in AI security, multimodal file support, and high-ROI uses of AI to accelerate development. They adapted Hugging Face’s ML-Intern agent for NetApp’s enterprise infrastructure as Project Nexus, a shared autonomous ML research service used by the team to test ideas across the ML lifecycle on internal compute.
{Add the link to draft slides - PDF/PPT - with comments access}
{Add the link to 2-min elevator pitch video}
{{ gettext('Login to leave a comment') }}
{{ gettext('Post a comment…') }}{{ errorMsg }}
{{ gettext('No comments posted yet') }}