About The Fifth Elephant Conference; previous editions

  • India’s most prestigious big data, ML and data science conference since 2012.
  • A platform for practitioners to share real-life case studies of working with data, ML systems and data science - across advertising, e-commerce, OEMs, automobile industry, and platforms.
  • Relevant themes and topics each year, which helps practitioners to gain practical insights from The Fifth Elephant Conference.

The Fifth Elephant 2023 editions

The Fifth Elephant Monsoon
The Fifth Elephant Winter

Themes for 2024 edition

  1. Data engineering and data science - managing, running and optimizing data pipelines; feature engineering; open source alternatives to vendor lock-in; recommendation systems; fraud processing.
  2. Best practices for LLM training, inference and deployment; implementing security while incorporating LLMs and SLMS in organizations; working with Open Source LLM models; security, bias and risk mitigation.
  3. GenAI - Generative AI based use-cases; GenAI and hardware.

The Conference will also host workshops, tutorials, round tables and Birds of Feather (BOF) sessions for interactive participation and networking.

Speaking at The Fifth Elephant 2024

The conference follows the peer process of reviewing submissions, and followed by rehearsals with selected speakers. If you want to speak at the conference, submit your abstract and slides at https://hasgeek.com/fifthelephant/2024/sub.

Attendance

The 2024 edition will be held in-person. Attendance is open to The Fifth Elephant members only. Pick a membership to attend the in-person conference.

Who will benefit from participating in the conference and The Fifth Elephant community

  1. AI/ML/Data Science/Ops engineers who want to learn about state-of-the-art tools and techniques, especially from domains such as health care, e-commerce, automobile, agri-tech and industrial verticals
  2. Data scientists who want a deeper understanding of model deployment/governance.
  3. Architects who are building ML workflows that scale.
  4. Tech founders and CTOs who are building products and platforms that leverage data, ML and LLMs.
  5. Product managers, who want to learn about the process of building AI/ML products.
  6. Directors, VPs and senior tech leadership who are building AI/ML teams.

Sponsorship

Sponsorship slots are open for:

  1. Employer branding and visibility
  2. Infrastructure (GPU, CPU and cloud providers) and developer productivity tool makers who want to evangelise their offering to developers and decision-makers.
  3. Companies who want to do tech branding among AI and ML developers.
  4. Venture Capital (VC) firms and investors who want to scan the landscape of innovations and innovators in AI and who want to source leads for investment in the AI and ML space.

If you are interested in sponsoring The Fifth Elephant, contact sales@hasgeek.com.

Testimonials

“The Fifth Elephant has been a community of passionate and talented people in tech and is for me one of the most valuable sources of practical insights into ML and AI and related aspects” — said Manu Hegde, a member of The Fifth Elephant.

“I have been a regular attendee at Fifth Elephant events since 2016 and I always take away a lot of applied wisdom from the speakers and other practitioners I meet. Often this is about common problem but many times it’s also about topics that I was not even aware of and it definitely broadens my view. The talks are on-point and derive from real world experience with very little to no marketing fluff. The Fifth Elephant community is welcoming of feedback and very supportive to all members.” - said Sidharth Ramachandran, a member of The Fifth Elephant.

Contact information

Join The Fifth Elephant Telegram group on https://t.me/fifthel or WhatsApp group. Follow @fifthel on Twitter.
For inquiries, call The Fifth Elephant on +91-7676332020 or leave a comment here.

Hosted by

All about data science and machine learning

Supported by

Gold Sponsor

Atlassian unleashes the potential of every team. Our agile & DevOps, IT service management and work management software helps teams organize, discuss, and compl

Subhom Mitra

@subhom_mitra Author

Abhijeet Kumar

@abhijeet3922 Author

Chat with Tables: Query tabular data in English using self-hosted Large Language Models

Submitted May 15, 2024

Business users and non-technical professionals often need to quickly analyse or transform tabular data in spreadsheets for ad hoc business intelligence. However, they might lack the necessary programming knowledge to do so themselves and therefore must reach out to a data analyst. Such unexpected delays have the potential to incur huge opportunity costs for time-sensitive business decisions which must be informed by accurate analysis of data.

Generative AI powered by Large Language Models (LLMs) is being used to create novel text, images, and even videos. LLMs specialising in generating code are already being used in enterprise solutions like GitHub Copilot, Gemini Code Assist by Google, watsonx by IBM, and Amazon Q Developer (previously Amazon CodeWhisperer) to boost productivity for developers and programmers. Along the same lines, there now exist LLMs specialising in generating Structured Query Language (SQL), which is widely used across enterprise domains to manage databases and analyse and transform tabular data.

Workshop Objective

In this workshop we demonstrate how to create a web application from scratch using Streamlit and Ollama which can be used to analyse and query CSV files using natural language and the power of LLMs.

Outline

  1. Quick overview of the workshop
  2. Demo of the application
  3. Discussion on running LLMs locally for data privacy
  4. Hands-on: Setting up Ollama model server
  5. Hands-on: Setting up Streamlit and building quick interactive front-end applications
  6. Hands-on: Pipeline for using natural language prompts to transform tabular data using CSV files
  7. Hands-on: Data processing techniques like Prompt Pruning and Correcting LLM Hallucinations using Static Analysis with sqlglot
  8. Discussion on how to create generic “Chat with X” capabilities

Intended Audience

This workshop is intended for data enginners, data scientists, and researchers with basic Python experience who are working on Generative AI use-cases and want to leverage enterprise data. This might also interest business analysts or business consumers who require data querying and analysis services regularly.

Overall, any professional with at least some experience with Python programming who is interested in getting started with Gen AI will stand to benefit from this workshop since it covers both the end-to-end data pipeline as well how to prepare a demo-worthy front-end user interface.

Takeaways

  1. How to run LLMs locally or within your organisation network using Ollama
  2. How to quickly develop interactive web applications using Streamlit
  3. How to analyse tabular data in CSV format using English language queries
  4. How to create “Chat with X” applications for other data formats

Additional Reading

Here are some links to open-source and proprietary products currently available which leverage LLMs to generate SQL and power database interactions.

  1. Vanna: an MIT-licensed open-source Python RAG (Retrieval-Augmented Generation) framework for SQL generation and related functionality.
  2. Dataherald: a natural language-to-SQL engine built for enterprise-level question answering over relational data.
  3. ChatDB: build dashboards for your database with AI.
  4. DB-GPT: an open source AI native data app development framework for building infrastructure in the field of large models.

Comments

{{ gettext('Login to leave a comment') }}

{{ gettext('Post a comment…') }}
{{ gettext('New comment') }}
{{ formTitle }}

{{ errorMsg }}

{{ gettext('No comments posted yet') }}

Make a submission

Accepting submissions till 03 Jun 2024, 11:59 PM

Bangalore International Centre, Bangalore

Hosted by

All about data science and machine learning

Supported by

Gold Sponsor

Atlassian unleashes the potential of every team. Our agile & DevOps, IT service management and work management software helps teams organize, discuss, and compl