Build a PPT AI image analysis question answering system with Granite vision model

TomorrowApr 2025

31 Mon

1 Tue

2 Wed

3 Thu

4 Fri 01:45 PM – 06:10 PM IST

5 Sat

6 Sun

Nutanix Technologies India Pvt Ltd, Bengaluru

All submissions

This submission has been added to the schedule

Preview video

Build a PPT AI image analysis question answering system with Granite vision model

Submitted Mar 29, 2025

Choose the topic your submission falls under: Real-life uses of problems AI tech solves Type of session: Tutorial (lecture style) I am submitting for: Blr OSAI meetup in April 2025

Abstract

The Granite Vision model, part of IBM’s Granite family of open-source foundation models, enables state-of-the-art image analysis and understanding. With over 2 billion parameters, the model is designed for high-performance computer vision tasks, including object detection, scene understanding, and visual question answering. This session demonstrates how to harness the power of the Granite Vision model to build a PowerPoint (PPT) AI image analysis and question-answering system. By combining advanced vision capabilities with natural language processing, this system automates insights extraction from presentation slides, offering immense value to startups, enterprises, and educators.

For the leadership perspective, this solution provides tangible benefits, such as automating repetitive slide review tasks, saving valuable time, and enabling faster decision-making. By leveraging cutting-edge AI for actionable insights from unstructured visual data, startups can drive innovation and gain a competitive edge in their industries.

Concepts for Building the PPT AI Analyzer

Leverage IBM’s Granite Vision model with 2 billion parameters for advanced image analysis tasks.
Extract and classify visual elements like charts, graphs, and tables from PowerPoint slides.
Integrate AI-driven image recognition with natural language processing for question answering.
Automate insights extraction from presentation slides to streamline workflows.

Takeaways

Learn how the Granite Vision model revolutionizes image analysis and question answering for enterprise use cases.
Understand the architecture and practical implementation of a scalable AI-powered system for analyzing PowerPoint presentations.
Gain insights into real-world applications and how entrepreneurs can adopt this solution to drive innovation and efficiency.

Which Audiences is Your Session Going to Benefit?

Organization Leadership: Gain strategic insights into leveraging AI for business innovation and efficiency.
AI Engineers and Developers: Understand the technical architecture and implementation details.
Educators and Analysts: Explore automated tools for interactive content creation and report analysis.

Additional Resources

Granite Vision Model on Hugging Face: Explore IBM’s state-of-the-art Granite Vision model with 2 billion parameters, designed for advanced image analysis and multimodal tasks. Explore Granite Vision Model on Hugging Face

IBM Tutorial: PPT AI Analyzer: Step-by-step guide to building a PowerPoint AI Analyzer with the Granite Vision model. Walk through the full tutorial - Build a PPT AI Analyzer

Speaker’s Bio

Vrunda Gadesha - AI Adovate | IBM

She is a Data Scientist, Ph.D. scholar, and AI enthusiast with expertise in Large Language Models, Natural Language Processing, Machine Learning, and technical content creation. Skilled in Python Programming, she has led AI solution development and shared her knowledge through academic writing and corporate training. She is passionate about advancing AI and data science and is committed to continuous learning and impactful innovation.

All submissions