Vrunda Gadesha

Vrunda Gadesha

@vrunda91

Build a PPT AI image analysis question answering system with Granite vision model

Submitted Mar 29, 2025

Abstract

The Granite Vision model, part of IBM’s Granite family of open-source foundation models, enables state-of-the-art image analysis and understanding. With over 2 billion parameters, the model is designed for high-performance computer vision tasks, including object detection, scene understanding, and visual question answering. This session demonstrates how to harness the power of the Granite Vision model to build a PowerPoint (PPT) AI image analysis and question-answering system. By combining advanced vision capabilities with natural language processing, this system automates insights extraction from presentation slides, offering immense value to startups, enterprises, and educators.

For the leadership perspective, this solution provides tangible benefits, such as automating repetitive slide review tasks, saving valuable time, and enabling faster decision-making. By leveraging cutting-edge AI for actionable insights from unstructured visual data, startups can drive innovation and gain a competitive edge in their industries.

Concepts for Building the PPT AI Analyzer

  • Leverage IBM’s Granite Vision model with 2 billion parameters for advanced image analysis tasks.
  • Extract and classify visual elements like charts, graphs, and tables from PowerPoint slides.
  • Integrate AI-driven image recognition with natural language processing for question answering.
  • Automate insights extraction from presentation slides to streamline workflows.

Takeaways

  • Learn how the Granite Vision model revolutionizes image analysis and question answering for enterprise use cases.
  • Understand the architecture and practical implementation of a scalable AI-powered system for analyzing PowerPoint presentations.
  • Gain insights into real-world applications and how entrepreneurs can adopt this solution to drive innovation and efficiency.

Which Audiences is Your Session Going to Benefit?

  • Organization Leadership: Gain strategic insights into leveraging AI for business innovation and efficiency.
  • AI Engineers and Developers: Understand the technical architecture and implementation details.
  • Educators and Analysts: Explore automated tools for interactive content creation and report analysis.

Additional Resources

  • Granite Vision Model on Hugging Face: Explore IBM’s state-of-the-art Granite Vision model with 2 billion parameters, designed for advanced image analysis and multimodal tasks. Explore Granite Vision Model on Hugging Face
  • IBM Tutorial: PPT AI Analyzer: Step-by-step guide to building a PowerPoint AI Analyzer with the Granite Vision model. Walk through the full tutorial - Build a PPT AI Analyzer

Speaker’s Bio

Vrunda Gadesha - AI Adovate | IBM

She is a Data Scientist, Ph.D. scholar, and AI enthusiast with expertise in Large Language Models, Natural Language Processing, Machine Learning, and technical content creation. Skilled in Python Programming, she has led AI solution development and shared her knowledge through academic writing and corporate training. She is passionate about advancing AI and data science and is committed to continuous learning and impactful innovation.

Comments

Login to leave a comment

No comments posted yet

Hosted by

Jump starting better data engineering and AI futures

Supported by

Meet-up sponsor

Nutanix is a global leader in cloud software, offering organizations a single platform for running apps and data across clouds.

Community sponsor