The Fifth Elephant 2015

A conference on data, machine learning, and distributed and parallel computing

Processing large data with Apache Spark

Submitted by Venkata Naga Ravi (@venkatanagaravi) on Saturday, 23 May 2015

Section: Full Talk Technical level: Intermediate Status: Confirmed & Scheduled


Overview of Apache Spark functionalities with detailed architecture details. We will touch upon Spark Streaming capability for near real time processing.


In this session, Ravi will cover Apache Spark overview with its unique features using in large data systems. He will get more details into Spark EcoSystem,Architecture, Elements and comparison with MapReduce. He will also touch up on its languages support with working demo session.

Speaker bio

Ravi is in in IT industry for 11+ years. Ravi works for Cisco as Technical Leader and part of Cisco service team .He completed MS from BITS and BE from University of Madras. He has well experience in building highly distributable systems using multi-tier architecture. His interest on exploring new technologies and tools.



Preview video


Login to leave a comment