The Fifth Elephant 2016

India's most renowned data science conference

High performance computing using Spark

Submitted by Anand Katti (@anandkatti) on Apr 29, 2016

Technical level: Intermediate
Status: Submitted

Abstract

Spark has revolutionized the way big data computations are done. It provides an efficient way to perform distributed data processing. In this session, I will cover our experience of implementing a large-scale big data platform (> 100 TB) using Spark, along with the challenges we faced and the lessons we learned.


Speaker bio

Over 17 years of IT industry experience in data technologies
More than 3 years of experience in Big Data
Extensive experience in Hadoop, Spark, and NoSQL
Architected and delivered multiple end-to-end Big Data projects
