The Fifth Elephant 2016

India's most renowned data science conference

Anand Katti

@anandkatti

High performance computing using Spark

Submitted Apr 29, 2016

Spark has revolutionized the way Big data computation are done. It provides efficient way of distributed data processing computation. In this session, I will cover our experience of implementing a large scale big data platform (> 100 TB) using Spark and challenges faced/lessons learned

Outline

Spark has revolutionized the way Big data computation are done. It provides efficient way of distributed data processing computation. In this session, I will cover our experience of implementing a large scale big data platform (> 100 TB) using Spark and challenges faced/lessons learned

Speaker bio

Over 17 years of IT industry experience in Data technologies
More than 3+ years of experience in Big Data
Extensive experience in Hadoop, Spark & NOSQL
Architected and delivered multiple end to end Big Data projects

Comments

{{ gettext('Login to leave a comment') }}

{{ gettext('Post a comment…') }}
{{ gettext('New comment') }}
{{ formTitle }}

{{ errorMsg }}

{{ gettext('No comments posted yet') }}

Hosted by

Jump starting better data engineering and AI futures