The Fifth Elephant 2015

A conference on data, machine learning, and distributed and parallel computing

Rahul Kavale


Deprecating MapReduce Patterns with Apache Spark

Submitted May 7, 2015

Live coding demostration to show how Apache Spark can solve non trivial problems, and hence deprecating some of the established patterns of MapReduce, with consice code, giving us significant performance gain, and developer friendly programming model also keeping other sweetness of MapReduce wolrd intact.


Hadoop MapReduce programming model has evolved into having its own patterns for solving problems. This talk is trying to eloborate how those patterns can be replaced with simpler equivalent solutions with use of Apache Spark, and of course with keeping other sweetness of existing MapReduce world with advantage of performance boost. I will be using some some non trivial examples for the this with live coding for the same.

Speaker bio

Rahul is an Application developer with Thoughtworks. Rahul has worked with technologies ranging from Scala, Java, Ruby. He has experience in builiding web applications to solving big data problems. Rahul has special interest in Scala, Apache Spark. He loves functional programming.


{{ gettext('Login to leave a comment') }}

{{ gettext('Post a comment…') }}
{{ gettext('New comment') }}
{{ formTitle }}

{{ errorMsg }}

{{ gettext('No comments posted yet') }}

Hosted by

All about data science and machine learning