The Fifth Elephant 2015
A conference on data, machine learning, and distributed and parallel computing
Jul 2015
16 Thu 08:30 AM – 06:35 PM IST
17 Fri 08:30 AM – 06:30 PM IST
18 Sat 09:00 AM – 06:30 PM IST
Rajesh Balamohan
Talk about the present and future of Apache Tez.
Apache Tez is a framework for building data-flow driven processing runtimes. It provides scaffolding and library components that can be used to quickly build scalable and efficient data-flow centric engines. This talk will cover the journey of Tez from a concept in the Apache Incubator to the cornerstone of well-known Hadoop ecosystem projects such as Apache Hive and Apache Pig. I will then move on to the future of Tez: how it is being improved to make it easier to build data processing applications that run in single-digit seconds and/or scale to petabytes of data.
Rajesh Balamohan has been working on Hadoop for the last couple of years and has recently been concentrating on Tez performance at scale.