The Fifth Elephant 2019

Gathering of 1000+ practitioners from the data ecosystem

Using Apache Nifi to manage a real time master data foundation @ Nike

Submitted by Vidyasagar Reddy (@vsreddy07) on Friday, 14 June 2019

Session type: Short talk of 20 mins Status: Rejected


Nike has a wide variety of systems in the enterprise landscape. All these systems produce data in different shapes and sizes. We are building theNike data foundation so that we meet the below goals.

1.Deliver trusted, accurate, timely and consistent information and insights to the business.
2.Enable governing data as an asset and sharing data at scale.
3.Enable faster and better informed business decisions resulting in serving our customers and consumers more efficiently.

One key aspect in the enterprise data foundation is master data. It is very important that we maintain a single source of truth for master data to address issues around consistency, authenticity and governance. It is also important that updates to master data are visible to consumers as soon as possible.This talk is about Nike’s journey in building a real time master data management platform using Apache Nifi. We will talk about the various pros and cons of the technologies we considered and why we made certain decisions given the overall goals of the program.


  1. What is master data?
  2. What are the various types of master data?
  3. What is the existing situation without a unified platform for master data?
  4. What technologies were evaluated?
  5. Why we have chosen Apache Nifi?
  6. What is Nifi well suited for?
  7. How did we develop and integrate this with our CI/CD pipelines?

Speaker bio

Vidya is currently part of Nike, Enterprise Data and Analytics organisation, leading teams in the space of Enterprise master data management and data science engineering.



Preview video


  • Abhishek Balaji (@booleanbalaji) 11 months ago

    Hi Vidya,

    Thank you for submitting a proposal. We need to see detailed slides and a preview video to evaluate your proposal. Your slides must cover the following:

    • Problem statement/context, which the audience can relate to and understand. The problem statement has to be a problem (based on this context) that can be generalized for all.
    • What were the tools/frameworks available in the market to solve this problem? How did you evaluate these, and what metrics did you use for the evaluation? Why did you pick the option that you did?
    • Explain how the situation was before the solution you picked/built and how it changed after implementing the solution you picked and built? Show before-after scenario comparisons & metrics.
    • What compromises/trade-offs did you have to make in this process?
    • What is the one takeaway that you want participants to go back with at the end of this talk? What is it that participants should learn/be cautious about when solving similar problems?

    We need your updated slides and preview video by Jun 27, 2019 to evaluate your proposal. If we do not receive an update, we’d be moving your proposal for evaluation under a future event.

    • Vidyasagar Reddy (@vsreddy07) Proposer 11 months ago

      Sure. I will upload the details before June 27th.

    • Vidyasagar Reddy (@vsreddy07) Proposer 11 months ago

      @booleanbalaji: I have updated the details. Please do take a look.

      • Abhishek Balaji (@booleanbalaji) 11 months ago

        Thanks, moving this for evaluation

  • Vidyasagar Reddy (@vsreddy07) Proposer 10 months ago

    @booleanbalaji: When would we get to know if this talk has been accepted or not?

Login to leave a comment