The Fifth Elephant 2018

The seventh edition of India's best data conference

Building analytics application with streaming expressions in Apache Solr

Submitted by Amrit Sarkar (@sarkaramrit2) on Wednesday, 28 March 2018

videocam
Preview video

Technical level

Intermediate

Section

Full talk

Status

Confirmed & Scheduled

View proposal in schedule

Vote on this proposal

Login to vote

Total votes:  +2

Abstract

Apache Solr, an open source search engine project, has come a long way since its inception driving applications to have near-real time data mixed with richrelevance available to users with high availability, auto-scaling and effective failover strategy on cloud infrastructure.

Effective real-time analysis and visualization of collected and correlated data to get insights is high need for businesses. Streaming Expressions introduced in Apache Solr v 6.0 provides powerful stream language for Solrcloud. They are a suite of functions that can be combined to perform many different parallel computing tasks like aggregations, parallel relational algebra, batch processing, distributed graph traversal and related MapReduce operations and use-cases.

In Lucidworks, San Francisco California-based enterprise search technology company, we solve complex problems and implement use cases in and around search and analytics paradigm for multiple clients on huge datasets. This session will focus on challenges faced in building near-real time analytics applications on large datasets. We introduce Streaming Expressions in Apache Solr, discuss the concept and key components it is build upon. The session moves on to discuss how Streaming Expressions not only fulfills the expectations, it open doors for numerous possibilities emitting effective, valuable and meaningful analytical data with its ever growing library of functions.

Outline

  • Challenges building analytics applications with real-time data
  • Introduction to Streaming Expressions and Overview
  • Sources, Decorators and Evaluators
  • Short solutions from simple to complex use-cases optimised
  • Statistical Programming with use-case
  • Conclusion

Speaker bio

Amrit Sarkar is Search Engineer and Consultant at Lucidworks Inc, California-based enterprise search technology company, with 3+ years experience in search domain and big data, ecommerce and product.
He is an active Apache Solr Contributor for over an year.
LinkedIn: https://www.linkedin.com/in/sarkaramrit2
Blog: https://www.medium.com/@sarkaramrit2

Slides

https://www.slideshare.net/AmritSarkar1/building-analytics-applications-with-streaming-expressions-in-apache-solr-107811813

Preview video

https://youtu.be/L3-Gj8FzR5I

Comments

  • 1
    Zainab Bawa (@zainabbawa) Reviewer 8 months ago

    Amrit, without a preview video, we will not evaluate your proposal.

    • 1
      Amrit Sarkar (@sarkaramrit2) Proposer 8 months ago (edited 8 months ago)

      Thank you Zainab for the reminder, uploaded.

Login with Twitter or Google to leave a comment