The Fifth Elephant 2012

Finding the elephant in the data.

What are your users doing on your website or in your store? How do you turn the piles of data your organization generates into actionable information? Where do you get complementary data to make yours more comprehensive? What tech, and what techniques?

The Fifth Elephant is a two day conference on big data.

Early Geek tickets are available from fifthelephant.doattend.com.

The proposal funnel below will enable you to submit a session and vote on proposed sessions. It is a good practice introduce yourself and share details about your work as well as the subject of your talk while proposing a session.

Each community member can vote for or against a talk. A vote from each member of the Editorial Panel is equivalent to two community votes. Both types of votes will be considered for final speaker selection.

It’s useful to keep a few guidelines in mind while submitting proposals:

  1. Describe how to use something that is available under a liberal open source license. Participants can use this without having to pay you anything.

  2. Tell a story of how you did something. If it involves commercial tools, please explain why they made sense.

  3. Buy a slot to pitch whatever commercial tool you are backing.

Speakers will get a free ticket to both days of the event. Proposers whose talks are not on the final schedule will be able to purchase tickets at the Early Geek price of Rs. 1800.

Hosted by

The Fifth Elephant - known as one of the best data science and Machine Learning conference in Asia - has transitioned into a year-round forum for conversations about data and ML engineering; data science in production; data security and privacy practices. more

Biswajit Pal

“I know what you are going to do next summer” – Predicting Repeat Purchase Behavior by using Bayesian Hierarchical model and Regression Techniques

Submitted Jun 29, 2012

A major question in database marketing is that of identifying the customers who are most likely to make a repeat purchase in the near future. We will discuss a scalable repeat purchase scoring algorithm that assign a propensity score i.e. probability to transact in next ‘n’ period of time for each customer. This algorithm presently scores an entire database containing around 100MM customers in couple of hours and thus has the potential to be used in a Big Data scenario.

Outline

In the context of targeted marketing to consumers, the ability to tell which customers are more likely than others to make a purchase with HP in the near future greatly enhances effectiveness of any marketing campaign. It helps to rank customers on their propensity to re-purchase, and leads to preferential treatment of the right customers. It also reduces the likelihood of bombarding customers, who are less likely to purchase, with marketing material (over email or postal mail), possibly alienating them from future interest in HP.

The propensity to make a repeat purchase depends on two parameters unique to each customer, the probability of churn and the frequency of transactions. The customer repeat purchase modeling framework we propose, based on a Regression based approximation to a Bayesian hierarchical model, answer these questions. Using the answer to these questions as inputs we predicted the likelihood of a customer making a transaction within a time span into the future (e.g. in the next six months).

Our algorithm can score massive databases for repeat purchase. Further, since it uses only transaction data, it is readily applicable to a wide array of customer segments across different business units

Speaker bio

Biswajit Pal currently holds the position of Project Manager in Customer Intelligence team which is a part of Global Analytics organization within HP. He has 6 years of experience in the areas of Predictive Analytics, Data mining, statistical modeling across verticals like Healthcare, retail and IT and holds a Post Graduate degree in Applied Statistics and Informatics from IIT Bombay.

Subhasish Mishra currently holds the position of Project Manager in strategy team which is a part of Global Analytics organization within HP. He has 5 years of experience in the areas of Predictive Analytics, Data mining, statistical modeling and holds a Post Graduate degree in Economics from Delhi School of Economics.

Manav Shroff is an analytics professional with 11 years of global experience across Technology, Financial Services, Banking and Insurance industries. He is currently leading a team of analytics practitioners supporting HP’s Corporate Customer Intelligence group, providing support in the areas of Customer Analytics (B2C and B2B), Web Analytics, and Market Research, Emerging Markets Strategy, Knowledge Management and Campaign Operations. Manav holds a MBA degree and also is a GE certified green belt and lean coach.

Comments

{{ gettext('Login to leave a comment') }}

{{ gettext('Post a comment…') }}
{{ gettext('New comment') }}
{{ formTitle }}

{{ errorMsg }}

{{ gettext('No comments posted yet') }}

Hosted by

The Fifth Elephant - known as one of the best data science and Machine Learning conference in Asia - has transitioned into a year-round forum for conversations about data and ML engineering; data science in production; data security and privacy practices. more