BEGIN:VCALENDAR VERSION:2.0 PRODID:-//HasGeek//NONSGML Funnel//EN DESCRIPTION:Finding the elephant in the data. NAME:The Fifth Elephant 2012 REFRESH-INTERVAL;VALUE=DURATION:PT12H SUMMARY:The Fifth Elephant 2012 TIMEZONE-ID:Asia/Kolkata X-PUBLISHED-TTL:PT12H X-WR-CALDESC:Finding the elephant in the data. X-WR-CALNAME:The Fifth Elephant 2012 X-WR-TIMEZONE:Asia/Kolkata BEGIN:VEVENT SUMMARY:Crunching big data\, Google scale DTSTART;TZID=Asia/Kolkata;VALUE=DATE-TIME:20120727T093000 DTEND;TZID=Asia/Kolkata;VALUE=DATE-TIME:20120727T101500 DTSTAMP;VALUE=DATE-TIME:20200811T123622Z UID:session/B3ttxjQExvj97SYoKRRN3n@hasgeek.com CATEGORIES:Big Data Infrastructure & Processing,Beginner,Lecture CREATED;VALUE=DATE-TIME:20190704T130219Z DESCRIPTION:Whether you are an e-commerce startup or a genomics research l ab or plan to run products at the scale of Gmail\, Youtube or Adwords\, yo u generate gigabytes of data every day if not every hour. Your storage req uirements run into tera or peta bytes and you may need thousands if not hu ndreds of thousands of CPU cores to process that data. At Google\, we hav e developed several in-house tools and techniques to be able to process da ta at scale. Recently we made several of these tools available externally. In this talk we will go over some of our learnings on big data\, and disc uss techniques with case studies for crunching big data. GEO:12.9431582;77.5964488824009 LAST-MODIFIED;VALUE=DATE-TIME:20190704T130219Z LOCATION:Audi 1 - Nimhans Convention Centre\nBangalore\, IN URL:https://hasgeek.com/fifthelephant/2012/schedule/crunching-big-data-goo gle-scale-B3ttxjQExvj97SYoKRRN3n BEGIN:VALARM ACTION:display DESCRIPTION:Crunching big data\, Google scale in Audi 1 in 5 minutes TRIGGER:-PT5M END:VALARM END:VEVENT BEGIN:VEVENT SUMMARY:In Data We Believe DTSTART;TZID=Asia/Kolkata;VALUE=DATE-TIME:20120727T101500 DTEND;TZID=Asia/Kolkata;VALUE=DATE-TIME:20120727T110000 DTSTAMP;VALUE=DATE-TIME:20200811T123622Z UID:session/7eYQRvVzq8StTgDSAJ1Ji9@hasgeek.com CATEGORIES:Big Data Infrastructure & Processing,Beginner,Discussion CREATED;VALUE=DATE-TIME:20190704T130241Z DESCRIPTION:- as tech matures\, the steady state has to be with data\n- da ta (owned or mined or both) is the basis of making decisions\n- the qualit y of decisions is highly influenced by the quality of\n data both in term s of accuracy and timeliness\n- the "big data" meme creates opportunities for algorithmic and\n hueristic research which can lead to products and s ervices that\n cannot be possible with any conventional methods\n- the on ly cost effective means to get to these abilities is if the\n underlying architecture of hardware and software are both standards\n based and open source.\n- the ability to derive simple insights from very large amounts of\n data (for example fractals) GEO:12.9431582;77.5964488824009 LAST-MODIFIED;VALUE=DATE-TIME:20200619T062515Z LOCATION:Audi 1 - Nimhans Convention Centre\nBangalore\, IN URL:https://hasgeek.com/fifthelephant/2012/schedule/in-data-we-believe-7eY QRvVzq8StTgDSAJ1Ji9 BEGIN:VALARM ACTION:display DESCRIPTION:In Data We Believe in Audi 1 in 5 minutes TRIGGER:-PT5M END:VALARM END:VEVENT BEGIN:VEVENT SUMMARY:A Herd of Elephants - Navigating the Hadoop Ecosystem DTSTART;TZID=Asia/Kolkata;VALUE=DATE-TIME:20120727T113000 DTEND;TZID=Asia/Kolkata;VALUE=DATE-TIME:20120727T121500 DTSTAMP;VALUE=DATE-TIME:20200811T123622Z UID:session/A2yFqFwTG5QJWdihsmdGmm@hasgeek.com CATEGORIES:Big Data Infrastructure & Processing,Intermediate,Tutorial CREATED;VALUE=DATE-TIME:20190704T130307Z DESCRIPTION:Hadoop has matured to point where it is not longer just one pr oject but a bunch of projects ranging from getting data onto the cluster t o processing and analyzing data to managing the cluster itself. I will be talking from my personal experiences from setting up a hadoop cluster at I nmobi that processes 10TB+ of Data per day (and growing). The several Hado op clusters in Inmobi are spread over multiple datacenters across continen ts.\n\nProjects that will be covered in some detail include Hadoop (HDFS a nd Mapreduce)\, Hive\, HBase\, Pig\, Mahout\, Scribe \, Zookeeper and Oozi e/Azkhaban. GEO:12.9431582;77.5964488824009 LAST-MODIFIED;VALUE=DATE-TIME:20200619T062515Z LOCATION:Audi 1 - Nimhans Convention Centre\nBangalore\, IN URL:https://hasgeek.com/fifthelephant/2012/schedule/a-herd-of-elephants-na vigating-the-hadoop-ecosystem-A2yFqFwTG5QJWdihsmdGmm BEGIN:VALARM ACTION:display DESCRIPTION:A Herd of Elephants - Navigating the Hadoop Ecosystem in Audi 1 in 5 minutes TRIGGER:-PT5M END:VALARM END:VEVENT BEGIN:VEVENT SUMMARY:An Introduction to Data Visualization & Processing ( Programming L anguage) DTSTART;TZID=Asia/Kolkata;VALUE=DATE-TIME:20120727T113000 DTEND;TZID=Asia/Kolkata;VALUE=DATE-TIME:20120727T121500 DTSTAMP;VALUE=DATE-TIME:20200811T123622Z UID:session/EigQEXvJ3tXfWdXT6WjvDo@hasgeek.com CATEGORIES:Data Visualization,Beginner,Tutorial CREATED;VALUE=DATE-TIME:20190704T130450Z DESCRIPTION:The talk is going to be split into two parts (of 15-20 mins ea ch). \n\nFirst part covers\, Data & Visualization. Where I would try to an swer ( with examples ) questions such as: Is Data just numbers? What is th e Purpose of a Visualization? Does context play a role in choosing the typ e of visualization?\n\nSecond part covers\, a basic introduction to a prog ramming language called Processing. Where\, the available methods\, librar ies and export options like Android\, Web ( Applet/Processing.js)\, etc ar e explained. I would like to conclude my talk with an example/case study o f a visualization that I have done using Processing. \n\nLast 5 odd mins i s retained for Q&A and discussion. GEO:12.9431582;77.5964488824009 LAST-MODIFIED;VALUE=DATE-TIME:20200619T062515Z LOCATION:Audi 3 - Nimhans Convention Centre\nBangalore\, IN URL:https://hasgeek.com/fifthelephant/2012/schedule/an-introduction-to-dat a-visualization-processing-programming-language-EigQEXvJ3tXfWdXT6WjvDo BEGIN:VALARM ACTION:display DESCRIPTION:An Introduction to Data Visualization & Processing ( Programmi ng Language) in Audi 3 in 5 minutes TRIGGER:-PT5M END:VALARM END:VEVENT BEGIN:VEVENT SUMMARY:Visualising Text DTSTART;TZID=Asia/Kolkata;VALUE=DATE-TIME:20120727T121500 DTEND;TZID=Asia/Kolkata;VALUE=DATE-TIME:20120727T130000 DTSTAMP;VALUE=DATE-TIME:20200811T123622Z UID:session/Hj9Co5h57kRu57zrdRkhW7@hasgeek.com CATEGORIES:Data Visualization,Intermediate,Tutorial CREATED;VALUE=DATE-TIME:20190704T130620Z DESCRIPTION:There are a surprisingly large number of techniques for visual ising text have emerged in the last five years. Streamgraphs\, document ar cs\, word spectrums\, the ever popular word cloud\, etc.\n\nCombined with traditional analytical techniques like sentiment analysis\, statistical im probability\, stemming\, n-gram Markov chains\, etc\, we now have powerful ways of summarising and extracting meaning from text.\n\nThis session wil l walk through examples of popular text visualisations in an easy-to-under stand way\, online tools you can use right away\, and provide you a starti ng point to build your own applications to visualise text. GEO:12.9431582;77.5964488824009 LAST-MODIFIED;VALUE=DATE-TIME:20200619T062515Z LOCATION:Audi 3 - Nimhans Convention Centre\nBangalore\, IN URL:https://hasgeek.com/fifthelephant/2012/schedule/visualising-text-Hj9Co 5h57kRu57zrdRkhW7 BEGIN:VALARM ACTION:display DESCRIPTION:Visualising Text in Audi 3 in 5 minutes TRIGGER:-PT5M END:VALARM END:VEVENT BEGIN:VEVENT SUMMARY:Big Data and the Cloud a Best Friend Story DTSTART;TZID=Asia/Kolkata;VALUE=DATE-TIME:20120727T121500 DTEND;TZID=Asia/Kolkata;VALUE=DATE-TIME:20120727T130000 DTSTAMP;VALUE=DATE-TIME:20200811T123622Z UID:session/KpGRcM9Z91seFp3nDWA87M@hasgeek.com CATEGORIES:Big Data Infrastructure & Processing,Beginner,Lecture CREATED;VALUE=DATE-TIME:20190704T130541Z DESCRIPTION:I will show how cloud infrastructure coupled with open source software is enabling the collection\, sharing and processing of big data. I will have specific case studies on organisation that are now enabled to collect and use this information\, how they are using it and what problem s they are solving. As an evangelist for Amazon Web Services\, I am able to work across many organisation using the cloud within big data and am on the front lines of this massive disruption. GEO:12.9431582;77.5964488824009 LAST-MODIFIED;VALUE=DATE-TIME:20200619T062515Z LOCATION:Audi 1 - Nimhans Convention Centre\nBangalore\, IN URL:https://hasgeek.com/fifthelephant/2012/schedule/big-data-and-the-cloud -a-best-friend-story-KpGRcM9Z91seFp3nDWA87M BEGIN:VALARM ACTION:display DESCRIPTION:Big Data and the Cloud a Best Friend Story in Audi 1 in 5 minu tes TRIGGER:-PT5M END:VALARM END:VEVENT BEGIN:VEVENT SUMMARY:Simplifying advanced analytics to empower business managers DTSTART;TZID=Asia/Kolkata;VALUE=DATE-TIME:20120727T121500 DTEND;TZID=Asia/Kolkata;VALUE=DATE-TIME:20120727T130000 DTSTAMP;VALUE=DATE-TIME:20200811T123622Z UID:session/ASSGEKhNiMSRwEs9CqakZS@hasgeek.com CATEGORIES:Industry & Business,Beginner,Lecture CREATED;VALUE=DATE-TIME:20190704T130609Z DESCRIPTION:Advanced analytics can significantly help business managers ma ke more informed decisions by uncovering crucial insights\, that would oth erwise remain hidden. While benefits of analytics are agreed by senior man agement of most enterprises\, the adoption continues to be limited in most enterprises. As Big Data grows in importance\, low adoption rate of analy tics will evolve as an “Achilles Heel” . \n\nThis session will showcas e how to effectively combine analytics and business by embedding data mini ng and machine learning techniques in a business application with right bl end of business context\, simplified user interface and new age IT infrast ructure. This can significantly simplify sophisticated analytics to make i t more accessible to business managers in their day to day decision making process. GEO:12.9431582;77.5964488824009 LAST-MODIFIED;VALUE=DATE-TIME:20190704T130609Z LOCATION:Audi 2 - Nimhans Convention Centre\nBangalore\, IN URL:https://hasgeek.com/fifthelephant/2012/schedule/simplifying-advanced-a nalytics-to-empower-business-managers-ASSGEKhNiMSRwEs9CqakZS BEGIN:VALARM ACTION:display DESCRIPTION:Simplifying advanced analytics to empower business managers i n Audi 2 in 5 minutes TRIGGER:-PT5M END:VALARM END:VEVENT BEGIN:VEVENT SUMMARY:Explorations in Data Analysis and Visualization DTSTART;TZID=Asia/Kolkata;VALUE=DATE-TIME:20120727T140000 DTEND;TZID=Asia/Kolkata;VALUE=DATE-TIME:20120727T143000 DTSTAMP;VALUE=DATE-TIME:20200811T123622Z UID:session/3ji4hCkUZSc4X4unWkXXcC@hasgeek.com CATEGORIES:Data Visualization,Beginner,Discussion CREATED;VALUE=DATE-TIME:20190705T035308Z DESCRIPTION:An exploration of data analysis and visualization. What were o ur goals and how we built a simple tool\, step by step. We will take you t hrough a journey of what we tried and what ended up with. We can discuss h ow you can build your own and some ideas for the future. GEO:12.9431582;77.5964488824009 LAST-MODIFIED;VALUE=DATE-TIME:20190705T035343Z LOCATION:Audi 3 - Nimhans Convention Centre\nBangalore\, IN URL:https://hasgeek.com/fifthelephant/2012/schedule/explorations-in-data-a nalysis-and-visualization-3ji4hCkUZSc4X4unWkXXcC BEGIN:VALARM ACTION:display DESCRIPTION:Explorations in Data Analysis and Visualization in Audi 3 in 5 minutes TRIGGER:-PT5M END:VALARM END:VEVENT BEGIN:VEVENT SUMMARY:Looking beyond the usual suspects for Near Real Time analytics on Big Data DTSTART;TZID=Asia/Kolkata;VALUE=DATE-TIME:20120727T140000 DTEND;TZID=Asia/Kolkata;VALUE=DATE-TIME:20120727T143000 DTSTAMP;VALUE=DATE-TIME:20200811T123622Z UID:session/B6C3N8UYnacq7KnBhmLiRE@hasgeek.com CATEGORIES:Big Data Infrastructure & Processing,Intermediate,Lecture CREATED;VALUE=DATE-TIME:20190705T035139Z DESCRIPTION:Talk will outline the big data use cases for PayPal and provid e learnings for POCs on 1 use case involving processing 20 TB of data and 250 billion events per day in near real time GEO:12.9431582;77.5964488824009 LAST-MODIFIED;VALUE=DATE-TIME:20200619T062515Z LOCATION:Audi 1 - Nimhans Convention Centre\nBangalore\, IN URL:https://hasgeek.com/fifthelephant/2012/schedule/looking-beyond-the-usu al-suspects-for-near-real-time-analytics-on-big-data-B6C3N8UYnacq7KnBhmLiR E BEGIN:VALARM ACTION:display DESCRIPTION:Looking beyond the usual suspects for Near Real Time analytics on Big Data in Audi 1 in 5 minutes TRIGGER:-PT5M END:VALARM END:VEVENT BEGIN:VEVENT SUMMARY:Introduction to data analysis & visualisation with R DTSTART;TZID=Asia/Kolkata;VALUE=DATE-TIME:20120727T140000 DTEND;TZID=Asia/Kolkata;VALUE=DATE-TIME:20120727T143000 DTSTAMP;VALUE=DATE-TIME:20200811T123622Z UID:session/2TMHJktTgbLnp7fjEwmy4h@hasgeek.com CATEGORIES:Data Analytics,Beginner,Tutorial CREATED;VALUE=DATE-TIME:20190705T035201Z DESCRIPTION:R is an open source software environment for statistical compu ting & graphics. R can be used for data manipulation\, data analysis\, sta tistical modelling and data visualization.\n\nR is used in industry for wi de ranging application from drug research studies\, advertizing campaign e ffectiveness\, building trading strategies in financial markets to mining twitter for sentiment analysis.\n\n GEO:12.9431582;77.5964488824009 LAST-MODIFIED;VALUE=DATE-TIME:20200619T062515Z LOCATION:Audi 2 - Nimhans Convention Centre\nBangalore\, IN URL:https://hasgeek.com/fifthelephant/2012/schedule/introduction-to-data-a nalysis-visualisation-with-r-2TMHJktTgbLnp7fjEwmy4h BEGIN:VALARM ACTION:display DESCRIPTION:Introduction to data analysis & visualisation with R in Audi 2 in 5 minutes TRIGGER:-PT5M END:VALARM END:VEVENT BEGIN:VEVENT SUMMARY:The science behind Predictive Analytics- a Text Mining Perspective DTSTART;TZID=Asia/Kolkata;VALUE=DATE-TIME:20120727T143000 DTEND;TZID=Asia/Kolkata;VALUE=DATE-TIME:20120727T150000 DTSTAMP;VALUE=DATE-TIME:20200811T123622Z UID:session/7VmnZWk4Fcivv7WmSaNLH9@hasgeek.com CATEGORIES:Data Analytics,Intermediate,Lecture CREATED;VALUE=DATE-TIME:20190705T035424Z DESCRIPTION:The abundance of structured and unstructured interaction data (web journey metrics\, chats\, etc.)\, motivates us to leverage the power of this data. I'll concentrate on the algorithms which are being deployed now-a-days to dig out inferences from customer text data. I will illustra te with some use cases. GEO:12.9431582;77.5964488824009 LAST-MODIFIED;VALUE=DATE-TIME:20200619T062515Z LOCATION:Audi 2 - Nimhans Convention Centre\nBangalore\, IN URL:https://hasgeek.com/fifthelephant/2012/schedule/the-science-behind-pre dictive-analytics-a-text-mining-perspective-7VmnZWk4Fcivv7WmSaNLH9 BEGIN:VALARM ACTION:display DESCRIPTION:The science behind Predictive Analytics- a Text Mining Perspec tive in Audi 2 in 5 minutes TRIGGER:-PT5M END:VALARM END:VEVENT BEGIN:VEVENT SUMMARY:Lessons from the evolution of Mayavi DTSTART;TZID=Asia/Kolkata;VALUE=DATE-TIME:20120727T143000 DTEND;TZID=Asia/Kolkata;VALUE=DATE-TIME:20120727T150000 DTSTAMP;VALUE=DATE-TIME:20200811T123622Z UID:session/SfYyuTj629oBgYaG5cFhEU@hasgeek.com CATEGORIES:Data Analytics,Intermediate,Lecture CREATED;VALUE=DATE-TIME:20190705T035540Z DESCRIPTION:Mayavi is an open source 3D visualization package that provide s multiple ways\nfor a scientist/engineer to visualize their data in 3D. I t is implemented in\nPython and is designed to be used from both Python's interactive interpreter\nand as a standalone application. It boasts of ma ny interesting features\, which\ninclude an easy to use scripting interfac e\, embeddability into custom\ndialogs/applications\, support for multiple UI tookits\, off-screen rendering\nand automatic recording of UI actions to readable Python scripts. These have\nresulted in Mayavi being used in a very wide variety of applications all over\nthe world in academia and co mmercial organizations.\n\nThe first version of Mayavi was released in 200 1 and was based on a previous\nattempt to produce a visualization tool for colleagues in the author's lab\nwhen he was a PhD student. Subsequently in 2005\, Enthought funded the author\nto work on a much more powerful rew rite of Mayavi.\n\nIn this talk we'll look at the historical development o f Mayavi\, both in terms\nof the technical implementation and features\, a s well as the motivation for the\nevolution. While many of the software e ngineering lessons learned from this\nproject are not new\, the perspectiv e of building the package for the\nparticular set of users is interesting and illuminating.\n GEO:12.9431582;77.5964488824009 LAST-MODIFIED;VALUE=DATE-TIME:20200619T062515Z LOCATION:Audi 3 - Nimhans Convention Centre\nBangalore\, IN URL:https://hasgeek.com/fifthelephant/2012/schedule/lessons-from-the-evolu tion-of-mayavi-SfYyuTj629oBgYaG5cFhEU BEGIN:VALARM ACTION:display DESCRIPTION:Lessons from the evolution of Mayavi in Audi 3 in 5 minutes TRIGGER:-PT5M END:VALARM END:VEVENT BEGIN:VEVENT SUMMARY:Build your own Real Time Analytics and Visualization\, Enable Comp lex Event Processing\, Event Patterns and Aggregates DTSTART;TZID=Asia/Kolkata;VALUE=DATE-TIME:20120727T143000 DTEND;TZID=Asia/Kolkata;VALUE=DATE-TIME:20120727T150000 DTSTAMP;VALUE=DATE-TIME:20200811T123622Z UID:session/UJ7n7jYjDWGoGQrKtiN4re@hasgeek.com CATEGORIES:Big Data Infrastructure & Processing,Intermediate,Demo CREATED;VALUE=DATE-TIME:20190705T035339Z DESCRIPTION:Traditionally people use databases or data-warehouses or even Hadoop and Map Reduce to perform batch analytics. However real time analyt ics\, real time visualization and complex event processing require drastic ally different architecture\, storage and paradigms.\n\nIn this talk we wi ll share a model for accomplishing the above using:\n \nElastic Search: To push large data sets into sharded lucene index. This index can be in m emory\, partially in memory and partially on disk etc. This is key to real time data.\n\nstatsd: Aggregator (provides aggregates on the basic events ). This is key to continuous ETL and continuous analytics.\n\nReverse Pump : The reverse pump\, pushes back aggregate information into Elastic Search \n\nPattern Recognition: This is similar to looking for regular expression s in Perl. We look for patterns in the event stream and match patterns. No te the sliding time window over which is applies is finite and limited by memory as are the number of pattern matchers. This system uses a reverse i ndex and RETE techniques.\n\nGraphite: Basic Visualization (includes simpl e realtime visualization)\n\nNotification: These will be triggered on matc hes of static rules or complex pattern match on base event/aggregate event s or patterns matched\n\nExtras: \nMySQL binlog parser to observe and ana lyze events (BinLogParser) \nDepict the mood of Mysql with music. Make it sing! GEO:12.9431582;77.5964488824009 LAST-MODIFIED;VALUE=DATE-TIME:20200619T062515Z LOCATION:Audi 1 - Nimhans Convention Centre\nBangalore\, IN URL:https://hasgeek.com/fifthelephant/2012/schedule/build-your-own-real-ti me-analytics-and-visualization-enable-complex-event-processing-event-patte rns-and-aggregates-UJ7n7jYjDWGoGQrKtiN4re BEGIN:VALARM ACTION:display DESCRIPTION:Build your own Real Time Analytics and Visualization\, Enable Complex Event Processing\, Event Patterns and Aggregates in Audi 1 in 5 mi nutes TRIGGER:-PT5M END:VALARM END:VEVENT BEGIN:VEVENT SUMMARY:Seeing through the Haze - Advanced Visualisations using Python DTSTART;TZID=Asia/Kolkata;VALUE=DATE-TIME:20120727T150000 DTEND;TZID=Asia/Kolkata;VALUE=DATE-TIME:20120727T153000 DTSTAMP;VALUE=DATE-TIME:20200811T123622Z UID:session/EDAE4CaoZfYaWYsdxEWBqj@hasgeek.com CATEGORIES:Data Visualization,Advanced,Demo CREATED;VALUE=DATE-TIME:20190705T035721Z DESCRIPTION:This Tutorial/Demo will go indepth into how to use Python libr aries to represent\, analyze and visualize multidimensional data. Some of the advanced visualizations that will be covered at heatmaps\, treemaps\, streamgraphs\, Hinton diagrams and more. GEO:12.9431582;77.5964488824009 LAST-MODIFIED;VALUE=DATE-TIME:20200619T062515Z LOCATION:Audi 3 - Nimhans Convention Centre\nBangalore\, IN URL:https://hasgeek.com/fifthelephant/2012/schedule/seeing-through-the-haz e-advanced-visualisations-using-python-EDAE4CaoZfYaWYsdxEWBqj BEGIN:VALARM ACTION:display DESCRIPTION:Seeing through the Haze - Advanced Visualisations using Python in Audi 3 in 5 minutes TRIGGER:-PT5M END:VALARM END:VEVENT BEGIN:VEVENT SUMMARY: “I know what you are going to do next summer” – Predicting Repeat Purchase Behavior by using Bayesian Hierarchical model and Regressi on Techniques DTSTART;TZID=Asia/Kolkata;VALUE=DATE-TIME:20120727T150000 DTEND;TZID=Asia/Kolkata;VALUE=DATE-TIME:20120727T153000 DTSTAMP;VALUE=DATE-TIME:20200811T123622Z UID:session/DhqgVbkzqBzWfbv34ebhh8@hasgeek.com CATEGORIES:Data Analytics,Intermediate,Lecture CREATED;VALUE=DATE-TIME:20190705T035708Z DESCRIPTION:In the context of targeted marketing to consumers\, the abilit y to tell which customers are more likely than others to make a purchase w ith HP in the near future greatly enhances effectiveness of any marketing campaign. It helps to rank customers on their propensity to re-purchase\, and leads to preferential treatment of the right customers. It also reduce s the likelihood of bombarding customers\, who are less likely to purchase \, with marketing material (over email or postal mail)\, possibly alienati ng them from future interest in HP. \n\nThe propensity to make a repeat pu rchase depends on two parameters unique to each customer\, the probability of churn and the frequency of transactions. The customer repeat purchase modeling framework we propose\, based on a Regression based approximation to a Bayesian hierarchical model\, answer these questions. Using the answ er to these questions as inputs we predicted the likelihood of a customer making a transaction within a time span into the future (e.g. in the next six months). \n\nOur algorithm can score massive databases for repeat purc hase. Further\, since it uses only transaction data\, it is readily applic able to a wide array of customer segments across different business units \n GEO:12.9431582;77.5964488824009 LAST-MODIFIED;VALUE=DATE-TIME:20200619T062515Z LOCATION:Audi 2 - Nimhans Convention Centre\nBangalore\, IN URL:https://hasgeek.com/fifthelephant/2012/schedule/i-know-what-you-are-go ing-to-do-next-summer-predicting-repeat-purchase-behavior-by-using-bayesia n-hierarchical-model-and-regression-techniques-DhqgVbkzqBzWfbv34ebhh8 BEGIN:VALARM ACTION:display DESCRIPTION: “I know what you are going to do next summer” – Predict ing Repeat Purchase Behavior by using Bayesian Hierarchical model and Regr ession Techniques in Audi 2 in 5 minutes TRIGGER:-PT5M END:VALARM END:VEVENT BEGIN:VEVENT SUMMARY:Splunk> Driving Real-time Analytics from Small Data\, Big Data and Huge Data DTSTART;TZID=Asia/Kolkata;VALUE=DATE-TIME:20120727T150000 DTEND;TZID=Asia/Kolkata;VALUE=DATE-TIME:20120727T153000 DTSTAMP;VALUE=DATE-TIME:20200811T123622Z UID:session/UgJTXBsa5qf5qT3Adu7xmD@hasgeek.com CATEGORIES:Data Analytics,Intermediate,Lecture CREATED;VALUE=DATE-TIME:20190705T035643Z DESCRIPTION:Data is everywhere\, but few organizations are deriving the fu ll value from their data. Hidden inside the websites\, application servers \, networks and mobile devices that power the enterprise is critical infor mation about website traffic\, customer behavior\, IT systems performance\ , cyber-security threats and more. How do you keep up with the velocity an d variety of data streaming in and get actionable insights on it\, all in real-time? What about correlating big data with data from social platforms and relational databases? In this talk\, we'll examine real-world example s from leading online\, retail\, telecoms and financial services organiza tions that offer a blueprint for integrating big data technologies\, deliv ering new visibility and insights to IT and the business users and acceler ating the adoption of big data in the enterprise.\n GEO:12.9431582;77.5964488824009 LAST-MODIFIED;VALUE=DATE-TIME:20190705T035647Z LOCATION:Audi 1 - Nimhans Convention Centre\nBangalore\, IN URL:https://hasgeek.com/fifthelephant/2012/schedule/splunk-driving-real-ti me-analytics-from-small-data-big-data-and-huge-data-UgJTXBsa5qf5qT3Adu7xmD BEGIN:VALARM ACTION:display DESCRIPTION:Splunk> Driving Real-time Analytics from Small Data\, Big Data and Huge Data in Audi 1 in 5 minutes TRIGGER:-PT5M END:VALARM END:VEVENT BEGIN:VEVENT SUMMARY:Exploratory Data Analysis with Python DTSTART;TZID=Asia/Kolkata;VALUE=DATE-TIME:20120727T160000 DTEND;TZID=Asia/Kolkata;VALUE=DATE-TIME:20120727T163000 DTSTAMP;VALUE=DATE-TIME:20200811T123622Z UID:session/UqsQXsSg7vPTaEzxooZo44@hasgeek.com CATEGORIES:Data Analytics,Intermediate,Tutorial CREATED;VALUE=DATE-TIME:20190705T035837Z DESCRIPTION:So we have a large data file. We might not know what to do wit h it. We most probably are looking for patterns and trends. With a multitu de of data analysis tools and algorithms at our disposal\, we are often le ft wondering as to what's the right thing to ask of the data.\n\nExplorato ry data analysis is a field which offers tools and algorithms for the most broad\, general look at a piece of data. It is after performing this sort of a global analysis on the data that we can go ahead and think about bui lding a model to describe the data. This tutorial offers insights into the prerequisites for building such models\, and having gained those\, what a ll one could do with the model.\n\nThe tutorial will seek to answer questi ons like:\n - What's the best way to cluster / classify a given dataset?\n - What does the data 'look' like?\n - How has the dataset evolved over ti me?\n - How do I know that I have inferred all I can from the dataset?\n - I see some peculiar trends in the dataset. What might have caused these?\ n - Do all these questions motivate a good machine learning problem? GEO:12.9431582;77.5964488824009 LAST-MODIFIED;VALUE=DATE-TIME:20190705T035840Z LOCATION:Audi 2 - Nimhans Convention Centre\nBangalore\, IN URL:https://hasgeek.com/fifthelephant/2012/schedule/exploratory-data-analy sis-with-python-UqsQXsSg7vPTaEzxooZo44 BEGIN:VALARM ACTION:display DESCRIPTION:Exploratory Data Analysis with Python in Audi 2 in 5 minutes TRIGGER:-PT5M END:VALARM END:VEVENT BEGIN:VEVENT SUMMARY:Building analyzing and visualizing large graphs DTSTART;TZID=Asia/Kolkata;VALUE=DATE-TIME:20120727T160000 DTEND;TZID=Asia/Kolkata;VALUE=DATE-TIME:20120727T163000 DTSTAMP;VALUE=DATE-TIME:20200811T123622Z UID:session/8mNEh84apKkxP66LAP2au9@hasgeek.com CATEGORIES:Data Analytics,Intermediate,Lecture CREATED;VALUE=DATE-TIME:20190705T035854Z DESCRIPTION:A lot of real world problems can be solved better using graphs . In my talk I cover the basics of solving daily problems using graphs\, m odeling and building larger graphs on existing back-end systems (in- memo ry\,relational\,NoSQL). The talk will also cover some simple algorithms an d tools to analyze and visualize such graphs.\n\nThe lecture will be geare d towards programmers\n\nBrief overview:\n\n* Spotting graphs in your data .\n* Graph models -mostly theory\, I'll keep it short.\n* How to model\, a nd program large graphs(millions of nodes and edges).\n* Persisting such g raphs on existing storage architectures (when switching to a graphDB is n ot an option or an overkill).\n* Visualizing huge graphs -what tools exist and how to build your own\, some algorithms for effective visualization. GEO:12.9431582;77.5964488824009 LAST-MODIFIED;VALUE=DATE-TIME:20200619T062515Z LOCATION:Audi 3 - Nimhans Convention Centre\nBangalore\, IN URL:https://hasgeek.com/fifthelephant/2012/schedule/building-analyzing-and -visualizing-large-graphs-8mNEh84apKkxP66LAP2au9 BEGIN:VALARM ACTION:display DESCRIPTION:Building analyzing and visualizing large graphs in Audi 3 in 5 minutes TRIGGER:-PT5M END:VALARM END:VEVENT BEGIN:VEVENT SUMMARY:The Elephant in the Cloud DTSTART;TZID=Asia/Kolkata;VALUE=DATE-TIME:20120727T160000 DTEND;TZID=Asia/Kolkata;VALUE=DATE-TIME:20120727T163000 DTSTAMP;VALUE=DATE-TIME:20200811T123622Z UID:session/TvuVnTkhubsrfofbMnQsKw@hasgeek.com CATEGORIES:Big Data Infrastructure & Processing,Intermediate,Lecture CREATED;VALUE=DATE-TIME:20190705T035757Z DESCRIPTION:Hadoop and frameworks on top of it like Hive are a popular app lication running in the Cloud. The Cloud architecture though is significan tly different - in terms of it's elasticity\, it's latency characteristics and it's pricing models than a regular data center. It can also be daunti ng to a lay user to understand and setup. In this talk we will describe ho w Qubole Data Service has adapted Hadoop and Hive to uniquely fit and expl oit the Cloud architecture and make big data processing easy and accessibl e to all. The agenda will be roughly as follows:\n\n1. Start by covering s ome key characteristics of the Cloud.\n2. Describe the current state of ar t of running Big Data stack in the Cloud and the problems and opportunitie s for improvement in the above.\n3. Describe Qubole Architecture and how w e have attempted to tackle some of these problems.\n4. Demonstrate some of the usability enhancements and go over some performance comparisons. GEO:12.9431582;77.5964488824009 LAST-MODIFIED;VALUE=DATE-TIME:20200619T062515Z LOCATION:Audi 1 - Nimhans Convention Centre\nBangalore\, IN URL:https://hasgeek.com/fifthelephant/2012/schedule/the-elephant-in-the-cl oud-TvuVnTkhubsrfofbMnQsKw BEGIN:VALARM ACTION:display DESCRIPTION:The Elephant in the Cloud in Audi 1 in 5 minutes TRIGGER:-PT5M END:VALARM END:VEVENT BEGIN:VEVENT SUMMARY:GlusterFS "Big Data" Interface DTSTART;TZID=Asia/Kolkata;VALUE=DATE-TIME:20120727T163000 DTEND;TZID=Asia/Kolkata;VALUE=DATE-TIME:20120727T170000 DTSTAMP;VALUE=DATE-TIME:20200811T123622Z UID:session/3QJuAv4N1fXe8sGFcurQdj@hasgeek.com CATEGORIES:Big Data Infrastructure & Processing,Intermediate,Demo CREATED;VALUE=DATE-TIME:20190705T040114Z DESCRIPTION:GlusterFS is an open source\, distributed file system capable of scaling to several petabytes and handling thousands of clients. Gluster FS clusters together storage building blocks over Infiniband RDMA or TCP/I P interconnect.\n\nGlusterFS can also be used as a replacement for HDFS an d to run Map/Reduce jobs on data residing on it. GlusterFS Hadoop plugin a llows exisitng Map/Reduce jobs to seamlessly work without any changes. Thi s is done by using Hadoop's FileSystem interface and communicating to Glus terFS via it's native protocol (using FUSE). GEO:12.9431582;77.5964488824009 LAST-MODIFIED;VALUE=DATE-TIME:20200619T062515Z LOCATION:Audi 3 - Nimhans Convention Centre\nBangalore\, IN URL:https://hasgeek.com/fifthelephant/2012/schedule/glusterfs-big-data-int erface-3QJuAv4N1fXe8sGFcurQdj BEGIN:VALARM ACTION:display DESCRIPTION:GlusterFS "Big Data" Interface in Audi 3 in 5 minutes TRIGGER:-PT5M END:VALARM END:VEVENT BEGIN:VEVENT SUMMARY:Approaches to ML techniques on the Real world data. DTSTART;TZID=Asia/Kolkata;VALUE=DATE-TIME:20120727T163000 DTEND;TZID=Asia/Kolkata;VALUE=DATE-TIME:20120727T170000 DTSTAMP;VALUE=DATE-TIME:20200811T123622Z UID:session/B2c6wYt9ry6fYWWr151nNn@hasgeek.com CATEGORIES:Data Analytics,Intermediate,Demo CREATED;VALUE=DATE-TIME:20190705T040015Z DESCRIPTION:List out possibilities of applying various Machine Learning Te chniques to Real World Data. It focuses on what (are available ?)\, why (t o use one or some of them ?) and how (to apply for the problem in hand ?)\ n\nA demo on one/more Real world use cases. This can help an enthusiast to start on .... without digging noise!\n\nA useful scenario would be to lea rn behavioural patterns in social networks. Say\, an algorithm that you ca n train to judge your friends. So you can get some insight in your intutiv e behaviour on the web \;)\nYou can apply these rules may be in training a larger set of your own activities. GEO:12.9431582;77.5964488824009 LAST-MODIFIED;VALUE=DATE-TIME:20200619T062515Z LOCATION:Audi 2 - Nimhans Convention Centre\nBangalore\, IN URL:https://hasgeek.com/fifthelephant/2012/schedule/approaches-to-ml-techn iques-on-the-real-world-data-B2c6wYt9ry6fYWWr151nNn BEGIN:VALARM ACTION:display DESCRIPTION:Approaches to ML techniques on the Real world data. in Audi 2 in 5 minutes TRIGGER:-PT5M END:VALARM END:VEVENT BEGIN:VEVENT SUMMARY:How the Internet Archive preserves petabytes of data DTSTART;TZID=Asia/Kolkata;VALUE=DATE-TIME:20120727T163000 DTEND;TZID=Asia/Kolkata;VALUE=DATE-TIME:20120727T170000 DTSTAMP;VALUE=DATE-TIME:20200811T123622Z UID:session/B6W332WukPUPjAxQeUVTDr@hasgeek.com CATEGORIES:Big Data Infrastructure & Processing,Beginner,Lecture CREATED;VALUE=DATE-TIME:20190705T040001Z DESCRIPTION:The [Internet Archive][ia] has been [archiving the internet][w ayback] since 1996. It also archives and makes available a vast collection of data including films\, audio and books.\n\nThe [Internet Archive][ia] is one of the earliest organizations to work with petabytes of data. It bu ilt its own infrastructure to store\, process and manage its data reliably \, much before the cloud. Being an archive\, preservation of data is the p rimary concern and it affects engineering decisions. \n\nThis talk is an i ntroduction to the Internet Archive and its infrastructure. \n\n[ia]: htt p://archive.org/\n[wayback]: http://web.archive.org/ GEO:12.9431582;77.5964488824009 LAST-MODIFIED;VALUE=DATE-TIME:20200619T062515Z LOCATION:Audi 1 - Nimhans Convention Centre\nBangalore\, IN URL:https://hasgeek.com/fifthelephant/2012/schedule/how-the-internet-archi ve-preserves-petabytes-of-data-B6W332WukPUPjAxQeUVTDr BEGIN:VALARM ACTION:display DESCRIPTION:How the Internet Archive preserves petabytes of data in Audi 1 in 5 minutes TRIGGER:-PT5M END:VALARM END:VEVENT BEGIN:VEVENT SUMMARY:Building the Infrastructure to Handle Big Data DTSTART;TZID=Asia/Kolkata;VALUE=DATE-TIME:20120727T170000 DTEND;TZID=Asia/Kolkata;VALUE=DATE-TIME:20120727T173000 DTSTAMP;VALUE=DATE-TIME:20200811T123622Z UID:session/YWSpFMx7NAkVvC9FYTEZVC@hasgeek.com CATEGORIES:Big Data Infrastructure & Processing,Intermediate,Lecture CREATED;VALUE=DATE-TIME:20190705T040200Z DESCRIPTION:Description : \nI’ll be sharing our experience at PromptClou d where we deal with terabytes of data daily. This talk will cover- how to build a multi-purpose technology stack to manage large scale data and the numerous processes that operate on it. We’ll cover following topics in various depths depending upon audience inclination:\n\n1. Automated instal ls for clusters - (Vagrant\, Chef)\n2. Job scheduling (Resque/Celery)\n3. Process and cluster monitoring( God\,Monit\, Sensu)\, peripheral mention o f Nagios\n4. Queue - the perennial suspect( RabbitMQ)\n5. Data storage - H Base\, MongoDB\, Voldemort\, Riak\n6. Dashboard - Sensu\n\nIf there is tim e we can take up code integration and review (git\,gerrit\,jenkins) which are not directly needed but are crucial if the team size dealing with the above mentioned infrastructure increases. GEO:12.9431582;77.5964488824009 LAST-MODIFIED;VALUE=DATE-TIME:20200619T062515Z LOCATION:Audi 3 - Nimhans Convention Centre\nBangalore\, IN URL:https://hasgeek.com/fifthelephant/2012/schedule/building-the-infrastru cture-to-handle-big-data-YWSpFMx7NAkVvC9FYTEZVC BEGIN:VALARM ACTION:display DESCRIPTION:Building the Infrastructure to Handle Big Data in Audi 3 in 5 minutes TRIGGER:-PT5M END:VALARM END:VEVENT BEGIN:VEVENT SUMMARY:Finding signal in the noise: How to put big data to use DTSTART;TZID=Asia/Kolkata;VALUE=DATE-TIME:20120727T170000 DTEND;TZID=Asia/Kolkata;VALUE=DATE-TIME:20120727T173000 DTSTAMP;VALUE=DATE-TIME:20200811T123622Z UID:session/8y38HV1pNEK1cKmVpjURKX@hasgeek.com CATEGORIES:Data Analytics,Intermediate,Lecture CREATED;VALUE=DATE-TIME:20190705T040142Z DESCRIPTION:Data scientists use a variety of different techniques nowadays to analyze large-scale data and derive valuable insights from them. I wil l cover some of these techniques from machine learning\, NLP and statistic al modeling and how it has been used to uncover insights hidden in the dat a. GEO:12.9431582;77.5964488824009 LAST-MODIFIED;VALUE=DATE-TIME:20200619T062515Z LOCATION:Audi 2 - Nimhans Convention Centre\nBangalore\, IN URL:https://hasgeek.com/fifthelephant/2012/schedule/finding-signal-in-the- noise-how-to-put-big-data-to-use-8y38HV1pNEK1cKmVpjURKX BEGIN:VALARM ACTION:display DESCRIPTION:Finding signal in the noise: How to put big data to use in Aud i 2 in 5 minutes TRIGGER:-PT5M END:VALARM END:VEVENT BEGIN:VEVENT SUMMARY:Build Your Own Search Engine DTSTART;TZID=Asia/Kolkata;VALUE=DATE-TIME:20120727T170000 DTEND;TZID=Asia/Kolkata;VALUE=DATE-TIME:20120727T173000 DTSTAMP;VALUE=DATE-TIME:20200811T123622Z UID:session/DUnqsagCb87bP8nqAn5Hxm@hasgeek.com CATEGORIES:Big Data Infrastructure & Processing,Intermediate,Lecture CREATED;VALUE=DATE-TIME:20190705T040128Z DESCRIPTION:I plan to cover:\n\n1. The basic idea of a Search Engine\n— 1.1 How is it different from a Database?\n— 1.2 Inverted Index\n\n2. Te chniques for constructing an Inverted Index\n— 2.1 Distributed Indexing \n— 2.2 Dynamic Indexing\n— 2.3 Index Compression\n\n3. Scoring and Ranking\n\n4. Scaling Search GEO:12.9431582;77.5964488824009 LAST-MODIFIED;VALUE=DATE-TIME:20200619T062515Z LOCATION:Audi 1 - Nimhans Convention Centre\nBangalore\, IN URL:https://hasgeek.com/fifthelephant/2012/schedule/build-your-own-search- engine-DUnqsagCb87bP8nqAn5Hxm BEGIN:VALARM ACTION:display DESCRIPTION:Build Your Own Search Engine in Audi 1 in 5 minutes TRIGGER:-PT5M END:VALARM END:VEVENT BEGIN:VEVENT SUMMARY:Your Genome on the Cloud: Big Data Challenges in Personalized Medi cine DTSTART;TZID=Asia/Kolkata;VALUE=DATE-TIME:20120728T093000 DTEND;TZID=Asia/Kolkata;VALUE=DATE-TIME:20120728T101500 DTSTAMP;VALUE=DATE-TIME:20200811T123622Z UID:session/4wB8fgwuoBa6VoVuJEP2ej@hasgeek.com CATEGORIES:Data Analytics,Beginner,Lecture CREATED;VALUE=DATE-TIME:20190705T040228Z DESCRIPTION:The session will have two parts.\n\nThe first part will descri be the evolution of genomic measurement over the last two decades\, survey the current state\, describe how ever reducing costs and increased unders tanding are leading to significant impact on disease diagnosis and cure\, discuss how the world will look in the next 5 years when large numbers of people have their genomes sequenced. \n\nThis will lead to the second part where we will describe the Big Data techniques and challenges in handling large volumes of genomic data: what computations need to be run\, what qu eries need to be handled\, how data needs to flow from site of generation to site of consumption etc. These will include clever ways for text index ing\, fast string matching algorithms\, use of special hardware paradigms (SIMD/GPUs)\, Hadoop based pipelines to large volume processing\, visuali zation methods etc.\n\n\n GEO:12.9431582;77.5964488824009 LAST-MODIFIED;VALUE=DATE-TIME:20200619T062515Z LOCATION:Audi 1 - Nimhans Convention Centre\nBangalore\, IN URL:https://hasgeek.com/fifthelephant/2012/schedule/your-genome-on-the-clo ud-big-data-challenges-in-personalized-medicine-4wB8fgwuoBa6VoVuJEP2ej BEGIN:VALARM ACTION:display DESCRIPTION:Your Genome on the Cloud: Big Data Challenges in Personalized Medicine in Audi 1 in 5 minutes TRIGGER:-PT5M END:VALARM END:VEVENT BEGIN:VEVENT SUMMARY:Building Watson -- A Brief Overview of DeepQA and the Jeopardy! Ch allenge DTSTART;TZID=Asia/Kolkata;VALUE=DATE-TIME:20120728T101500 DTEND;TZID=Asia/Kolkata;VALUE=DATE-TIME:20120728T110000 DTSTAMP;VALUE=DATE-TIME:20200811T123622Z UID:session/6ZCnuLCG9onwEnzYXTF5zY@hasgeek.com CATEGORIES:Big Data Infrastructure & Processing,Intermediate,Lecture CREATED;VALUE=DATE-TIME:20190705T040243Z DESCRIPTION:A computer system that can directly and precisely answer natur al\nlanguage questions over an open and broad range of knowledge has been envisioned by scientists and writers since the advent of computers themsel ves. While current computers can store and deliver a wealth of digital con tent created by humans\, they are unable to operate over it in human terms . The quest for building a computer system that can do open-domain Questio n Answering is ultimately driven by a broader vision that sees computers o perating more effectively in human terms rather than strictly computer ter ms. They should function in ways that understand complex information requi rements\, as people would express them\, for example\, in natural language questions or interactive dialogs. Computers should deliver precise\, mean ingful responses\, and synthesize\, integrate\, and rapidly reason over th e breadth of human knowledge as it is most rapidly and naturally produced -- in natural language text.\n\nThe DeepQA project at IBM shapes a grand c hallenge in Computer Science that aims to illustrate how the wide and grow ing accessibility of natural language content and the integration and adva ncement of\nNatural Language Processing\, Information Retrieval\, Machine Learning\,\nKnowledge Representation and Reasoning\, and massively paralle l\ncomputation can drive open-domain automatic Question Answering\ntechnol ogy to a point where it clearly and consistently rivals the\nbest human pe rformance. A first stop along the way was the Jeopardy!\nChallenge\, where a computer system beat human grand champions in the game of Jeopardy!. In this talk\, we will give an overview of the\nDeepQA project and the Jeopa rdy! Challenge. GEO:12.9431582;77.5964488824009 LAST-MODIFIED;VALUE=DATE-TIME:20200619T062515Z LOCATION:Audi 1 - Nimhans Convention Centre\nBangalore\, IN URL:https://hasgeek.com/fifthelephant/2012/schedule/building-watson-a-brie f-overview-of-deepqa-and-the-jeopardy-challenge-6ZCnuLCG9onwEnzYXTF5zY BEGIN:VALARM ACTION:display DESCRIPTION:Building Watson -- A Brief Overview of DeepQA and the Jeopardy ! Challenge in Audi 1 in 5 minutes TRIGGER:-PT5M END:VALARM END:VEVENT BEGIN:VEVENT SUMMARY:RTI & Data: Opportunities\, Issues & Challenges DTSTART;TZID=Asia/Kolkata;VALUE=DATE-TIME:20120728T113000 DTEND;TZID=Asia/Kolkata;VALUE=DATE-TIME:20120728T121500 DTSTAMP;VALUE=DATE-TIME:20200811T123622Z UID:session/6wCMGpUTe5ppBiTw89CSba@hasgeek.com CATEGORIES:Industry & Business,Beginner,Discussion CREATED;VALUE=DATE-TIME:20190705T040315Z DESCRIPTION:To explain:\n- What kind of data can you get from the governme nt via RTI\n- What this data can be used for\n- How to get government data using RTI\n- Issues faced when asking government organizations for RTI. E xamples and case studies\n- Issues with Data received via RTI\n- To discus s ideas for making government data public GEO:12.9431582;77.5964488824009 LAST-MODIFIED;VALUE=DATE-TIME:20200619T062515Z LOCATION:Audi 3 - Nimhans Convention Centre\nBangalore\, IN URL:https://hasgeek.com/fifthelephant/2012/schedule/rti-data-opportunities -issues-challenges-6wCMGpUTe5ppBiTw89CSba BEGIN:VALARM ACTION:display DESCRIPTION:RTI & Data: Opportunities\, Issues & Challenges in Audi 3 in 5 minutes TRIGGER:-PT5M END:VALARM END:VEVENT BEGIN:VEVENT SUMMARY:Exponential Growth Models and Impact on Sales Forecast\,Data Volum e\, Query Latency\, Capacity Planning and Search Latency DTSTART;TZID=Asia/Kolkata;VALUE=DATE-TIME:20120728T113000 DTEND;TZID=Asia/Kolkata;VALUE=DATE-TIME:20120728T121500 DTSTAMP;VALUE=DATE-TIME:20200811T123622Z UID:session/6MLFTzc5dYQzMyMdQvPbw9@hasgeek.com CATEGORIES:Big Data Infrastructure & Processing,Beginner,Lecture CREATED;VALUE=DATE-TIME:20190705T040256Z DESCRIPTION:-Exponential Growth markets\n---- Analyze mathematically the m arket spaces \n Word of mouth Facebook\, Foursquare\, Twitter\, Fli pkart\n Advertising driven growth and mathematical model \n\n---- M odel Fertility in other domains\n - Radioactive decay\n - Revolution\, Love\, Infatuation\n - Mosquito/Allergens distribu tion with height\n\n----- Basic Demand Forecasting\n Base demand\ n Trends on demand\n Seasonality on demand\n Peak Demand Modelling (Exponential + Poisson)\n\n---- Impact to OLTP\n Web Scaling\n Scaling Message Systems - traditional databases ->C ustom solutions\n Caching \n Large DB scaling (compressio n\, indexing\, archiving\, sharding and federated query)\n\n---- Impact to OLAP \n Hadoop and Scaling operations\n Recommendation systems and the impact from time series\n\nWhen does exponential growth e nd? \n Epidemic models applied here\n How to prevent exponent ial growth of a competitor (vaccine models in disease spread)\n\nQ&A GEO:12.9431582;77.5964488824009 LAST-MODIFIED;VALUE=DATE-TIME:20200619T062515Z LOCATION:Audi 1 - Nimhans Convention Centre\nBangalore\, IN URL:https://hasgeek.com/fifthelephant/2012/schedule/exponential-growth-mod els-and-impact-on-sales-forecast-data-volume-query-latency-capacity-planni ng-and-search-latency-6MLFTzc5dYQzMyMdQvPbw9 BEGIN:VALARM ACTION:display DESCRIPTION:Exponential Growth Models and Impact on Sales Forecast\,Data V olume\, Query Latency\, Capacity Planning and Search Latency in Audi 1 in 5 minutes TRIGGER:-PT5M END:VALARM END:VEVENT BEGIN:VEVENT SUMMARY:Data Explosion in Medical Imaging DTSTART;TZID=Asia/Kolkata;VALUE=DATE-TIME:20120728T113000 DTEND;TZID=Asia/Kolkata;VALUE=DATE-TIME:20120728T121500 DTSTAMP;VALUE=DATE-TIME:20200811T123622Z UID:session/MZ67WPxhyHst2AzWDzHcpf@hasgeek.com CATEGORIES:Industry & Business,Beginner,Lecture CREATED;VALUE=DATE-TIME:20190705T040306Z DESCRIPTION:This is a journey through a marshy problem land. \n \nWe take a look at the basics of medical imaging\, the structure and size of the da ta generated and explore the challenges of large data archival\, compressi on techniques\, and image transfer across low-bandwidth network.\n \nFinal ly\, we wrap up with possible directions of research where independent res earchers can make progress\, necessary open source toolkits needed and whe re to get the relevant datasets from.\n \n(While I draw upon my experience s from my career\, the talk is fairly vendor-neutral and does not represen t my company.)\n GEO:12.9431582;77.5964488824009 LAST-MODIFIED;VALUE=DATE-TIME:20200619T062515Z LOCATION:Audi 2 - Nimhans Convention Centre\nBangalore\, IN URL:https://hasgeek.com/fifthelephant/2012/schedule/data-explosion-in-medi cal-imaging-MZ67WPxhyHst2AzWDzHcpf BEGIN:VALARM ACTION:display DESCRIPTION:Data Explosion in Medical Imaging in Audi 2 in 5 minutes TRIGGER:-PT5M END:VALARM END:VEVENT BEGIN:VEVENT SUMMARY:The Elephant that Flew - Big Data Analytics @ InMobi DTSTART;TZID=Asia/Kolkata;VALUE=DATE-TIME:20120728T121500 DTEND;TZID=Asia/Kolkata;VALUE=DATE-TIME:20120728T130000 DTSTAMP;VALUE=DATE-TIME:20200811T123622Z UID:session/N7PH7qyDjZUdAsC8wfLXui@hasgeek.com CATEGORIES:Big Data Infrastructure & Processing,Intermediate,Lecture CREATED;VALUE=DATE-TIME:20190705T040407Z DESCRIPTION:Data processing\, analysis and visualization of data at a scal e at which web systems (like InMobi) operate\, is a very hard problem. I'l l discuss the evolution of the data analytics at InMobi - the approaches t hat we tried\, the challenges faced\, and our rationale to develop an in-h ouse analytics system on top of Hadoop. I will be discussing the details o f this system (moderate level)\, how it is helping us attain better effici ency levels and what future directions could it take. I will also be doing a demo of the capabilities of this system towards the end of the talk. GEO:12.9431582;77.5964488824009 LAST-MODIFIED;VALUE=DATE-TIME:20200619T062515Z LOCATION:Audi 1 - Nimhans Convention Centre\nBangalore\, IN URL:https://hasgeek.com/fifthelephant/2012/schedule/the-elephant-that-flew -big-data-analytics-inmobi-N7PH7qyDjZUdAsC8wfLXui BEGIN:VALARM ACTION:display DESCRIPTION:The Elephant that Flew - Big Data Analytics @ InMobi in Audi 1 in 5 minutes TRIGGER:-PT5M END:VALARM END:VEVENT BEGIN:VEVENT SUMMARY:Open data and the life sciences: An open world DTSTART;TZID=Asia/Kolkata;VALUE=DATE-TIME:20120728T121500 DTEND;TZID=Asia/Kolkata;VALUE=DATE-TIME:20120728T130000 DTSTAMP;VALUE=DATE-TIME:20200811T123622Z UID:session/LKQRZdo9546gRs3DaHV6e3@hasgeek.com CATEGORIES:Industry & Business,Beginner,Lecture CREATED;VALUE=DATE-TIME:20190705T040419Z DESCRIPTION:Introduction to the semantic web. Dealing with data deluge an d disparate data in the Life Sciences. What are the industry drivers? How can this data be mined to generate new hypothesis and forge new paths to better health care? Will end with a brief description of DIstilBio and specific use cases in pharma and health care. GEO:12.9431582;77.5964488824009 LAST-MODIFIED;VALUE=DATE-TIME:20200619T062515Z LOCATION:Audi 2 - Nimhans Convention Centre\nBangalore\, IN URL:https://hasgeek.com/fifthelephant/2012/schedule/open-data-and-the-life -sciences-an-open-world-LKQRZdo9546gRs3DaHV6e3 BEGIN:VALARM ACTION:display DESCRIPTION:Open data and the life sciences: An open world in Audi 2 in 5 minutes TRIGGER:-PT5M END:VALARM END:VEVENT BEGIN:VEVENT SUMMARY:Aadhaar - world's largest biometric identity platform (200 trillio n biometric matches per day\, 2 PB of data) DTSTART;TZID=Asia/Kolkata;VALUE=DATE-TIME:20120728T121500 DTEND;TZID=Asia/Kolkata;VALUE=DATE-TIME:20120728T130000 DTSTAMP;VALUE=DATE-TIME:20200811T123622Z UID:session/GJui9sMyE1ykmjUgjyfr6A@hasgeek.com CATEGORIES:Big Data Infrastructure & Processing,Intermediate,Lecture CREATED;VALUE=DATE-TIME:20190705T040426Z DESCRIPTION:Aadhaar has unique compute and data challenges that exhibit al l characteristics of Big Data - Volume\, Variety and Velocity. The challen ge is to derive Value from these attributes.\n---------------------------- --------------------------------------------------\nA number of technologi es have been used to handle massive parallel processing\, streaming data r eads\, data locality computing\, low latency reads\, data integrity and ch allenges of dealing with distributed data - best explained by the CAP theo rem.\n---------------------------------------------------------------\nBig Data solutions are used in building both OLTP and Analytics systems in Aa dhaar.\n-----------------------------------------------------------------\ n# Get to know how the following technologies are used:\n* Hadoop stack : HDFS\, HBase\, Hive\, Pig\, Zookeeper\n* MySQL : sharded\, partitioned\, d istributed\n* SEDA : Mule\, RabbitMQ\n* Search : MongoDB\, sharded Solr\n* Compute Grid : Spring\, GridGain\n* Monitoring : Custom built\, Nagios\n* Analytics & Visualization\n* Deployment footprint : Thousands of CPU core s\n* Extensive Data archival\, DR GEO:12.9431582;77.5964488824009 LAST-MODIFIED;VALUE=DATE-TIME:20200619T062515Z LOCATION:Audi 3 - Nimhans Convention Centre\nBangalore\, IN URL:https://hasgeek.com/fifthelephant/2012/schedule/aadhaar-worlds-largest -biometric-identity-platform-200-trillion-biometric-matches-per-day-2-pb-o f-data-GJui9sMyE1ykmjUgjyfr6A BEGIN:VALARM ACTION:display DESCRIPTION:Aadhaar - world's largest biometric identity platform (200 tri llion biometric matches per day\, 2 PB of data) in Audi 3 in 5 minutes TRIGGER:-PT5M END:VALARM END:VEVENT BEGIN:VEVENT SUMMARY:Messaging architecture at Facebook DTSTART;TZID=Asia/Kolkata;VALUE=DATE-TIME:20120728T140000 DTEND;TZID=Asia/Kolkata;VALUE=DATE-TIME:20120728T143000 DTSTAMP;VALUE=DATE-TIME:20200811T123622Z UID:session/HHfVWXahwoumw4hePyaxcT@hasgeek.com CATEGORIES:Industry & Business,Advanced,Lecture CREATED;VALUE=DATE-TIME:20190705T040442Z DESCRIPTION:Building a scalable messaging backend for a Billion user is Ha rd. This talk covers the initial design days when the Facebook messaging a rchitecture was being decided.\n\nShould back-end stores be eventually or strongly consistent? How much availability is highly available? Should we build services or components? How big does a piece of software have to be before it stops being a component. How can a back-end be built for an appl ication that's yet to be designed? Is it possible to simulate a billion us er workload - and does it help?\n\nThe issues faced by the design team ran ged from the technical to the philosophical - to ultimately the practical. This talk will explore some of these areas. GEO:12.9431582;77.5964488824009 LAST-MODIFIED;VALUE=DATE-TIME:20200619T062515Z LOCATION:Audi 1 - Nimhans Convention Centre\nBangalore\, IN URL:https://hasgeek.com/fifthelephant/2012/schedule/messaging-architecture -at-facebook-HHfVWXahwoumw4hePyaxcT BEGIN:VALARM ACTION:display DESCRIPTION:Messaging architecture at Facebook in Audi 1 in 5 minutes TRIGGER:-PT5M END:VALARM END:VEVENT BEGIN:VEVENT SUMMARY:the aRt of NSSO data DTSTART;TZID=Asia/Kolkata;VALUE=DATE-TIME:20120728T140000 DTEND;TZID=Asia/Kolkata;VALUE=DATE-TIME:20120728T143000 DTSTAMP;VALUE=DATE-TIME:20200811T123622Z UID:session/JmqairG6JMnqJfqeXGcbkS@hasgeek.com CATEGORIES:Industry & Business,Intermediate,Lecture CREATED;VALUE=DATE-TIME:20190705T040512Z DESCRIPTION:First part of the session demystifies the organisation of raw data published by NSSO. We discuss the different rounds and schedules of data collection by NSSO\, and the specificities of the published datasets. \n\nIn the second part\, we discuss the process of extracting the raw data into formats that can be statistically manipulated and the challenges the rein.\n\nWe conclude by briefly noting few R libraries that can be used fo r exploratory visualisation of the extracted data. GEO:12.9431582;77.5964488824009 LAST-MODIFIED;VALUE=DATE-TIME:20200619T062515Z LOCATION:Audi 3 - Nimhans Convention Centre\nBangalore\, IN URL:https://hasgeek.com/fifthelephant/2012/schedule/the-art-of-nsso-data-J mqairG6JMnqJfqeXGcbkS BEGIN:VALARM ACTION:display DESCRIPTION:the aRt of NSSO data in Audi 3 in 5 minutes TRIGGER:-PT5M END:VALARM END:VEVENT BEGIN:VEVENT SUMMARY:Analytics Platform - Experience and evolution DTSTART;TZID=Asia/Kolkata;VALUE=DATE-TIME:20120728T140000 DTEND;TZID=Asia/Kolkata;VALUE=DATE-TIME:20120728T143000 DTSTAMP;VALUE=DATE-TIME:20200811T123622Z UID:session/FqK3pj8dP65rWxsYLBT1td@hasgeek.com CATEGORIES:Industry & Business,Beginner,Discussion CREATED;VALUE=DATE-TIME:20190705T040500Z DESCRIPTION:Analytics is an involving ask\; the challenges are multi-fold. On one side\, the technology required to handle huge data volumes\, & on the other side\, needs of the customer & domain which constantly evolve. A dd to that\, the demands of instantaneous data availability. In the course of developing an analytics solution for media consumption & then extendin g it to other domains (ex. security)\, we learnt a lot & this talk is all about sharing the same. \n\nHere is how we will go about that talk -\n\n* An introduction to Akamai Media Analytics\, the product & the domain\n\n* The technology infrastructure that runs it for us - at an architectural le vel. The nuances of delayed reporting & near real time reporting\n\n* The data & reporting flexibility that empowers our customers\; how the platfor m makes it happen\n\n* Q&A\n GEO:12.9431582;77.5964488824009 LAST-MODIFIED;VALUE=DATE-TIME:20200619T062515Z LOCATION:Audi 2 - Nimhans Convention Centre\nBangalore\, IN URL:https://hasgeek.com/fifthelephant/2012/schedule/analytics-platform-exp erience-and-evolution-FqK3pj8dP65rWxsYLBT1td BEGIN:VALARM ACTION:display DESCRIPTION:Analytics Platform - Experience and evolution in Audi 2 in 5 m inutes TRIGGER:-PT5M END:VALARM END:VEVENT BEGIN:VEVENT SUMMARY:Real Time Intelligent Systems & Big Data Streams DTSTART;TZID=Asia/Kolkata;VALUE=DATE-TIME:20120728T143000 DTEND;TZID=Asia/Kolkata;VALUE=DATE-TIME:20120728T150000 DTSTAMP;VALUE=DATE-TIME:20200811T123622Z UID:session/8nk28qVPBh9dr4gppW8UKb@hasgeek.com CATEGORIES:Data Analytics,Intermediate,Lecture CREATED;VALUE=DATE-TIME:20190705T040525Z DESCRIPTION:Moving to real time\, “anticipation denotes intelligence”. There is and will be vast potential for improved performance using appli ed predictive analytics\, real time in many domains. The oncoming onslaugh t of sensor data\, beyond the firewall generated content\, all injected in to the enterprise. For the advanced analytics and data mining community\, what differentiates is the consumption of analytics. Use off the shelf com ponents to stitch a system together\, bolt on some machine learning algori thms\, and optimize. Join us for a lively discussion of a few applied exa mples. GEO:12.9431582;77.5964488824009 LAST-MODIFIED;VALUE=DATE-TIME:20190705T040548Z LOCATION:Audi 1 - Nimhans Convention Centre\nBangalore\, IN URL:https://hasgeek.com/fifthelephant/2012/schedule/real-time-intelligent- systems-big-data-streams-8nk28qVPBh9dr4gppW8UKb BEGIN:VALARM ACTION:display DESCRIPTION:Real Time Intelligent Systems & Big Data Streams in Audi 1 in 5 minutes TRIGGER:-PT5M END:VALARM END:VEVENT BEGIN:VEVENT SUMMARY:How your favorite retailers make money from analytics? DTSTART;TZID=Asia/Kolkata;VALUE=DATE-TIME:20120728T143000 DTEND;TZID=Asia/Kolkata;VALUE=DATE-TIME:20120728T150000 DTSTAMP;VALUE=DATE-TIME:20200811T123622Z UID:session/GUbqgiqvnQbfZpM2Bi5PCe@hasgeek.com CATEGORIES:Data Analytics,Intermediate,Lecture CREATED;VALUE=DATE-TIME:20190705T040544Z DESCRIPTION:This lecture would be covered with various statistical techniq ues which can reduce the marketing costs of the retailer & understand the customer in detail. It also covers the best practices used in the industry to get quick returns over the investment in retail especially. GEO:12.9431582;77.5964488824009 LAST-MODIFIED;VALUE=DATE-TIME:20200619T062515Z LOCATION:Audi 3 - Nimhans Convention Centre\nBangalore\, IN URL:https://hasgeek.com/fifthelephant/2012/schedule/how-your-favorite-reta ilers-make-money-from-analytics-GUbqgiqvnQbfZpM2Bi5PCe BEGIN:VALARM ACTION:display DESCRIPTION:How your favorite retailers make money from analytics? in Audi 3 in 5 minutes TRIGGER:-PT5M END:VALARM END:VEVENT BEGIN:VEVENT SUMMARY:The Data Journalism Handbook DTSTART;TZID=Asia/Kolkata;VALUE=DATE-TIME:20120728T143000 DTEND;TZID=Asia/Kolkata;VALUE=DATE-TIME:20120728T150000 DTSTAMP;VALUE=DATE-TIME:20200811T123622Z UID:session/5wbDpxRuKK3PevxdqVbJXk@hasgeek.com CATEGORIES:Industry & Business,Beginner,Discussion CREATED;VALUE=DATE-TIME:20190705T040536Z DESCRIPTION:We live in an information age\, where journalists previously h ad to work like hunter-gathers\, they are now drowning in data. The Data J ournalism Handbook aims to help them learn to sail the data seas\, targeti ng questions such as 'where can I find data?'\, 'how do I work with it?' a nd 'how do I present it to the public?'. \n\nThis session will be a discus sion of data journalism and how to get started and also of the data journa lism scene in India. GEO:12.9431582;77.5964488824009 LAST-MODIFIED;VALUE=DATE-TIME:20200619T062515Z LOCATION:Audi 2 - Nimhans Convention Centre\nBangalore\, IN URL:https://hasgeek.com/fifthelephant/2012/schedule/the-data-journalism-ha ndbook-5wbDpxRuKK3PevxdqVbJXk BEGIN:VALARM ACTION:display DESCRIPTION:The Data Journalism Handbook in Audi 2 in 5 minutes TRIGGER:-PT5M END:VALARM END:VEVENT BEGIN:VEVENT SUMMARY:Social Media & Text Analytics using Open Source Tools DTSTART;TZID=Asia/Kolkata;VALUE=DATE-TIME:20120728T150000 DTEND;TZID=Asia/Kolkata;VALUE=DATE-TIME:20120728T153000 DTSTAMP;VALUE=DATE-TIME:20200811T123622Z UID:session/RUck4WKm7YLYSERFQgUKaC@hasgeek.com CATEGORIES:Data Analytics,Beginner,Lecture CREATED;VALUE=DATE-TIME:20190705T040632Z DESCRIPTION:You don't need the most expensive Text Mining Tools or even e xpensive software to start doing Text Analytics. This session will first c over Social Media Analytics and the range of work that can be done in anal yzing Social Media Data. We will then look at a simple framework for doing text analytics and a demo of the same using Rapid Miner. We will also tal k about some libraries available in R and Python to start doing Social Med ia Analytics and any other Text Analytics as well. GEO:12.9431582;77.5964488824009 LAST-MODIFIED;VALUE=DATE-TIME:20190705T040649Z LOCATION:Audi 2 - Nimhans Convention Centre\nBangalore\, IN URL:https://hasgeek.com/fifthelephant/2012/schedule/social-media-text-anal ytics-using-open-source-tools-RUck4WKm7YLYSERFQgUKaC BEGIN:VALARM ACTION:display DESCRIPTION:Social Media & Text Analytics using Open Source Tools in Audi 2 in 5 minutes TRIGGER:-PT5M END:VALARM END:VEVENT BEGIN:VEVENT SUMMARY:Recommendation Play @Flipkart DTSTART;TZID=Asia/Kolkata;VALUE=DATE-TIME:20120728T150000 DTEND;TZID=Asia/Kolkata;VALUE=DATE-TIME:20120728T153000 DTSTAMP;VALUE=DATE-TIME:20200811T123622Z UID:session/MfzhQdUebyQ2DhfcacRPUk@hasgeek.com CATEGORIES:Data Analytics,Intermediate,Lecture CREATED;VALUE=DATE-TIME:20190705T040557Z DESCRIPTION:Product discovery is an important step in users interaction wi th an ecommerce website and recommendation systems play a very useful part in the discovery process. Recommendation Systems aim to predict the user ’s intent and help connect them to products they need or may be interest ed in. Recommending the right set of products for every user\, understandi ng his intent and foreseeing the future requirements is very important. Th is becomes ever so important with increasing size of the catalog\, making it harder for every user to express their intent through well formed queri es.\n\nIn this talk we’ll motivate different ways in which recommendatio n systems simplify and enrich the discovery/decision process. We will talk about various types of recommendations and algorithms aimed at mining the collective intelligence from the users of our website. We will cover the challenges faced while building the flipkart recommendation system: \n\n- -Where to begin\n\n--How and what data to collect from users\n\n--Analyzin g and understanding the above data to mine personal tastes and behaviour p atterns\n\n--Scaling challenges and achieving performance both at the fron tend and the backend GEO:12.9431582;77.5964488824009 LAST-MODIFIED;VALUE=DATE-TIME:20190705T040647Z LOCATION:Audi 1 - Nimhans Convention Centre\nBangalore\, IN URL:https://hasgeek.com/fifthelephant/2012/schedule/recommendation-play-fl ipkart-MfzhQdUebyQ2DhfcacRPUk BEGIN:VALARM ACTION:display DESCRIPTION:Recommendation Play @Flipkart in Audi 1 in 5 minutes TRIGGER:-PT5M END:VALARM END:VEVENT BEGIN:VEVENT SUMMARY:Targeting the Moment of Truth: Using Big Data in Retail DTSTART;TZID=Asia/Kolkata;VALUE=DATE-TIME:20120728T150000 DTEND;TZID=Asia/Kolkata;VALUE=DATE-TIME:20120728T153000 DTSTAMP;VALUE=DATE-TIME:20200811T123622Z UID:session/V2amN7CR4zYqREyqV2yt62@hasgeek.com CATEGORIES:Industry & Business,Beginner,Lecture CREATED;VALUE=DATE-TIME:20190705T040645Z DESCRIPTION:The key moment of truth for a customer is when he or she makes the choice to take a product from the shelf and put it in to the shopping cart at a store or clicks on the add to my shopping cart button on a web site. \n\nRetailers have been for decades trying to crack this moment of t ruth - how to steer individuals to the right product at the right price an d the right channel. Now with the capability to process big data\, retaile rs have finally started to effectively combine customer data [like demogr aphics\, psychographics\, purchase history\, social\, mobile and location information) with their product offerings and purchase context / channels to take crack at this problem. \n\nIn this session\, we walk through the typical challenges that retailers face in this area and see how some of th e leading retailers have started to harness big data to crack the code. GEO:12.9431582;77.5964488824009 LAST-MODIFIED;VALUE=DATE-TIME:20200619T062515Z LOCATION:Audi 3 - Nimhans Convention Centre\nBangalore\, IN URL:https://hasgeek.com/fifthelephant/2012/schedule/targeting-the-moment-o f-truth-using-big-data-in-retail-V2amN7CR4zYqREyqV2yt62 BEGIN:VALARM ACTION:display DESCRIPTION:Targeting the Moment of Truth: Using Big Data in Retail in A udi 3 in 5 minutes TRIGGER:-PT5M END:VALARM END:VEVENT BEGIN:VEVENT SUMMARY:Application of Data analytics in Small / Medium Enterprise plannin g DTSTART;TZID=Asia/Kolkata;VALUE=DATE-TIME:20120728T160000 DTEND;TZID=Asia/Kolkata;VALUE=DATE-TIME:20120728T163000 DTSTAMP;VALUE=DATE-TIME:20200811T123622Z UID:session/UcfegnNpbUuNJdp5irBLcx@hasgeek.com CATEGORIES:Industry & Business,Beginner,Lecture CREATED;VALUE=DATE-TIME:20190705T040701Z DESCRIPTION:Most mature Small / Medium enterprises (SMEs) have their workf low digitized. Transactions - purchases\, sales are captured in an IT syst em. However\, these IT systems are primarily used transactionally - for bi lling\, receivables\, cash flow\, balance sheet\, and inventory.\n\nDecisi on making in SMEs is largely the entrepreneur's intuition driven. More obj ectivity\, however\, can be brought about in this decision making by using the historical transactional data for forecasting\, production / purchase planning\, pricing\, marketing planning and much more.\n\nEntrepreneur's own low understanding of what can be done with data\, low prevalence of bu siness analytics skill sets amongst the local software vendors\, and absen ce of an affordable ready to use smart analytics products hinder this.\n\n We describe an application of such concepts of such tools from Surat's tex tile industry\, and a proposed architecture for such a tool. GEO:12.9431582;77.5964488824009 LAST-MODIFIED;VALUE=DATE-TIME:20200619T062515Z LOCATION:Audi 1 - Nimhans Convention Centre\nBangalore\, IN URL:https://hasgeek.com/fifthelephant/2012/schedule/application-of-data-an alytics-in-small-medium-enterprise-planning-UcfegnNpbUuNJdp5irBLcx BEGIN:VALARM ACTION:display DESCRIPTION:Application of Data analytics in Small / Medium Enterprise pla nning in Audi 1 in 5 minutes TRIGGER:-PT5M END:VALARM END:VEVENT BEGIN:VEVENT SUMMARY:From Data to Action (in Retail Banking) DTSTART;TZID=Asia/Kolkata;VALUE=DATE-TIME:20120728T160000 DTEND;TZID=Asia/Kolkata;VALUE=DATE-TIME:20120728T163000 DTSTAMP;VALUE=DATE-TIME:20200811T123622Z UID:session/R6KMTvtQPYtsTzGHCFwgFn@hasgeek.com CATEGORIES:Industry & Business,Intermediate,Lecture CREATED;VALUE=DATE-TIME:20190705T040720Z DESCRIPTION:Let’s take an example.\n\nConsider a retail bank offering sa vings products to its customers. It collects all kinds of data – the cus tomers\, the transactions they do\, the amounts of money they keep in thei r accounts. Let’s take a random piece of data.\n\nCustomer x001: ATM use r: Yes\nCustomer x002: ATM user: No\n\nNow what?\n\nBy itself this piece o f data means nothing beyond what it is. To make anything from it\, one nee ds something else – a purpose. I must want to do something for which thi s piece of data can do something.\n\nPurpose is context dependent. If I am in charge of ATM usage at the bank\, my purpose is to increase the number of people using ATMs. For this I need to make the ATM services more custo mer friendly. But what is “customer-friendly”? And how will this piece of data help me?\n\nThe point being made here is that generating the data report is not the end of the process. It is not even the beginning. It is a small input into a larger context. In the above example\, the ATM Manag er will\, most probably\, follow a chain of thinking that goes a bit like this:\n\nWho are the customers who are using the ATM? What do they find at tractive about it?\nWho are the customers who are not using the ATM? What is holding them back?\n\nFrom the random piece of data\, the ATM manager n eeds to extract some relevant information and from that perceive a critica l insight that can take him closer to a solution. For example:\n\nProfile of ATM non-user:\n77% are senior citizens\n68% are rural customers\n44% ar e women\n23% are High Networth Individuals (average account balance > 500\ ,000)\n\nThe above is information gleaned from churning the data. But what useful insight can one draw from it? For this the ATM Manager needs to go beyond the data and study the human beings. An example:\n\nWomen are scar ed of entering ATMs alone. They look at the ATM kiosk as a place where th ey may get robbed\, attacked\, etc. They prefer to go to a branch where th ere is safety in crowds.\n\nNow\, the ATM manager has something concrete t hat he can act upon:\n\nMake ATMs safer for women – relook at lighting\; location\; behavior of security guards\, etc.\n\nThus\, we have a journey from a random piece of data to concrete action points. This journey is wh at fulfills the promise of data. And this journey happens not in the compu ter but in the mind of the user. There is a thinking process that works ba ckwards from the purpose and asks the right questions\, supplements the da ta with appropriate additional information and designs hypotheses for acti on.\n\nHow do we enable the user to make this journey each time\, every ti me? GEO:12.9431582;77.5964488824009 LAST-MODIFIED;VALUE=DATE-TIME:20200619T062515Z LOCATION:Audi 3 - Nimhans Convention Centre\nBangalore\, IN URL:https://hasgeek.com/fifthelephant/2012/schedule/from-data-to-action-in -retail-banking-R6KMTvtQPYtsTzGHCFwgFn BEGIN:VALARM ACTION:display DESCRIPTION:From Data to Action (in Retail Banking) in Audi 3 in 5 minutes TRIGGER:-PT5M END:VALARM END:VEVENT BEGIN:VEVENT SUMMARY:Structuring data from surveys - A case study from a slum survey co nducted in 2010 DTSTART;TZID=Asia/Kolkata;VALUE=DATE-TIME:20120728T160000 DTEND;TZID=Asia/Kolkata;VALUE=DATE-TIME:20120728T163000 DTSTAMP;VALUE=DATE-TIME:20200811T123622Z UID:session/6K5Lp7gDPdrK9u46Ju4R9h@hasgeek.com CATEGORIES:Industry & Business,Intermediate,Lecture CREATED;VALUE=DATE-TIME:20190705T040708Z DESCRIPTION:An extensive survey was carried out across 1000+ households in 36 slums across Bangalore by NGIL\, CSTEP in 2010. We will talk about the process of data collection and cleanup and the challenges that we faced d uring the data collection and during cleanup. We will also give an overvie w of the database schema we designed for the survey. GEO:12.9431582;77.5964488824009 LAST-MODIFIED;VALUE=DATE-TIME:20200619T062515Z LOCATION:Audi 2 - Nimhans Convention Centre\nBangalore\, IN URL:https://hasgeek.com/fifthelephant/2012/schedule/structuring-data-from- surveys-a-case-study-from-a-slum-survey-conducted-in-2010-6K5Lp7gDPdrK9u46 Ju4R9h BEGIN:VALARM ACTION:display DESCRIPTION:Structuring data from surveys - A case study from a slum surve y conducted in 2010 in Audi 2 in 5 minutes TRIGGER:-PT5M END:VALARM END:VEVENT BEGIN:VEVENT SUMMARY:Towards Social Semantic Tools DTSTART;TZID=Asia/Kolkata;VALUE=DATE-TIME:20120728T163000 DTEND;TZID=Asia/Kolkata;VALUE=DATE-TIME:20120728T170000 DTSTAMP;VALUE=DATE-TIME:20200811T123622Z UID:session/VZqMcWQjdeAW6Ci5gXuKbf@hasgeek.com CATEGORIES:Big Data Infrastructure & Processing,Intermediate,Demo CREATED;VALUE=DATE-TIME:20190705T040742Z DESCRIPTION:An example of a conversation related to a certain context that is happening elsewhere is the case of the following blog. The author\, sa y J\, of the blog post http://jace.zaiki.in/2012/06/26/technology-outsourc e-vs-open-source has disabled comments on the post in his own blog but has suggested a link where the comments and discussion could go on. The disc ussion could also have also been on twitter\, facebook or a blog. We show how this discussion in particular and other discussions of interest can be pulled together on to the same page/context of the post of the original a uthor so one need not click through to go elsewhere for discussions. After this example\, we briefly describe how this technique can be used by anyo ne to develop digital heritage walks by assimilating relevant content on t he web. GEO:12.9431582;77.5964488824009 LAST-MODIFIED;VALUE=DATE-TIME:20200619T062515Z LOCATION:Audi 1 - Nimhans Convention Centre\nBangalore\, IN URL:https://hasgeek.com/fifthelephant/2012/schedule/towards-social-semanti c-tools-VZqMcWQjdeAW6Ci5gXuKbf BEGIN:VALARM ACTION:display DESCRIPTION:Towards Social Semantic Tools in Audi 1 in 5 minutes TRIGGER:-PT5M END:VALARM END:VEVENT BEGIN:VEVENT SUMMARY:Analytics is a magic wand for HR professionals DTSTART;TZID=Asia/Kolkata;VALUE=DATE-TIME:20120728T163000 DTEND;TZID=Asia/Kolkata;VALUE=DATE-TIME:20120728T170000 DTSTAMP;VALUE=DATE-TIME:20200811T123622Z UID:session/Wy8YjH8PfWFig2FTEqustC@hasgeek.com CATEGORIES:Industry & Business,Beginner,Lecture CREATED;VALUE=DATE-TIME:20190705T040751Z DESCRIPTION:This session is intended to showcase the power of analytics as a problem solver in the HR space. For years\, the Human Resources functio n grapples with some fundamental challenges\, especially around Hiring Eff ectiveness\, Employee Retention and Resource Optimization. I intend to sha re some analysis frameworks that could be very useful in addressing these issues and can yield very powerful insights. GEO:12.9431582;77.5964488824009 LAST-MODIFIED;VALUE=DATE-TIME:20200619T062515Z LOCATION:Audi 2 - Nimhans Convention Centre\nBangalore\, IN URL:https://hasgeek.com/fifthelephant/2012/schedule/analytics-is-a-magic-w and-for-hr-professionals-Wy8YjH8PfWFig2FTEqustC BEGIN:VALARM ACTION:display DESCRIPTION:Analytics is a magic wand for HR professionals in Audi 2 in 5 minutes TRIGGER:-PT5M END:VALARM END:VEVENT BEGIN:VEVENT SUMMARY:Financial Market and Economic Data DTSTART;TZID=Asia/Kolkata;VALUE=DATE-TIME:20120728T163000 DTEND;TZID=Asia/Kolkata;VALUE=DATE-TIME:20120728T170000 DTSTAMP;VALUE=DATE-TIME:20200811T123622Z UID:session/5atkqcYKvYBG457xSqb3Vp@hasgeek.com CATEGORIES:Data Visualization,Intermediate,Lecture CREATED;VALUE=DATE-TIME:20190705T040759Z DESCRIPTION:Financial markets create massive amounts of data\, from produc t documentation to traded prices. And you find that one piece of informati on is related to another\, or depends on it or is otherwise correlated. Th e use of innovative techniques to visualize data is not just for eye-candy or appeal\, but will hugely influence business or investing decisions. I propose to demonstrate what has been done and what you can do with market data. GEO:12.9431582;77.5964488824009 LAST-MODIFIED;VALUE=DATE-TIME:20190705T040802Z LOCATION:Audi 3 - Nimhans Convention Centre\nBangalore\, IN URL:https://hasgeek.com/fifthelephant/2012/schedule/financial-market-and-e conomic-data-5atkqcYKvYBG457xSqb3Vp BEGIN:VALARM ACTION:display DESCRIPTION:Financial Market and Economic Data in Audi 3 in 5 minutes TRIGGER:-PT5M END:VALARM END:VEVENT END:VCALENDAR