The Fifth Elephant 2016 Looking under the hood - demystifying data toolsThe goal of this talk is to help build an understanding of the performances of the following packages - R Dataframe R data.table Pandas Numpy PySpark RDDs PySpark Dataframes RedShift While these packages are operating in different but intersecting realms of use cases, depending on the cardinality of the data and the operations that will be performed on it, some are more suited than others for the… more
Section: Crisp talk
Technical level: Intermediate
|
The Fifth Elephant 2017 Interestingness of interestingness measuresAnalysis of relationship between entities is at the heart of data mining problems. There are many metrics used for association mining like support, confidence, lift, mutual information etc. However many of these measures provide conflicting results about the interestingness of the association. Therefore it becomes very important to understand how to evaluate metrics for an application. more
Section: Full talk for data engineering track
Technical level: Advanced
|
The Fifth Elephant 2023 Monsoon Building efficient and secure vector data workflowsBackground Large Language Models have demonstrated amazing capability for solving complex problems. But they can’t answer what they haven’t seen, and to take advantage of these amazing models, we need to expose our data to the model. Fine-tuning is not an option, at least not a cheap one. Prompt engineering is a helpful technique to provide context to LLMs, which helps the model restrict its answ… more
|
The Fifth Elephant 2023 Winter Let's build production-ready RAG pipeline the right way!Retrieval Augmented Generation (RAG) is a brilliant technique of augmenting knowledge to Large Language Models (LLMs) so that we can use the power of Language Models for multiple use cases on enterprise and real-time data. more
|
Call for Papers Language Models are Few Shot LearnersLanguage Models are few short learners is an important paper in the space of GenerativeAI and Natural Language Processing. It introduced GPT-3 and showed the capability of large language models to generalize as task-agnostic learners. more
|
Decoding Llama3: An explainer for tinkerers Decoding Llama3: Part 1 - Intro to Llama3The Announcement On 18th April 2024, Meta introduced pre-trained and instruction-fine-tuned language models with 8B and 70B parameters for a broad range of use cases, including improved reasoning. more
|
Decoding Llama3: An explainer for tinkerers Decoding Llama3: Part 2 - Understanding the configurationWe are continuing our series of Decoding Llama3 with the overview of model architecture in this blog. more
|
Decoding Llama3: An explainer for tinkerers Decoding Llama3: Part 3 - NormalisationWe are continuing our series of Decoding Llama3 with Normalisation in this blog. more
|
Decoding Llama3: An explainer for tinkerers Decoding Llama3: Part 4 - Rotary Positional EmbeddingsWe are continuing our series of Decoding Llama3 with Rotary Positional Embeddings in this blog. more
|
Decoding Llama3: An explainer for tinkerers Decoding Llama3: Part 5 - Grouped Query AttentionWe are continuing our series of Decoding Llama3 with Grouped Query Attention in this blog. more
|
Decoding Llama3: An explainer for tinkerers Decoding Llama3: Part 6 - Feed Forward NetworkWe are continuing our series of Decoding Llama3 with Feed Forward Network and SiLU activation function. more
|
Decoding Llama3: An explainer for tinkerers Decoding Llama3: Part 7 - Transformer Block & ModuleWe are continuing our series of Decoding Llama3 with Transformer block and Transformer module. more
|
Bengaluru Systems Meetup #1 Learn about the systems that power GenAI applicationsWhat is a system? A system is a set of things working together as parts of a mechanism or an interconnecting network; it is a complex whole. The definition comes from the Oxford Dictionary. more
|
Simrat Hanspal
@simrathanspal
NLP specialist with 14 years of experience building products. Exploring productivity hacks.
- Joined Jun 2016