Episode 177:

AI-Based Data Cleaning, Data Labelling, and Data Enrichment with LLMs Featuring Rishabh Bhargava of refuel

February 14, 2024

This week on The Data Stack Show, Eric and Kostas chat with Rishabh Bhargava, Co-Founder and CEO of refuel. During the episode, the group discusses the evolution of AI, machine learning, and large language models (LLMs). Rish shares his background and the inception of refuel, which focuses on making clean and reliable data accessible for businesses through data cleaning, labeling, and enrichment using LLMs. The conversation explores the impact of LLMs on data quality, the challenges of implementing LLM technology, and the user experience of working with LLMs. They also touch on the importance of confidence scores in machine learning, the iterative process of model training, a practical use case involving refuel and RudderStack, and more.

Notes:

Highlights from this week’s conversation include:

  • The overview of refuel (0:33)
  • The evolution of AI and LLMs (3:51)
  • Types of LLMs (12:31)
  • Implementing LLM use cases and cost considerations (15:52)
  • User experience and fine-tuning LLMs (21:49)
  • Categorizing search queries (22:44)
  • Creating internal benchmark framework (29:50)
  • Benchmarking and evaluation (35:35)
  • Using refuel for documentation (44:18)
  • The challenges of analytics (46:45)
  • Using customer support ticket data (48:17)
  • The tagging process (50:18)
  • Understanding confidence scores (59:22)
  • Training the model with human feedback (1:02:37)
  • Final thoughts and takeaways (1:05:48)

 

The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data.

RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com.