Episode 149:

Turning Tables Into APIs for Real-time Data Apps, Featuring Matteo Pelati and Vivek Gudapuri of Dozer

August 2, 2023

This week on The Data Stack Show, Eric and Kostas chat with Matteo Pelati and Vivek Gudapuri, Co-Founders of Dozer, a company that helps users turn various data sources into APIs for real-time data access. During the conversation, the group discusses the problems that led to the creation of Dozer and how it bridges the gap between data engineering and application engineering. Topics also include the components and workflow of Dozer, its handling of schema changes, working with event streams, use cases, the importance of reliability and observability in Dozer’s data-to-API solution, and more.

Notes:

Highlights from this week’s conversation include:

  • Building Dozer: Simplifying Data Sources into APIs (1:13)
  • Bridging Data Engineering with Application Engineering (4:19)
  • Turning Data Sources into APIs (7:46)
  • The cost of caching (12:59)
  • Challenges with legacy systems (14:30)
  • Real-time data integration (19:31)
  • YAML and SQL experience (25:37)
  • Behind the scenes of Dozer (29:18)
  • Heavy Workloads and Low Latency (42:00)
  • Use Cases of Dozer (45:51)
  • Reliability and storing data from different connectors (51:35)
  • Importance of observability in serving data to customers (53:24)
  • Final thoughts and takeaways (56:34)

 

The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data.

RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com.