Episode 189

May 16, 2024

Customer Data Modeling, The Data Warehouse, Reverse ETL, and Data Activation with Ryan McCrary of RudderStack

with Ryan McCrary

 – Product Manager, RudderStack

This week on The Data Stack Show, Eric and John chat with Ryan McCrary, Product Manager at Rudderstack. During the episode, the group explores the complexities of customer data management, focusing on data activation, identity resolution, and entity management. They also discuss Rudderstack’s profiles product, which aims to bring business users closer to data, making it actionable within their existing tools. The episode covers the challenges of stitching user profiles deterministically, handling anomalies, and the significance of reverse ETL in the data industry. They also touch on the importance of data ownership, visibility between teams, and the role of machine learning in building a data foundation. Overall, the conversation sheds light on the evolving landscape of data management and the need for structured, collaborative tools in the space.

Episode 188

May 8, 2024

How To Invest in Data Infrastructure and Data Projects That Create Business Value with Matthew Kelliher-Gibson of Rudderstack

with Matthew Kelliher-Gibson

 – Technical Product Marketing Manager, Rudderstack

This week on The Data Stack Show, Eric and John chat with Matthew Kelliher-Gibson, Technical Product Marketing Manager at Rudderstack. During the episode, Matthew shares his extensive background in data and data science. With over a decade of experience across various industries, Matthew discusses his transition from analyst to managing data science teams and his journey through political campaigns to an MBA focused on business analytics. The group explores the challenges of purchasing data tools, emphasizing the need to drive business value by cutting costs and increasing revenue. They also discuss strategies for working with business users to prioritize company needs and build trust, personal productivity tools, managing tasks and knowledge, and more.

Episode 187

May 1, 2024

Startup Lessons and Torch Passing with Kostas Pardalis

with Kostas Pardalis

 – Co-Founder in Stealth Mode

This week on The Data Stack Show, Eric and Kostas welcome John Wessel who will be taking over as the new co-host of the podcast as Kostas passes the baton. Kostas reminisces about his journey in starting the podcast and details the challenges of getting the show off the ground. He also shares his journey in the data world, the idea for beginning the podcast, and how his curiosity-driven approach to conversations with industry leaders has grown the show into the success it is today. Kostas also talks about stepping back to focus on a new data infrastructure venture, offering advice to the new co-host, John about what he can expect on the show. Don’t miss this episode!

Episode 186

April 24, 2024

Open Source and the Evolution of Data Systems with Andrew Lamb of InfluxData

with Andrew Lamb

 – Staff Engineer, InfluxData, PMC Apache Arrow

This week on The Data Stack Show, Eric and Kostas chat with Andrew Lamb, a Staff Engineer at InfluxData. During the episode, Andrew takes us on a deep dive into the intricacies of time series databases and the evolution of data systems. He discusses the specialized challenges of managing high cardinality data and the trade-offs in query performance. The conversation also touches on the development of Data Fusion, its adaptation for time series data, and the potential for innovation in the query language space. The episode concludes with a look at the future of data tooling and the exciting possibilities that arise from removing traditional constraints in database architecture with each person expressing enthusiasm for the role of projects like Data Fusion in shaping the landscape of data systems. Don’t miss this episode!

Episode 3

April 18, 2024

Data Council Week: A Decade of Supporting the Data Community with Pete Soderling

with Pete Soderling

 – Founder, Data Council & Zero Prime Ventures

It’s a special edition of The Data Stack Show as we come to you from the Data Council in Austin, Texas. Brooks and Matthew co-host the show to bring you some bonus episodes from some of the leading voices in the data space. This episode features Data Council founder Pete Soderling. In this conversation, Pete reflects on the conference’s evolution, its pause during the pandemic, and its successful return with community support. The episode highlights the technical depth of the conference, its vendor-neutral stance, and the diversity of its attendees, ranging from engineers to industry leaders. Pete shares his pride in the community’s growth and the conference’s role in nurturing data professionals and founders, the careful curation of speakers, the conference’s expansion through the years across the data stack, and more.

Episode 2

April 17, 2024

Data Council Week: AI Isn’t Just Hype – How To Successfully Apply LLMs Today with Tristan Zajonc of Continual

with Tristan Zajonc

 – Co-Founder & CEO, Continual

It’s a special edition of The Data Stack Show as we come to you from the Data Council in Austin, Texas. Brooks and Matthew co-host the show to bring you some bonus episodes from some of the leading voices in the data space. This episode, Tristan Zajonc returns to the podcast to discuss the evolution of AI and its integration into applications. Tristan is the Co-Founder and CEO of Continual. In this discussion, the group covers the shift towards generative AI in data science, the progression of machine learning in production, Continual AI copilot platform and the importance of reliability and low latency in AI responses. The conversation also touches on the challenges and future potential of AI copilots in complex industries and large enterprises, considering regulatory and technological breakthroughs needed for widespread adoption, and more.

Episode 1

April 15, 2024

Data Council Week: How To Do Self-Service Data Analytics and Business Intelligence Right with Ryan Dolley of GoodData

with Ryan Dolley

 – Vice President of Product Strategy, GoodData

It’s a special edition of The Data Stack Show as we come to you from the Data Council in Austin, Texas. Brooks and Matthew co-host the show to bring you some bonus episodes from some of the leading voices in the data space. This episode, Ryan Dolley, Vice President of Product Strategy at GoodData joins the show. During the conversation, Ryan shares his journey from creative arts to data, emphasizing the importance of understanding human behavior in both fields. The discussion also covers his diverse experiences in the data industry, the existential question of what to do with abundant data, the industry’s hype cycles, the challenges of self-serve data projects, the need for a balance between autonomy and governance in analytics, and more.

Episode 185

April 10, 2024

The Evolution of Data Processing, Data Formats, and Data Sharing with Ryan Blue of Tabular

with Ryan Blue

 – Co-Founder and CEO, Tabular

This week on The Data Stack Show, Eric and Kostas chat with Ryan Blue, the Co-Founder and CEO of Tabular, and also creator of Iceberg and former Cloudera and Netflix employee. During the episode, Ryan discusses the challenges of managing large-scale data and the development of Iceberg, a new table format. He explains Iceberg’s benefits, such as automatic partitioning and improved metadata management, which simplify data engineers’ tasks and enhance query performance. The conversation covers the importance of atomicity in analytics systems, the scalability of Iceberg, and the trade-offs in mixed workload environments. Additionally, Ryan addresses the differences in cloud object storage performance and the integration of security and access controls into distributed file systems. He also touches on recent Iceberg updates, including Python and Rust support, the anticipation of view support in the upcoming release, and more.

Episode 184

April 3, 2024

Kafka Streams and Operationalizing Event Driven Applications with Apurva Mehta of Responsive

with Apurva Mehta

 – Co-Founder and CEO, Responsive

This week on The Data Stack Show, Eric and Kostas chat with Apurva Mehta, Co-Founder and CEO of Responsive, about event-driven applications and the necessary infrastructure. Apruva shares his journey from LinkedIn to Confluent and eventually founding Responsive, focusing on managing event-driven applications in the cloud. The discussion covers the definition of event-driven applications, the significance of latency and state in event processing, and the evolution of Kafka and Kafka Streams. They also explore the challenges of managing Kafka in production, the developer experience with Kafka Streams, and the operational complexities of running distributed stateful applications. Apruva highlights Responsive’s approach to simplifying the management of these applications, the potential for innovation in event-driven architectures, and more.

Episode 183

March 27, 2024

Why Modern Data Quality Must Move Beyond Traditional Data Management Practices with Chad Sanderson of Gable.ai

with Chad Sanderson

 – CEO, Gable.ai

This week on The Data Stack Show, Eric and Kostas chat with Chad Sanderson, the CEO at Gable.ai. During the episode, Chad discusses the complexities of managing the data supply chain, emphasizing the importance of data quality, feedback loops, and aligning incentives within organizations. He shares his journey from analyst to data infrastructure leader at companies like Oracle, Sephora, and Microsoft. Chad introduces his company, Gable, which tackles upstream data quality issues. He critiques traditional data catalogs and advocates for a more dynamic, decentralized approach. The conversation explores the role of metadata, the integration of data quality checks in the software development lifecycle, the need for cultural shifts towards data responsibility, the significance of full lineage graphs and semantic metadata, treating data as a product with quality gates, and more.

Interview Episodes

Episode 189

with Ryan McCrary

– Product Manager, RudderStack

Episode 188

with Matthew Kelliher-Gibson

– Technical Product Marketing Manager, Rudderstack

Episode 187

with Kostas Pardalis

– Co-Founder in Stealth Mode

Episode 186

with Andrew Lamb

– Staff Engineer, InfluxData, PMC Apache Arrow

Episode 3

with Pete Soderling

– Founder, Data Council & Zero Prime Ventures

Episode 2

with Tristan Zajonc

– Co-Founder & CEO, Continual

Episode 1

with Ryan Dolley

– Vice President of Product Strategy, GoodData

Episode 185

with Ryan Blue

– Co-Founder and CEO, Tabular

Episode 184

with Apurva Mehta

– Co-Founder and CEO, Responsive

Episode 183

with Chad Sanderson

– CEO, Gable.ai

About the Podcast

Sign Up for The Data Stack Show Newsletter

 – Product Manager, RudderStack

 – Technical Product Marketing Manager, Rudderstack

 – Co-Founder in Stealth Mode

 – Staff Engineer, InfluxData, PMC Apache Arrow

 – Founder, Data Council & Zero Prime Ventures

 – Co-Founder & CEO, Continual

 – Vice President of Product Strategy, GoodData

 – Co-Founder and CEO, Tabular

 – Co-Founder and CEO, Responsive

 – CEO, Gable.ai