Interview Episodes

Episode 186

April 24, 2024

with Andrew Lamb

 – Staff Engineer, InfluxData, PMC Apache Arrow

This week on The Data Stack Show, Eric and Kostas chat with Andrew Lamb, a Staff Engineer at InfluxData. During the episode, Andrew takes us on a deep dive into the intricacies of time series databases and the evolution of data systems. He discusses the specialized challenges of managing high cardinality data and the trade-offs in query performance. The conversation also touches on the development of Data Fusion, its adaptation for time series data, and the potential for innovation in the query language space. The episode concludes with a look at the future of data tooling and the exciting possibilities that arise from removing traditional constraints in database architecture with each person expressing enthusiasm for the role of projects like Data Fusion in shaping the landscape of data systems. Don’t miss this episode!

Episode 3

April 18, 2024

with Pete Soderling

 – Founder, Data Council & Zero Prime Ventures

It’s a special edition of The Data Stack Show as we come to you from the Data Council in Austin, Texas. Brooks and Matthew co-host the show to bring you some bonus episodes from some of the leading voices in the data space. This episode features Data Council founder Pete Soderling. In this conversation, Pete reflects on the conference’s evolution, its pause during the pandemic, and its successful return with community support. The episode highlights the technical depth of the conference, its vendor-neutral stance, and the diversity of its attendees, ranging from engineers to industry leaders. Pete shares his pride in the community’s growth and the conference’s role in nurturing data professionals and founders, the careful curation of speakers, the conference’s expansion through the years across the data stack, and more. 

Episode 2

April 17, 2024

with Tristan Zajonc

 – Co-Founder & CEO, Continual

It’s a special edition of The Data Stack Show as we come to you from the Data Council in Austin, Texas. Brooks and Matthew co-host the show to bring you some bonus episodes from some of the leading voices in the data space. This episode, Tristan Zajonc returns to the podcast to discuss the evolution of AI and its integration into applications. Tristan is the Co-Founder and CEO of Continual. In this discussion, the group covers the shift towards generative AI in data science, the progression of machine learning in production, Continual AI copilot platform and the importance of reliability and low latency in AI responses. The conversation also touches on the challenges and future potential of AI copilots in complex industries and large enterprises, considering regulatory and technological breakthroughs needed for widespread adoption, and more. 

Episode 1

April 15, 2024

with Ryan Dolley

 – Vice President of Product Strategy, GoodData

It’s a special edition of The Data Stack Show as we come to you from the Data Council in Austin, Texas. Brooks and Matthew co-host the show to bring you some bonus episodes from some of the leading voices in the data space. This episode, Ryan Dolley, Vice President of Product Strategy at GoodData joins the show. During the conversation, Ryan shares his journey from creative arts to data, emphasizing the importance of understanding human behavior in both fields. The discussion also covers his diverse experiences in the data industry, the existential question of what to do with abundant data, the industry’s hype cycles, the challenges of self-serve data projects, the need for a balance between autonomy and governance in analytics, and more. 

Episode 185

April 10, 2024

with Ryan Blue

 – Co-Founder and CEO, Tabular

This week on The Data Stack Show, Eric and Kostas chat with Ryan Blue, the Co-Founder and CEO of Tabular, and also creator of Iceberg and former Cloudera and Netflix employee. During the episode, Ryan discusses the challenges of managing large-scale data and the development of Iceberg, a new table format. He explains Iceberg’s benefits, such as automatic partitioning and improved metadata management, which simplify data engineers’ tasks and enhance query performance. The conversation covers the importance of atomicity in analytics systems, the scalability of Iceberg, and the trade-offs in mixed workload environments. Additionally, Ryan addresses the differences in cloud object storage performance and the integration of security and access controls into distributed file systems. He also touches on recent Iceberg updates, including Python and Rust support, the anticipation of view support in the upcoming release, and more. 

Episode 184

April 3, 2024

with Apurva Mehta

 – Co-Founder and CEO, Responsive

This week on The Data Stack Show, Eric and Kostas chat with Apurva Mehta, Co-Founder and CEO of Responsive, about event-driven applications and the necessary infrastructure. Apruva shares his journey from LinkedIn to Confluent and eventually founding Responsive, focusing on managing event-driven applications in the cloud. The discussion covers the definition of event-driven applications, the significance of latency and state in event processing, and the evolution of Kafka and Kafka Streams. They also explore the challenges of managing Kafka in production, the developer experience with Kafka Streams, and the operational complexities of running distributed stateful applications. Apruva highlights Responsive’s approach to simplifying the management of these applications, the potential for innovation in event-driven architectures, and more. 

Episode 183

March 27, 2024

with Chad Sanderson

 – CEO, Gable.ai

This week on The Data Stack Show, Eric and Kostas chat with Chad Sanderson, the CEO at Gable.ai. During the episode, Chad discusses the complexities of managing the data supply chain, emphasizing the importance of data quality, feedback loops, and aligning incentives within organizations. He shares his journey from analyst to data infrastructure leader at companies like Oracle, Sephora, and Microsoft. Chad introduces his company, Gable, which tackles upstream data quality issues. He critiques traditional data catalogs and advocates for a more dynamic, decentralized approach. The conversation explores the role of metadata, the integration of data quality checks in the software development lifecycle, the need for cultural shifts towards data responsibility, the significance of full lineage graphs and semantic metadata, treating data as a product with quality gates, and more.

Episode 182

March 20, 2024

with Kevin Liu

 – Software Engineer, Stripe

This week on The Data Stack Show, Eric and Kostas chat with Kevin Liu, Software Engineer at Stripe. During the episode, Kevin discusses data infrastructure challenges and the development of data products. He also shares insights on the importance of metadata management and the role of catalogs in maintaining data consistency across various systems. The conversation also covers open-source projects like the Python Iceberg library and the future of databases in the cloud, the ease of use of internal tools, the integration of data for builders, the balance between simplicity and functionality in user interfaces, and more.

Episode 181

March 13, 2024

with Mike Driscoll

 – CEO, Rill Data

This week on The Data Stack Show, Eric and Kostas chat with Mike Driscoll, the CEO of Rill Data. During the episode, Mike recounts his journey from the Human Genome Project to developing the Druid engine, which was created to handle massive advertising data. He discusses Druid’s adoption by major companies and its evolution, emphasizing the importance of speed, simplicity, and scalability in data tools. The dialogue covers the progression of BI tools, the role of object stores, and the integration of AI in data technology. Mike also touches on the significance of SQL and AI’s influence on data visualization, what he would do if he wasn’t working in data, and more.

Episode 180

March 6, 2024

with Kunal Agarwal

 – Co-Founder and CEO, Unravel Data

This week on The Data Stack Show, Eric and Kostas chat with Kunal Agarwal, the Co-Founder and CEO of Unravel Data. During the episode, Kunal discusses the evolution of data operations and the role of Unravel in simplifying these processes. The group discusses the shift towards real-time workloads, the impact of AI and machine learning, and the challenges of cloud migration and managing complex data environments. Kunal shares his journey from fashion to data management and emphasizes the importance of observability for data ops teams. The conversation also covers cost optimization, the productivity of data teams, reliability of data systems, the unique cost management considerations in cloud versus on-premises setups, and more.