Conversations at the intersection of data engineering and business

with Kostas Pardalis & Eric Dodds

Episode 186

April 24, 2024

with Andrew Lamb

– Staff Engineer, InfluxData, PMC Apache Arrow

This week on The Data Stack Show, Eric and Kostas chat with Andrew Lamb, a Staff Engineer at InfluxData. During the episode, Andrew takes us on a deep dive into the intricacies of time series databases and the evolution of data systems. He discusses the specialized challenges of managing high cardinality data and the trade-offs in query performance. The conversation also touches on the development of DataFusion, its adaptation for time series data, and the potential for innovation in the query language space. The episode concludes with a look at the future of data tooling and the exciting possibilities that arise from removing traditional constraints in database architecture, with each person expressing enthusiasm for the role of projects like DataFusion in shaping the landscape of data systems. Don’t miss this episode!

Episode 3

April 18, 2024

with Pete Soderling

– Founder, Data Council & Zero Prime Ventures

It’s a special edition of The Data Stack Show as we come to you from the Data Council in Austin, Texas. Brooks and Matthew co-host the show to bring you some bonus episodes from some of the leading voices in the data space. This episode features Data Council founder Pete Soderling. In this conversation, Pete reflects on the conference’s evolution, its pause during the pandemic, and its successful return with community support. The episode highlights the technical depth of the conference, its vendor-neutral stance, and the diversity of its attendees, ranging from engineers to industry leaders. Pete shares his pride in the community’s growth and the conference’s role in nurturing data professionals and founders, the careful curation of speakers, the conference’s expansion through the years across the data stack, and more.

Episode 2

April 17, 2024

with Tristan Zajonc

– Co-Founder & CEO, Continual

It’s a special edition of The Data Stack Show as we come to you from the Data Council in Austin, Texas. Brooks and Matthew co-host the show to bring you some bonus episodes from some of the leading voices in the data space. In this episode, Tristan Zajonc returns to the podcast to discuss the evolution of AI and its integration into applications. Tristan is the Co-Founder and CEO of Continual. In this discussion, the group covers the shift towards generative AI in data science, the progression of machine learning in production, Continual’s AI copilot platform, and the importance of reliability and low latency in AI responses. The conversation also touches on the challenges and future potential of AI copilots in complex industries and large enterprises, considering regulatory and technological breakthroughs needed for widespread adoption, and more.

Episode 1

April 15, 2024

with Ryan Dolley

– Vice President of Product Strategy, GoodData

It’s a special edition of The Data Stack Show as we come to you from the Data Council in Austin, Texas. Brooks and Matthew co-host the show to bring you some bonus episodes from some of the leading voices in the data space. In this episode, Ryan Dolley, Vice President of Product Strategy at GoodData, joins the show. During the conversation, Ryan shares his journey from creative arts to data, emphasizing the importance of understanding human behavior in both fields. The discussion also covers his diverse experiences in the data industry, the existential question of what to do with abundant data, the industry’s hype cycles, the challenges of self-serve data projects, the need for a balance between autonomy and governance in analytics, and more.

Episode 185

April 10, 2024

with Ryan Blue

– Co-Founder and CEO, Tabular

This week on The Data Stack Show, Eric and Kostas chat with Ryan Blue, the Co-Founder and CEO of Tabular, creator of Iceberg, and a former Cloudera and Netflix employee. During the episode, Ryan discusses the challenges of managing large-scale data and the development of Iceberg, a new table format. He explains Iceberg’s benefits, such as automatic partitioning and improved metadata management, which simplify data engineers’ tasks and enhance query performance. The conversation covers the importance of atomicity in analytics systems, the scalability of Iceberg, and the trade-offs in mixed workload environments. Additionally, Ryan addresses the differences in cloud object storage performance and the integration of security and access controls into distributed file systems. He also touches on recent Iceberg updates, including Python and Rust support, the anticipation of view support in the upcoming release, and more.