This week on The Data Stack Show, Eric and Kostas chat with Omar Maher, the Director of Product Marketing at Parallel Domain. During the episode, the group discusses synthetic data in the context of computer vision and autonomous vehicle development. Omar shares his background in data and machine learning and explains how synthetic data can be used to generate labeled data that is fresh, clean, and useful for training and testing machine learning models. The conversation also includes the challenges of obtaining high-quality labeled data for computer vision projects, the importance of addressing edge cases, ethical implications of using synthetic data to train AI models, and more.
Highlights from this week’s conversation include:
The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data.
RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com.
Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data.
To keep up to date with our future episodes, subscribe to our podcast on Apple, Spotify, Google, or the player of your choice.
Get a monthly newsletter from The Data Stack Show team with a TL;DR of the previous month’s shows, a sneak peak at upcoming episodes, and curated links from Eric, John, & show guests. Follow on our Substack below.