
Data Without Barriers: Synthetic Data as a Catalyst for Responsible Innovation
The ability to access and use high-quality data is becoming a key enabler, and bottleneck, for innovation across AI and digital systems. Yet privacy constraints, regulation, and data scarcity continue to limit what organizations and researchers can do. Synthetic data generation is increasingly emerging as a powerful ingredient for enabling responsible, inclusive, and scalable data-driven innovation.
In this talk, I’ll introduce a broader vision for data democratization, with synthetic data playing a central role. I’ll walk through how generative AI models can be used to synthesize rich, realistic tabular datasets, and how these can be safely shared and applied across a wide range of use cases, from AI model development and testing to fairness research, simulation, and beyond.
The session will include a live walkthrough of open-source tools, showcasing how accessible and practical synthetic data generation can be today.