Data made from scratch for training AI models.

news-15062024-123455

Ever since OpenAI’s ChatGPT ignited the generative AI boom in 2022, the importance of having the right data for training AI models has been evident. Companies need a large amount of accurate and reliable data to ensure their AI models perform effectively. However, obtaining high-quality data, especially in specialized domains like health and finance, can be a challenge.

To address this data scarcity issue, San Francisco-based startup Gretel AI has developed a solution that involves creating synthetic data. Synthetic data is artificially generated data that simulates real-world data characteristics. Gretel’s generative AI-powered system allows users to easily create synthetic datasets for tabular data using natural language prompts. This technology enables companies to generate large amounts of data for training AI models without compromising customer privacy.

According to Gretel’s cofounder and CEO Ali Golshan, synthetic data is gaining traction in 2024 as companies look for alternative data sources to build or fine-tune AI models. Golshan emphasizes that synthetic data should not be equated with low-quality or irrelevant data. Quality is paramount when generating synthetic data to ensure its usefulness in training AI models.

Gretel’s products cater to businesses, organizations, and governments by leveraging existing data to create synthetic datasets tailored for specific tasks. The company’s focus on domain-specific data sets it apart from other synthetic data providers who rely on scraping data from the internet. Golshan envisions a future where companies can monetize their unique datasets by selling synthetic data trained on their proprietary information.

In addition to Gretel, other startups like SynthLabs, Synthetaic, and Clearbox AI are also entering the synthetic data market to meet the growing demand for data-driven AI models. Looking ahead, Gretel plans to establish a synthetic data and model exchange platform, allowing companies to securely access and share AI-trained data. This initiative aims to streamline the data-sharing process and create a safe environment for handling sensitive information.

With $68 million raised in its Series B funding round in 2021, Gretel is well-positioned to capitalize on the rising demand for synthetic data solutions. Golshan envisions Gretel becoming a leading provider of high-quality, secure data services, akin to industry giants like Databricks and Snowflake. By embracing the potential of synthetic data, Gretel aims to revolutionize the way companies train their AI models and unlock new opportunities in the data market.

To stay updated on the latest developments in AI and its impact on business, subscribe to the Eye on AI newsletter for free. Keep abreast of how AI is reshaping industries and driving innovation in the digital age.

Exit mobile version