OpenAI seeks partnerships to generate AI training data

OpenAI seeks partnerships to generate AI training data
ChatGPT logo is seen in this illustration taken, Feb 3, 2023.
PHOTO: Reuters file

ChatGPT maker OpenAI said on Thursday (Nov 9) it intends to work with organisations to produce public and private datasets for training artificial intelligence (AI) models.

Popular chatbot ChatGPT, which can generate poems and prose from simple prompts, is based on large language models that are trained entirely on open-source data available on the Internet.

The company's latest effort could help it produce more nuanced training data that are more conversational in style.

"We're particularly looking for data that expresses human intention, across any language, topic and format," the company said in a blog post.

OpenAI said it is seeking partners to help it create an open-source dataset for training language models. This dataset would be public for anyone to use in AI model training, it said.

The company said it is also preparing private datasets for training proprietary AI models.

ALSO READ: OpenAI unveils personalised AI apps as it seeks to expand its ChatGPT consumer business

This website is best viewed using the latest versions of web browsers.