GPT-4 artificial intelligence shows some competence in chemistry
GPT-4 could aid chemistry researchers, but limitations reveal the need for improvements
GPT-4, which stands for Generative Pre-trained Transformer 4, belongs to a category of artificial intelligence systems known as large language models. These can gather and analyse vast quantities of information in search of solutions to challenges set by users. One advance for GPT-4 is that it can use information in the form of images in addition to text.
Although the specific datasets used for training GPT-4 have not been disclosed by its developers, it has clearly learned a significant amount of detailed chemistry knowledge. To analyse its capabilities, the researchers set the system a series of chemical tasks focused on organic chemistry – the chemistry of carbon-based compounds. These covered basic chemical theory, the handling of molecular data, predicting the properties of chemicals, the outcome of chemical processes and proposing new chemical procedures.
The results of the investigation were varied, revealing both strengths and significant limitations. GPT-4 displayed a good understanding of general textbook-level knowledge in organic chemistry. It was weak, however, when set tasks dealing with specialized content or unique methods for making specific organic compounds. It displayed only partial efficiency in interpreting chemical structures and converting them into a standard notation. One interesting feat was its ability to make accurate predictions for the properties of compounds that it had not specifically been trained on. Overall, it was able to outperform some existing computational algorithms, but fell short against others.
“The results indicate that GPT-4 can tackle a wide range of tasks in chemical research, spanning from textbook-level knowledge to addressing untrained problems and optimizing multiple variables,” says Hatakeyama-Sato. “Inevitably, its performance relies heavily on the quality and quantity of its training data, and there is much room for improvement in its inference capabilities.”
The researchers emphasise that their work was only a preliminary investigation, and that future research should broaden the scope of the trials and dig deeper into the performance of GPT-4 in more diverse research scenarios.
They also hope to develop their own large language models specializing in chemistry and explore their integration with existing techniques.
“In the meantime, researchers should certainly consider applying GPT-4 to chemical challenges, possibly using hybrid methods that include existing specialized techniques,” Hatakeyama-Sato concludes.
Original publication
Other news from the department science
Get the chemical industry in your inbox
From now on, don't miss a thing: Our newsletter for the chemical industry, analytics, lab technology and process engineering brings you up to date every Tuesday and Thursday. The latest industry news, product highlights and innovations - compact and easy to understand in your inbox. Researched by us so you don't have to.