More Data in Chemistry

Clearer reporting of negative experimental results would improve reaction planning in chemistry

13-Jun-2022 - Germany

Databases containing huge amounts of experimental data are available to researchers across a wide variety of chemical disciplines. However, a team of researchers has found that artificial intelligence (AI) and machine learning models trained on the available data fail to predict the yields of new syntheses accurately. Their study, published in the journal Angewandte Chemie, suggests that this is in large part due to the tendency of scientists not to report failed experiments.

Although AI-based models have been particularly successful in predicting molecular structures and material properties, they return rather inaccurate predictions for information relating to product yields in synthesis, as Frank Glorius and his team of researchers at Westfälische Wilhelms-Universität Münster, Germany, have discovered.

The researchers attribute this failure to the data used to train AI systems. “Interestingly, the prediction of reaction yields (reactivity) is much more challenging than the prediction of molecular properties. Reactants, reagents, quantities, conditions, the experimental execution—all determine the yield, and thus, the problem of yield prediction becomes very data-intensive,” explains Glorius. So, despite the huge amounts of available literature and results, the researchers came to realize that the data is not fit for accurate predictions of the expected yield.

The problem is not simply a lack of experiments. Rather, the team identified three possible causes of biased data. First, the results of chemical syntheses may be flawed due to experimental error. Second, when chemists plan their experiments, they may, either consciously or unconsciously, introduce bias based on personal experience and reliance on well-established methods. Finally, since only reactions with a positive outcome are believed to contribute to progress, failed reactions are reported less frequently.

To find out which of these three factors had the greatest influence, Glorius and the team deliberately altered the datasets for four different, commonly used (and therefore data-rich) organic reactions. They artificially increased the experimental error, reduced the size of the datasets, or removed negative results from the data. Their investigations showed that experimental error had the smallest influence on the model, while the lack of negative results had a fundamental effect.
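The three dataset alterations described above can be illustrated with a minimal Python sketch. It is not the authors' code: the function names, the 20 % yield threshold for a "negative" result, the noise level, and the randomly generated placeholder data are all assumptions made for illustration only.

```python
import random


def add_yield_noise(records, sigma, rng):
    """Simulate experimental error: add Gaussian noise to each yield, clipped to 0-100 %."""
    return [(cond, min(100.0, max(0.0, y + rng.gauss(0.0, sigma))))
            for cond, y in records]


def subsample(records, fraction, rng):
    """Simulate a smaller dataset: keep a random fraction of the records."""
    k = max(1, int(len(records) * fraction))
    return rng.sample(records, k)


def drop_negative_results(records, threshold):
    """Simulate reporting bias: discard reactions whose yield is below the threshold."""
    return [(cond, y) for cond, y in records if y >= threshold]


def mean_yield(records):
    """Average yield (in %) over a list of (conditions, yield) records."""
    return sum(y for _, y in records) / len(records)


rng = random.Random(0)

# Synthetic placeholder data: (condition label, yield in %). Not real measurements.
dataset = [(f"conditions_{i}", rng.uniform(0.0, 100.0)) for i in range(1000)]

noisy = add_yield_noise(dataset, sigma=10.0, rng=rng)            # hypothetical noise level
smaller = subsample(dataset, fraction=0.25, rng=rng)             # hypothetical fraction
positives_only = drop_negative_results(dataset, threshold=20.0)  # hypothetical threshold

for name, data in [("original", dataset), ("noisy", noisy),
                   ("smaller", smaller), ("positives only", positives_only)]:
    print(f"{name:15s} n={len(data):4d} mean yield={mean_yield(data):5.1f} %")
```

In a study of this kind, a yield-prediction model would then be trained on each altered dataset and evaluated on the same unaltered test set, so that the loss in accuracy can be attributed to one alteration at a time. The sketch already shows one effect of reporting bias: dropping low-yield reactions raises the average yield the model sees.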

The group hopes that these findings will encourage scientists to always report failed experiments as well as their successes. This would improve data availability for training AI, ultimately helping to speed up planning and make experimentation more efficient. Glorius adds: “Machine learning in (molecular) chemistry will increase efficiency dramatically and fewer reactions will have to be run to achieve a certain goal, for example, an optimization. This will empower chemists and will help them to make chemical processes, and the world, more sustainable.”

Original publication

Dr. Felix Strieth-Kalthoff et al.; Machine Learning for Chemical Reactivity: The Importance of Failed Experiments; Angewandte Chemie International Edition; 2022

https://www.chemeurope.com/en/news/1176460/more-data-in-chemistry.html


