News & Analysis
/
Article

Generating high-quality training data for atomic neural networks with virtual reality

NOV 27, 2020
Virtual reality allows users to manipulate and create low-dimensional molecular structures that can be input as training data to predict the energies of larger, complex molecular systems.
Generating high-quality training data for atomic neural networks with virtual reality internal name

Generating high-quality training data for atomic neural networks with virtual reality lead image

A growing trend in the field of computational chemistry is applying machine learning to calculate the structures and properties of molecules. Machine learning methods allow researchers to surpass prior limitations originating from a lack of sufficient processing power. The emphasis has shifted to creating superior data to train algorithms rather than upping the computational speed.

Amabilino et al. demonstrate the use of virtual reality as an efficient, intuitive way to generate a high-quality training dataset for the purpose of deriving the energy functions of large molecular systems. They developed a program featuring real-time interactive quantum molecular dynamics that a user can directly interact with through a virtual reality headset and controllers. The software package, named Narupa, is completely open source and free for any group to use.

“A scientist can literally go into virtual reality and reach out to touch the molecule as if it’s a tangible object,” said senior author David Glowacki. “The program is running according to a real-time physics simulation, so the user can set up these different geometries that can then be fed into the machine from which to learn.”

With this technique, the researchers created six different training datasets containing only smaller hydrocarbons with up to six carbon atoms in each molecule. These datasets were then fed into atomic neural networks, which were able to accurately predict the energies of much higher-dimensional systems containing nearly 100 atoms. Specifically, they determined the energy of a large hydrocarbon chain called squaline reacting with a cyano radical.

The results suggest that even small training datasets, when intelligently curated, can guide neural networks to fit accurate potential energy surfaces for large molecular systems.

Source: “Training atomic neural networks using fragment-based data generated in virtual reality,” by Silvia Amabilino, Lars A. Bratholm, Simon Jonathan Bennie, Michael O’Connor, and David Glowacki, Journal of Chemical Physics (2020). The article can be accessed at http://doi.org/10.1063/5.0015950 .

Related Topics
More Science
AAS
/
Article
Known as ASTERIS, the AI network removes noise from images to reveal features a full magnitude fainter than before.
AAS
/
Article
Stars have a hard time forming in the extreme environment around our Milky Way’s black hole. New data promises to explain why.
AAS
/
Article
This month’s episode showcases the stars and planets visible on March evenings. First up: March 3rd’s predawn a total lunar eclipse! Then track down three planets after sunset, and savor the easy-to-spot Winter Triangle of bright stars.
AAS
/
Article
Experts are concerned that the satellites could ruin dark skies, pollute the atmosphere, and worsen space debris. The public has a limited time to comment.