Text-mined dataset of inorganic materials synthesis recipes
Authors
Olga Kononova
Haoyan Huo
Tanjin He
Abstract
An auto-generated, open-source dataset of 19,744 chemical reactions retrieved from 53,538 solid-state synthesis paragraphs. The data were collected using an automated extraction pipeline that converts unstructured scientific paragraphs describing inorganic materials synthesis into so-called "codified recipes" of synthesis. The pipeline uses a variety of text mining and natural language processing (NLP) approaches to find information about target materials, starting compounds, synthesis steps, and conditions in the text, and to process them into a chemical equation. Submitted as a Data Descriptor to Scientific Data: SDATA-19-00539.
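To make the idea of a "codified recipe" concrete, the following is a minimal sketch of how one extracted record might be represented. The class names, field names, and the `reaction_string` helper are all hypothetical and are not the dataset's actual schema; the condition values are illustrative placeholders, not entries taken from the dataset. The example uses the well-known solid-state reaction of BaCO3 with TiO2 to form BaTiO3.

```python
from dataclasses import dataclass, field
from typing import Dict, List


@dataclass
class SynthesisStep:
    # Hypothetical: one operation found in the text plus any conditions tied to it.
    operation: str
    conditions: Dict[str, str] = field(default_factory=dict)


@dataclass
class CodifiedRecipe:
    # Hypothetical record layout mirroring the items named in the abstract:
    # target material, starting compounds, synthesis steps, and conditions.
    target: str
    precursors: List[str]
    steps: List[SynthesisStep]
    byproducts: List[str] = field(default_factory=list)

    def reaction_string(self) -> str:
        # Join starting compounds on the left, target and byproducts on the right.
        left = " + ".join(self.precursors)
        right = " + ".join([self.target] + self.byproducts)
        return f"{left} -> {right}"


# Illustrative values only; not taken from the dataset.
recipe = CodifiedRecipe(
    target="BaTiO3",
    precursors=["BaCO3", "TiO2"],
    steps=[
        SynthesisStep("mix"),
        SynthesisStep("heat", {"temperature": "placeholder", "time": "placeholder"}),
    ],
    byproducts=["CO2"],
)

print(recipe.reaction_string())  # BaCO3 + TiO2 -> BaTiO3 + CO2
```

Keeping the steps as an ordered list, each with its own conditions, preserves the sequence in which operations appear in the source paragraph; the actual dataset files may organize this differently.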