Blog
SciWalker Studio powers new metric to assess life sciences papers
The need for alternative evaluation methods that can effectively assess the completeness, reliability and relevance of scientific papers is a pressing topic within the research community. While scientists commonly employ Impact Factor and H-index as the standard evaluative tools, recent research has revealed limitations to these metrics, including the lack of assessment at journal or…
Read More“An interconnected landscape”: how SciWalker collates information to support chemical innovation.
The shift towards sustainable chemistry has highlighted the relevance of ionic liquids (ILs) as an area of interest. These non-flammable, low-volatility solvents allow precise property optimization to enhance reactions and enable recycling while reducing waste. However, identifying the optimal ion combination involves consolidating knowledge fragments from across multiple documents and fields. This is where a…
Read MoreOC Processor 101: streamlining knowledge management
In 2006, mathematician and data scientist Clive Humby gave a talk at a conference where he reportedly used the phrase “data is the new oil.” Today, over a decade later, the statement holds its ground and is, in fact, more valid than it ever was. Stretching the metaphor of comparing raw data to raw oil…
Read MoreOntoChem extracts U.S. Food and Drug Administration SPL files
The “Structured Product Labeling” (SPL) files of the United States FDA are a valuable public resource for drugs on the market. Thus, UNII (Unique Ingredient Identifier) numbers are assigned to each drug and its chemical structure information by the FDA registration system. UNII numbers are also used in several databases such as drug labels in…
Read MoreProcessing tables in documents and images
Probably most of the scientific information is captured in tables – for example in US patents from 2001-2017 we have extracted more than 10 million tables containing interesting properties on materials and compounds. At OntoChem we have developed several technologies to extract this knowledge over the last 5 years. These software modules may read different…
Read MoreSemantic homonym resolution – key to reduce the number of false positive search hits
Many words can have different meanings – also known as “homonyms”. Homonymic terms are often the cause for false positive search hits. How do we use semantic indexing to find what you intended to? Homonyms in different knowledge areas: Just take the term “sting” – it could mean a protein named Sting (stimulator of interferon…
Read MoreProcessing images to chemical structures
A lot of scientific information is captured in images – we are using machine learning techniques such as deep neural networks to classify images. For example, we have applied transfer learning to train a deep convolutional neural network for developing a ML classifier that detects if an image contains a chemical structure. If so, this…
Read MoreSODIAC free for academic use
The chemistry enabled ontology editor software SODIAC has been released for free academic use. SODIAC may be used to create general ontologies or chemical ontologies and to perform advanced ontology manipulations for very large ontologies with multimillion concepts in the OBO format. For annotating chemistry classes to chemical compounds a ChemAxon JChem Base license is…
Read More