Clinical Studies And The Road To Linked Data
Clinical studies aren’t what they used to be. In the past, the process was one-off: You conducted a study, gathered a lot of data, analyzed it, wrote a report, and submitted it to the authorities. But, says long-time Linked Data advocate Kerstin Forsberg, an information architect at AstraZeneca, that’s all changed in the last few years.
“A study is not a study on its own,” says Forsberg. Today, the goal is to do meta-analysis across many studies, so parties ranging from pharmaceuticals companies to contract research organizations to government authorities all are ‘customers’ of clinical data, so to speak. Data from various studies must be shared among all these parties. “It puts a new context around clinical trial data, that it must be easy to link data together, to link across several different studies,” she says.
The case is there to use modern information standards, like semantic web standards and Linked Data principles, to address this need. It’s why Forsberg is one of the individuals spearheading a volunteer effort to create RDF and OWL representations of the standards published by the Clinical Data Interchange Standards Consortium (CDISC) an international, non-profit organization that develops and supports global data standards for medical research.



The technology, dubbed the Insight Discovery platform, is explained to be the “first machine learning platform that combines computer science and a branch of mathematics known as Topological Data Analysis (TDA) that visualizes the entire dataset.” Hundreds of machine learning algorithms, it says, go to work exploring datasets to in minutes automatically discover insights that can’t be determined through query-based or ad hoc approaches.
Metaome, which was founded by CEO Kalpana Krishnaswami and CTO Ramkumar Nandakumar as a bioinformatics services provider before transitioning to a product vendor, contains a few more than a dozen life sciences public data sets so far. Infomaticians in the life sciences space have the expertise to query such data across sets via SPARQL, but the front-line biologist isn’t necessarily an infomatician. So, DistilBio has created a query interface that makes it easier for them to ask large and complex questions in a simplified way across data sets while building a graph in the process.
A new paper has been published
“In particular, a lot of the strengths of Knowledge Explorer have to do with modeling data as RDF and then testing queries, visualizing and browsing the data to see that you have the ontologies and data mappings you need for your integration and application requirements.” says Robert Stanley, IO Informatics president and CEO. The Personal version is aimed at academic experts focused on data integration and semantic data modeling, as well as personal power users in life sciences and other data-intensive industries, or anyone who wants to learn the tool in anticipation of leveraging their enterprise data sets for collaboration and integration projects.


Eric Franzon
VP Community
Jennifer Zaino
Contributor
Angela Guess Contributor
semanticweb.com Twitter feed loading...