Adventures in Computing: State of the Semantic Web in the Clinical Domain

In a previous post entitled Why Do We Need Ontologies in Healthcare Applications, I explained the important difference between ontologies, coding systems, and information models of data structures. I also outlined the benefits of using Semantic Web technologies like RDF, RDFS, OWL, SWRL, R2RML, SPARQL, SKOS, and Linked Open Data (LOD). These benefits include:

Reasoning and inferencing which are essential characteristics of intelligent Health IT Systems (iHIT)
Model consistency checking
Open World Assumption (OWA) and Non-Unique Naming Assumption enabling the integration of heterogeneous data sources and knowledge bases using Linked Open Data (LOD) principles. This integration can be accomplished by providing an RDF view over existing relational databases using R2RML (RDB to RDF Mapping Language) and by performing SPARQL federated queries. Intelligent queries can retrieve inferred facts using SPARQL 1.1 Entailment Regimes.
Linking to other biomedical ontologies like SNOMED and the Translational Medicine Ontology
Clinical Knowledge Management (CKM) using OWL to model and execute Clinical Practice Guidelines (CPGs) and Care Pathways (CPs).

Semantic Web in Clinical and Translational Research

The following are papers on how Semantic Web technologies are being used to realize these benefits in the healthcare domain:

A semantic-web oriented representation of the clinical element model for secondary use of electronic health records data in the Journal of the American Medical Informatics Association (JAMIA)
Applying semantic web technologies for phenome-wide scan using an electronic health record linked Biobank in the Journal of Biomedical Semantics
Ontology-based Modeling of Clinical Practice Guidelines: A Clinical Decision Support System for Breast Cancer Follow-up Interventions at Primary Care Settings

Apache Stanbol

I recently came across Apache Stanbol, a new Apache project which is described as "a set of reusable components for semantic content management". What I really like about Apache Stanbol is that it not only works on unstructured data sources, but also integrates a number of other popular Apache open source software which can be used to add a semantic layer to modern RESTful content-oriented applications. These components include:

Apache Tika for text and metadata extraction from a variety of commonly used document formats
Apache OpenNLP for natural language processing and named entity recognition (NER)
Apache Solr for document store and semantic search
Apache Jena as the RDF and Semantic Web framework.

Other open source components like Apache Mahout (a scalable Machine Learning library) can be integrated to provide document recommendation and clustering services.

The Content Enhancers in Stanbol can perform named entity recognition (NER) and link text annotations to external datasets such as DBPedia. In the clinical domain, these enhancers can be used to extract entities from medical records, journal articles, and clinical guidelines. These entities can then be linked to other clinical data sources such as drug and disease databases using Linked Data techniques.

Apache Stanbol also provides Reasoners based on Jena RDFS, OWL, and OWLMini Reasoners as well as the HermiT OWL Reasoner. These reasoners can perform consistency checking and classification. Stanbol supports Inference Rules in the following formats: SWRL, Jena Rules, and SPARQL (by converting Stanbol Rules into SPARQL CONSTRUCTs).

Adventures in Computing

Sunday, February 24, 2013

State of the Semantic Web in the Clinical Domain

Semantic Web in Clinical and Translational Research

Apache Stanbol

No comments:

License

About Me

Disclaimer

Blog Archive

Adventures in Computing

Sunday, February 24, 2013

State of the Semantic Web in the Clinical Domain

Semantic Web in Clinical and Translational Research

Apache Stanbol

No comments:

License

About Me

Disclaimer

Subscribe To

Blog Archive