A novel ontology and machine learning driven hybrid cardiovascular clinical prognosis as a complex adaptive clinical system

Farooq, Kamran; Hussain, Amir

doi:10.1186/s40294-016-0023-x

Research
Open access
Published: 12 July 2016

A novel ontology and machine learning driven hybrid cardiovascular clinical prognosis as a complex adaptive clinical system

Complex Adaptive Systems Modeling volume 4, Article number: 12 (2016) Cite this article

3489 Accesses
12 Citations
3 Altmetric
Metrics details

Abstract

Purpose

This multidisciplinary industrial research project sets out to develop a hybrid clinical decision support mechanism (inspired by ontology and machine learning driven techniques) by combining evidence, extrapolated through legacy patient data to facilitate cardiovascular preventative care.

Methods

The proposed cardiovascular clinical decision support framework comprises of two novel key components: (1) Ontology driven clinical risk assessment and recommendation system (ODCRARS) (2) Machine learning driven prognostic system (MLDPS). State of the art machine learning and feature selection methods are utilised for the prognostic modelling purposes. The ODCRARS is a knowledge-based system which is based on clinical expert’s knowledge, encoded in the form of clinical rules engine to carry out cardiac risk assessment for various cardiovascular diseases. The MLDPS is a non knowledge-based/data driven system which is developed using state of the art machine learning and feature selection techniques applied on real patient datasets. Clinical case studies in the RACPC, heart disease and breast cancer domains are considered for the development and clinical validation purposes. For the purpose of this paper, clinical case study in the RACPC/chest pain domain will be discussed in detail from the development and validation perspective.

Results

The proposed clinical decision support framework is validated through clinical case studies in the cardiovascular domain. This paper demonstrates an effective cardiovascular decision support mechanism for handling inaccuracies in the clinical risk assessment of chest pain patients and help clinicians effectively distinguish acute angina/cardiac chest pain patients from those with other causes of chest pain.

Conclusion

The new clinical models, having been evaluated in clinical practice, resulted in very good predictive power, demonstrating general performance improvement over benchmark multivariate statistical classifiers. Various chest pain risk assessment prototypes have been developed and deployed online for further clinical trials.

Introduction

The adoption of clinical decision support systems (CDSSs) in the diagnosis and administration of major chronic diseases e.g. (Dementia Lindgren 2011), cancer, diabetes (OConnor et al. 2011), hypertension (Luitjes et al. 2010) and heart disease (DeBusk et al. 2010) have made significant contributions in improving the clinical outcomes at primary and secondary care healthcare organisations all over the world. CDSS have also made it possible for system developers and knowledge engineers to collate and construct domain expert knowledge for the purpose of clinical risk assessment and screening by clinicians (Khong and Ren 2011).

Clinical decision support systems are being extensively deployed in healthcare settings all over the world. Modern clinical decision support systems are increasingly dissimilar to each other, despite following the same generic architecture which defines a typical CDSS (Burstein et al. 2011). These clinical decision support systems incorporate a variety of innovative techniques to perform various key operations which include clinical knowledge dissemination and collecting patient’s medical history for effective clinical decision making. These systems aim to provide clinical decision support and automatic personalised clinical advice through inference capabilities (Mohiuddin 2011). They also help to streamline clinical workflows through integration with electronic healthcare records for patient clinical history collection, diagnosis, inference and training.

Clinical decision support operations are an integral part of modern healthcare management systems. They assist clinicians, patients and healthcare stakeholders by providing expert clinical knowledge and patient-centric information (Classen et al. 2011). The information provided by these intelligent clinical systems is used for clinical decision making in order to improve the effectiveness and quality of healthcare. Automated cardiovascular decision support systems are now being deployed in hospitals and primary care organizations in order to meet the ever growing clinical needs of prognosis in the areas of cardiovascular disease and coronary heart disease. Computerized decision support strategies have already been implemented successfully in several areas of cardiovascular care (Kuperman et al. 2007). These applications are being used as part of the extension of clinical informatics infrastructure in the UK and US. These systems are also being used in both primary and secondary care settings for providing efficient healthcare delivery to its patients. In order to capitalise on the benefits provided by cardiovascular decision support systems, a strong foundation in evidence-based medicine and well-established clinical practice guidelines (CPGs) have to be considered to ensure clinical governance in the next generation clinical systems.

Background

Ontology driven clinical decision support frameworks

An ontology is an explicit specification of a conceptualization. The term is borrowed from philosophy, where an ontology is a systematic account of existence. For AI systems, what “exists” is that which can be represented. When the knowledge of a domain is represented in a declarative formalism, the set of objects that can be represented is called the universe of discourse. This set of objects, and the describable relationships among them, are reflected in the representational vocabulary with which a knowledge-based program represents knowledge. Thus, in the context of AI, we can describe the ontology of a program by defining a set of representational terms. In such an ontology, definitions associate the names of entities in the universe of discourse (e.g., classes, relations, functions, or other objects) with human-readable text describing what the names mean, and formal axioms that constrain the interpretation and well-formed use of these terms. Formally, an ontology is the statement of a logical theory (Gruber 1993). Ontologies are often equated with taxonomic hierarchies of classes, but class definitions, and the subsumption relation, but ontologies need not be limited to these forms. Ontologies are also not limited to conservative definitions, that is, definitions in the traditional logic sense that only introduce terminology and do not add any knowledge about the world (Herbert and Enderton 1972).

The Systematized Nomenclature of Medicine Clinical Terms (SNOMED CT) is an onto-logical resource specifically developed some thirty years ago with a view to standardize healthcare systems. SNOMED CT and with UMLS are clinical thesauruses, aiming to resolve documentation standardization issues in clinical systems. These are large scale medical taxonomies which have been exploited in modern clinical systems showing significant good results in the targeted clinical systems. In Mortensen et al. (2014) it shows that the clinicians using healthcare systems equipped with SNOMED outperformed clinicians using conventional systems without SNOMED CT capabilities.

Machine learning driven cardiovascular decision support systems

Machine learning refers to a type of artificial intelligence algorithm designed to identify patterns in input data, such as patient characteristics, in order to perform complex classification tasks. Machine learning based clinical decision support systems can avoid the bottleneck of knowledge acquisition because knowledge is directly learned through the clinical data. In addition, ML-based clinical decision support systems are able to give recommendations that are generated by non-linear forms of knowledge, and are easily maintainable by simply adding new cases (Chi 2009).

In Nahar et al. (2013), a number of computational intelligence techniques were utilised in the detection of heart disease as a preventative measure. A comparative analysis of six well-known machine learning classifiers was carried out using the Cleveland heart disease dataset. Authors introduced medical knowledge driven feature selection (MFS) and it was compared against the state of the art feature selection algorithms. Their experimental results showed that machine learning classification combined with MFS significantly improved the performance of binary classification. MFS feature selection technique was combined with computerised feature selection process to further refine classification accuracies obtained in previous iterations. MFS combined with Naive Bayes and Sequential minimal optimisation (SMO for training of support vector machine) provided the best classification accuracies and TP (true positive) and F-measure resulted in a higher performance as compare to experimental setups based on state of the art feature selection techniques combined with machine learning classifiers.

We proposed an ontology and machine learning driven hybrid clinical decision support framework for cardiovascular preventative care as shown in Fig. 1. The development of the machine learning driven prognostic system (MLDPS) was carried out in close collaboration with clinical experts. The rapid access chest pain clinic’s case study was identified by the consultant cardiologist from Raigmore Hospital in Inverness, UK. The key objective of the RACPC clinical case study was to help improve the diagnostic and performance capabilities of the RACPC. The heart disease clinical case study was carried out in collaboration with general medical practitioners from UK in order to develop a preventative care mechanism for patients who are at risk of developing heart disease.

The ODCRARS is a knowledge-based system which is based on clinical expert’s knowledge, encoded in the form of clinical rules (utilised by the clinical rules engine) to carry out cardiac risk assessment for various cardiovascular diseases. The MLDPS is a non knowledge-based/data driven prognostic system which is developed by applying machine learning and feature selection techniques on legacy patient datasets. This approach eliminates the need for writing clinical rules thereby reducing dependency on clinical experts to encode their advice in the clinical decision making. Non-knowledge based clinical decision support systems are utilised in providing point-of-care clinical decision making and implementation of such solutions facilitate development of cost effective solutions with improvement in the quality of care provided.

The rest of this paper will be in sections: In “Background” section, we provide a detailed description of the novel machine learning driven prognostic system based on the chest pain clinical case study and the complete development life cycle followed by validation results. At the end we conclude our findings and provide future directions of our research.

Methods

MLDPS development based on rapid access chest pain clinic’s clinical case study

An iterative development process, based on machine learning and feature selection has been utilised in the development of machine learning driven prognostic models. The MLDPS’s development process is general enough to handle a variety of healthcare datasets which will enable researchers to develop cost effective and evidence based clinical decision support systems. For the purpose of this paper, development and validation of the MLDPS based on the chest pain clinical case study will be discussed in detail. The key stages of the prognostic model development process are shown in Fig. 2. The general description of each stage is as follows:

Results and discussion

The consultant cardiologist from Raigmore Hospital specified a revised clinical requirement to break original patient dataset down into clinical risk factors and lab test results and create two new study groups. The key clinical objective of introducing this demarcation amongst clinical risk factors and lab results was to evaluate the impact of classification results using these two new datasets. So two new study cohorts were created for this purpose as shown in Table 1, so that a comparison could be drawn among two study groups. Another clinical requirement was to compare the clinical effectiveness of two models separately and to classify chest pain patients (predicting risk of cardiac or non cardiac chest pain) purely on the basis of the risk factors and test results information independently.

For the comparative analysis, the original patient dataset was distributed into two study sets as follows:

Table 1 Clinical risk factors and test results in two study groups

A novel ontology and machine learning driven hybrid cardiovascular clinical prognosis as a complex adaptive clinical system

Abstract

Purpose

Methods

Results

Conclusion

Introduction

Background

Ontology driven clinical decision support frameworks

Machine learning driven cardiovascular decision support systems

Methods

MLDPS development based on rapid access chest pain clinic’s clinical case study

Results and discussion

Study group 1: clinical risk factors

Evaluation

Performance evaluation of experimental setups

Study group 2: lab test results

Evaluation

Performance evaluation of experimental setups

Implementation of online clinical prognostic models

Validation of the machine learning driven system (MLDPS) and ontology driven clinical risk assessment and recommendation system (ODCRARS)

Conclusions

References

Authors’ contributions

Acknowledgements

Competing interests

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords