Journal of biomedical informatics
-
The domain of healthcare has always been flooded with a huge amount of complex data, coming in at a very fast-pace. A vast amount of data is generated in different sectors of healthcare industry: data from hospitals and healthcare providers, medical insurance, medical equipment, life sciences and medical research. With the advancement in technology, there is a huge potential for utilization of this data for transforming healthcare. ⋯ A systematic mapping study was conducted to identify and analyze research on big data analytics and artificial intelligence in healthcare, in which 2421 articles between 2013 and February 2019 were evaluated. The results of this study will help understand the needs in application of these technologies in healthcare by identifying the areas that require additional research. It will hence provide the researchers and industry experts with a base for future work.
-
Manually curating standardized phenotypic concepts such as Human Phenotype Ontology (HPO) terms from narrative text in electronic health records (EHRs) is time consuming and error prone. Natural language processing (NLP) techniques can facilitate automated phenotype extraction and thus improve the efficiency of curating clinical phenotypes from clinical texts. While individual NLP systems can perform well for a single cohort, an ensemble-based method might shed light on increasing the portability of NLP pipelines across different cohorts. ⋯ Our study demonstrates that ensembles of natural language processing can improve both generic phenotypic concept recognition and patient specific phenotypic concept identification over individual systems. Among the individual NLP systems, each individual system performed best when they were applied in the dataset that they were primary designed for. However, combining multiple NLP systems to create an ensemble can generally improve the performance. Specifically, the ensemble can increase the results reproducibility across different cohorts and tasks, and thus provide a more portable phenotyping solution compared to individual NLP systems.