Identification of complex clinical phenotypes among critically ill patients is a major challenge in clinical research. The overall research goal of our work is to develop automated approaches that accurately identify critical illness phenotypes to prevent the resource intensive manual abstraction approach. In this paper, we describe a text processing method that uses Natural Language Processing (NLP) and supervised text classification methods to identify patients who are positive for Acute Lung Injury (ALI) based on the information available in free-text chest x-ray reports. To increase the classification performance we enhanced the baseline unigram representation with bigram and trigram features, enriched the n-gram features with assertion analysis, and applied statistical feature selection. We used 10-fold cross validation for evaluation and our best performing classifier achieved 81.70% precision (positive predictive value), 75.59% recall (sensitivity), 78.53% f-score, 74.61% negative predictive value, 76.80% specificity in identifying patients with ALI.
Yumiko Imai, Keiji Kuba, G. Gregory Neely, Rubina Yaghubian-Malhami, Thomas Perkmann, Geert Loo, Maria Ermolaeva, Ruud A. W. Veldhuizen, Connie Y. H. Leung, Hongliang Wang, Haolin Liu, Yang Sun, Manolis Pasparakis, Manfred Köpf, Christin Mech, Sina Bavari, Malik Peiris, Arthur S. Slutsky, Akira Shizuo, Malin Hultqvist, Rikard Holmdahl, John Nicholls, Chengyu Jiang, Christoph J. Binder, Josef Penninger
Christine U. Vohwinkel, Nana Burns, E. Coit, Xiaoyi Yuan, Eszter K. Vladar, Christina Sul, Eric P. Schmidt, Peter Carmeliet, Kurt R. Stenmark, Eva Nozik‐Grayck, Rubin M. Tuder, Holger K. Eltzschig
Lauralyn McIntyre, David Moher, Dean Fergusson, Katrina Sullivan, Shirley H. J. Mei, Manoj M. Lalu, John C. Marshall, Malcolm D. McLeod, Gilly Griffin, Jeremy Grimshaw, Alexis F. Turgeon, Marc T. Avey, Michael A. Rudnicki, M R Hadian Jazi, Jason Fishman, Duncan J. Stewart
Discussion(0)
No comments yet. Be the first to comment.