A Study of Deep Learning Methods for De-identification of Clinical Notes at Cross Institute Settings.

Yang; Xi;Lyu; Tianchen;Lee; Chih-Yin;Bian; Jiang;Hogan; William R;Wu; Yonghui;

doi:10.1109/ICHI.2019.8904544

A Study of Deep Learning Methods for De-identification of Clinical Notes at Cross Institute Settings.

Clicks: 310

ID: 85158

2019

Article Quality & Performance Metrics

Overall Quality Improving Quality

0.0 /100

Combines engagement data with AI-assessed academic quality

Reader Engagement Star Article

78.3 /100

269 views

224 readers

AI Quality Assessment

Not analyzed

Abstract

EN
- Turkish
- Spanish
- Portuguese
- Arabic
- Chinese
- French
- German
- Indonesian
- Russian
- Thai

In this study, we examined a deep learning method for de-identification of clinical notes at UF Health under a cross-institute setting. We developed deep learning models using 2014 i2b2/UTHealth corpus and evaluated the performance using clinical notes collected from UF Health. We compared four pre-trained word embeddings, including two embeddings from the general domain and two embeddings from the clinical domain. We also explored linguistic features (i.e., word shape and part-of-speech) to further improve the performance of de-identification. The experimental results show that the performance of deep learning models trained using i2b2/UTHealth corpus significantly dropped (strict and relax F1 scores dropped from 0.9547 and 0.9646 to 0.8360 and 0.8870) when applied to another corpus from a different institution (UF Health). Linguistic features, including word shapes and part-of-speech, could further improve the performance of de-identification in cross-institute settings (improved to 0.8527 and 0.9052).

Reference Key	yang2019aieee Use this key to autocite in the manuscript while using SciMatic Manuscript Manager or Thesis Manager
Authors	Yang, Xi;Lyu, Tianchen;Lee, Chih-Yin;Bian, Jiang;Hogan, William R;Wu, Yonghui;
Journal	ieee international conference on healthcare informatics ieee international conference on healthcare informatics
Year	2019
DOI	10.1109/ICHI.2019.8904544 Searching for DOI...
URL	https://doi.org/10.1109/ICHI.2019.8904544
Keywords	Deep learning natural language processing de-identification

Citations

No citations found. To add a citation, contact the admin at info@scimatic.org

Comments

Login to comment Register

No comments yet. Be the first to comment on this article.