Experiences implementing scalable, containerized, cloud-based NLP for extracting biobank participant phenotypes at scale.
Clicks: 315
ID: 109591
2020
Article Quality & Performance Metrics
Overall Quality
Improving Quality
0.0
/100
Combines engagement data with AI-assessed academic quality
Reader Engagement
Emerging Content
5.7
/100
19 views
19 readers
Trending
AI Quality Assessment
Not analyzed
Abstract
To develop scalable natural language processing (NLP) infrastructure for processing the free text in electronic health records (EHRs).We extend the open-source Apache cTAKES NLP software with several standard technologies for scalability. We remove processing bottlenecks by monitoring component queue size. We process EHR free text for patients in the PrecisionLink Biobank at Boston Children's Hospital. The extracted concepts are made searchable via a web-based portal.We processed over 1.2 million notes for over 8000 patients, extracting 154 million concepts. Our largest tested configuration processes over 1 million notes per day.The unique information represented by extracted NLP concepts has great potential to provide a more complete picture of patient status.NLP large EHR document collections can be done efficiently, in service of high throughput phenotyping.
| Reference Key |
miller2020experiencesjamia
Use this key to autocite in the manuscript while using
SciMatic Manuscript Manager or Thesis Manager
|
|---|---|
| Authors | Miller, Timothy A;Avillach, Paul;Mandl, Kenneth D; |
| Journal | JAMIA open |
| Year | 2020 |
| DOI |
10.1093/jamiaopen/ooaa016
|
| URL | |
| Keywords |
Citations
No citations found. To add a citation, contact the admin at info@scimatic.org
Comments
No comments yet. Be the first to comment on this article.