conLSH: Context based Locality Sensitive Hashing for mapping of noisy SMRT reads.
2020
Abstract
Single Molecule Real-Time (SMRT) sequencing is a recent advance in next-generation sequencing technology developed by Pacific Biosciences (PacBio). It produces an explosion of long, noisy reads, demanding cutting-edge research to get the most out of them. To deal with the high error probability of SMRT data, this article proposes a novel contextual Locality Sensitive Hashing (conLSH) based algorithm that can effectively align noisy SMRT reads to the reference genome. Here, sequences are hashed together based not only on their closeness but also on the similarity of their context. The algorithm has a space requirement of $O(n^{\rho+1})$, where $n$ is the number of sequences in the corpus and $\rho$ is a constant. The indexing time and querying time are bounded by $O\left(\frac{n^{\rho+1} \cdot \ln n}{\ln \frac{1}{P}}\right)$ and $O(n^{\rho})$ respectively, where $P > 0$ is a probability value. This algorithm is particularly useful for retrieving similar sequences, a widely used task in biology. The proposed conLSH-based aligner is compared with rHAT, a popular aligner for SMRT reads, and is found to outperform it in both speed and memory requirements. In particular, it takes approximately 24.2% less processing time while saving about 70.3% in peak memory requirement for the H. sapiens PacBio dataset.
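The idea of hashing sequences by closeness and by context can be illustrated with a small sketch. This is not the authors' implementation: the function names, the parameters `k` (sampled positions per hash), `lam` (context radius) and the number of tables are assumptions made for illustration, and the sketch only tolerates substitutions at unsampled positions, not the insertions and deletions common in real SMRT reads.

```python
import random
from collections import defaultdict

def make_context_hash(seq_len, k, lam, rng):
    """Return a hash function that samples k centre positions and
    emits the (2*lam + 1)-length context around each one.
    (Illustrative only; k and lam are hypothetical parameter names.)"""
    centres = sorted(rng.sample(range(lam, seq_len - lam), k))

    def h(seq):
        # Concatenate the context substrings into one bucket key.
        return "|".join(seq[c - lam:c + lam + 1] for c in centres)

    return h

def build_index(windows, hashes):
    """One hash table per hash function; each bucket stores window indices."""
    tables = [defaultdict(list) for _ in hashes]
    for idx, w in enumerate(windows):
        for table, h in zip(tables, hashes):
            table[h(w)].append(idx)
    return tables

def query(read, tables, hashes):
    """Collect every window that shares a bucket with the read in any table."""
    candidates = set()
    for table, h in zip(tables, hashes):
        candidates.update(table.get(h(read), []))
    return candidates

# Toy data: a random "reference" cut into overlapping fixed-length windows.
rng = random.Random(0)
reference = "".join(rng.choice("ACGT") for _ in range(200))
w = 20
windows = [reference[i:i + w] for i in range(len(reference) - w + 1)]

hashes = [make_context_hash(w, k=2, lam=1, rng=rng) for _ in range(8)]
tables = build_index(windows, hashes)
hits = query(windows[50], tables, hashes)
```

Using several independent tables raises the chance that a noisy read still shares at least one bucket with its true origin, at the cost of extra memory. The candidate set returned by `query` would then be passed to a proper alignment step.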
| Reference Key | chakraborty2020conlshcomputational |
|---|---|
| Authors | Chakraborty, Angana; Bandyopadhyay, Sanghamitra |
| Journal | Computational Biology and Chemistry |
| Year | 2020 |
| DOI | S1476-9271(19)31149-1 |