Phone-aware Neural Language Identification

Clicks: 17
ID: 282563
2017
Pure acoustic neural models, particularly the LSTM-RNN model, have shown great potential in language identification (LID). However, the phonetic information has been largely overlooked by most of existing neural LID models, although this information has been used in the conventional phonetic LID systems with a great success. We present a phone-aware neural LID architecture, which is a deep LSTM-RNN LID system but accepts output from an RNN-based ASR system. By utilizing the phonetic knowledge, the LID performance can be significantly improved. Interestingly, even if the test language is not involved in the ASR training, the phonetic knowledge still presents a large contribution. Our experiments conducted on four languages within the Babel corpus demonstrated that the phone-aware approach is highly effective.
Reference Key
li2017phoneaware Use this key to autocite in the manuscript while using SciMatic Manuscript Manager or Thesis Manager
Authors Zhiyuan Tang; Dong Wang; Yixiang Chen; Ying Shi; Lantian Li
Journal arXiv
Year 2017
DOI DOI not found
URL
Keywords

Citations

No citations found. To add a citation, contact the admin at info@scimatic.org

No comments yet. Be the first to comment on this article.