developing jsequitur to study the hierarchical structure of biological sequences in a grammatical inference framework of string compression algorithms

Clicks: 170
ID: 151494
2012
Grammatical inference methods are expected to find grammatical structures hidden in biological sequences. One hopes that studies of grammar serve as an appropriate tool for theory formation. Thus, we have developed JSequitur for automatically generating the grammatical structure of biological sequences in an inference framework of string compression algorithms. Our original motivation was to find any grammatical traits of several cancer genes that can be detected by string compression algorithms. Through this research, we could not find any meaningful unique traits of the cancer genes yet, but we could observe some interesting traits in regards to the relationship among gene length, similarity of sequences, the patterns of the generated grammar, and compression rate.
Reference Key
galbadrakh2012genomicsdeveloping Use this key to autocite in the manuscript while using SciMatic Manuscript Manager or Thesis Manager
Authors ;Bulgan Galbadrakh;Kyung-Eun Lee;Hyun-Seok Park
Journal Journal of environmental management
Year 2012
DOI 10.5808/GI.2012.10.4.266
URL
Keywords

Citations

No citations found. To add a citation, contact the admin at info@scimatic.org

No comments yet. Be the first to comment on this article.