Divide and Conquer approach for Genome Classification based on subclass characterization
Divide and Conquer approach for Genome Classification based on subclass characterization
Abstract Classification of large grass genome sequences has major challenges in functional genomes. The presence of motifs in grass genome chains can make the prediction of the functional behavior of grass genome possible. The correlation between grass genome properties and their motifs is not always obvious, since more than one motif may exist within a genome chain. Due to the complexity of this association most pattern classification algorithms are either vain or time consuming. Attempted to a reduction of high dimensional data that utilizes DAC technique is presented. Data are disjoining into equal multiple sets while preserving the original data distribution in each set. Then, multiple modules are created by using the data sets as independent training sets and classified into respective modules. Finally, the modules are combined to produce the final classification rules, containing all the previously extracted information. The methodology is tested using various grass genome data sets. Results indicate that the time efficiency of our algorithm is improved compared to other known data mining algorithms.
Murty M.N.、Patil S.S.、Angadi U.B.
Dept. of Computer Science and Automation, Indian Institute of ScienceDept. of Computer Science, University of Agricultural ScienceCABin, ISRI
生物科学研究方法、生物科学研究技术计算技术、计算机技术分子生物学
Divide and Conquer (DAC)Class within Class (CWC)
Murty M.N.,Patil S.S.,Angadi U.B..Divide and Conquer approach for Genome Classification based on subclass characterization[EB/OL].(2025-03-28)[2025-05-11].https://www.biorxiv.org/content/10.1101/003475.点此复制
评论