To search, Click below search items.


All Published Papers Search Service


Identification of Protein Coding Regions in Genomic DNA Using Unsupervised FMACA Based Pattern Classifier


P. Kiran Sree, Inampudi Ramesh Babu


Vol. 8  No. 1  pp. 305-309


Genes carry the instructions for making proteins that are found in a cell as a specific sequence of nucleotides that are found in DNA molecules. But, the regions of these genes that code for proteins may occupy only a small region of the sequence. Identifying the coding regions play a vital role in understanding these genes. In this paper we propose a unsupervised Fuzzy Multiple Attractor Cellular Automata (FMCA) based pattern classifier to identify the coding region of a DNA sequence. We propose a distinct K-Means algorithm for designing FMACA classifier which is simple, efficient and produces more accurate classifier than that has previously been obtained for a range of different sequence lengths. Experimental results confirm the scalability of the proposed Unsupervised FCA based classifier to handle large volume of datasets irrespective of the number of classes, tuples and attributes. Good classification accuracy has been established.


Cellular Automata (CA), Unsupervised learning Classifier, Genetic Algorithm (GA), Machine learning, Distinct K-Means Algorithm, Coding Regions, Fuzzy Multiple Attractor Cellular Automata (FMACA), Pattern Classifier