Predicting regulatory elements in repetitive sequences using transcription factor binding sites

Jorng Tzong Horng, Wen Fu Cho, Cheng Yan Kao

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Repeat sequences are the most abundant ones in the extragenic region of genomes. Biologists have already found a large number of regulatory elements in this region. These elements may profoundly impact the chromatin structure formation in nucleus and also contain important clues in gemetic evolution and phylogeny study. This study attempts to mine rules on how combinations of individual binding sites are distributed repeat sequences. The association rules mined would facilitate efforts to identify gene classes regulated by similar mechanisms and accurately predict regulatory elements. Herein, the combinations of transcription factor binding sites in the repeat sequences are obtained and, then, data mining techniques are applied to mine the association rules from the combination of binding sites. In addition, the discovered associations are further proned to remove those insignificant associations and obtain a set of discovered associations. Finally, the discovered association rules are used to partially classify the repeat sequences in our repeat database. Experiments on several genomes include C.Elegans, Human Chromosome 22, and Yeast.

Original languageEnglish
Title of host publicationProceedings of the Atlantic Symposium on Computational Biology and Genome Information Systems and Technology, CBGIST 2001
EditorsC.H. Wu, P.P. Wang, J.T.L. Wang
Pages215-219
Number of pages5
StatePublished - 2001
EventProceedings of the Atlantic Symposium on Computational Biology and Genome Information Systems and Technology, GBGIST 2001 - Durham, NC, United States
Duration: 15 Mar 200117 Mar 2001

Publication series

NameProceedings of the Atlantic Symposium on Computational Biology and Genome Information Systems and Technolgoy, CBGIST 2001

Conference

ConferenceProceedings of the Atlantic Symposium on Computational Biology and Genome Information Systems and Technology, GBGIST 2001
Country/TerritoryUnited States
CityDurham, NC
Period15/03/0117/03/01

Fingerprint

Dive into the research topics of 'Predicting regulatory elements in repetitive sequences using transcription factor binding sites'. Together they form a unique fingerprint.

Cite this