A data mining method to predict transcriptional regulatory sites based on differentially expressed genes in human genome

Hsien Da Huang, Huei Lin Chang, Tsung Shan Tsou, Baw Jhiune Liu, Jorng Tzong Horng

Research output: Contribution to journalArticlepeer-review

2 Scopus citations

Abstract

Very large-scale gene expression analysis, i.e., UniGene and dbEST, is provided to find those genes with significantly differential expression in specific tissues. The differentially expressed genes in a specific tissue are potentially regulated concurrently by a combination of transcription factors. This study attempts to mine putative binding sites on how combinations of the known regulatory sites homologs and over-represented repetitive elements are distributed in the promoter regions of considered groups of differentially expressed genes. We propose a data mining approach to statistically discover the significantly tissue-specific combinations of known site homologs and over-represented repetitive sequences, which are distributed in the promoter regions of differentially gene groups. The association rules mined would facilitate to predict putative regulatory elements and identify genes potentially co-regulated by the putative regulatory elements.

Original languageEnglish
Pages (from-to)923-942
Number of pages20
JournalJournal of Information Science and Engineering
Volume19
Issue number6
StatePublished - Nov 2003

Keywords

  • Data mining
  • EST
  • Gene expression
  • Regulatory site
  • Transcription factor
  • UniGene

Fingerprint

Dive into the research topics of 'A data mining method to predict transcriptional regulatory sites based on differentially expressed genes in human genome'. Together they form a unique fingerprint.

Cite this