An optimal algorithm for maximum-sum segment and its application in bioinformatics. Extended abstract

Tsai Hung Fan, Shufen Lee, Hsueh I. Lu, Tsung Shan Tsou, Tsai Cheng Wang, Adam Yao

Research output: Chapter in Book/Report/Conference proceedingChapterpeer-review

25 Scopus citations

Abstract

We study a fundamental sequence algorithm arising from bioinformatics. Given two integers L and U and a sequence A of n numbers, the maximum-sum segment problem is to find a segment A[i,j] of A with L ≤ j-i+1 ≤ U that maximizes A[i]+A[i+1]+⋯+A[j]. The problem finds applications in finding repeats, designing low complexity filter, and locating segments with rich C+G content for biomolecular sequences. The best known algorithm, due to Lin, Jiang, and Chao, runs in O(n) time, based upon a clever technique called left-negative decomposition for A. In the present paper, we present a new O(n)-time algorithm that bypasses the left-negative decomposition. As a result, our algorithm has the capability to handle the input sequence in an online manner, which is clearly an important feature to cope with genome-scale sequences. We also show how to exploit the sparsity in the input sequence: If A is representable in O(k) space in some format, then our algorithm runs in O(k) time. Moreover, practical implementation of our algorithm running on the rice genome helps us to identify a very long repeat structure in rice chromosome 1 that is previously unknown.

Original languageEnglish
Title of host publicationLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
EditorsOscar H. Ibarra, Zhe Dang
PublisherSpringer Verlag
Pages251-257
Number of pages7
ISBN (Print)3540405615
DOIs
StatePublished - 2003

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume2759
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Fingerprint

Dive into the research topics of 'An optimal algorithm for maximum-sum segment and its application in bioinformatics. Extended abstract'. Together they form a unique fingerprint.

Cite this