An optimal algorithm for maximum-sum segment and its application in bioinformatics extended abstract

Tsai Hung Fan, Shufen Lee, Hsueh I. Lu, Tsung Shan Tsou, Tsai Cheng Wang, Adam Yao

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

We study a fundamental sequence algorithm arising from bioinformatics. Given two integers L and U and a sequence A of n numbers, the maximum-sum segment problem is to find a segment A[i, j] of A with L ≤ j−i+1 ≤ U that maximizes A[i]+A[i+1]+• • •+A[j]. The problem finds applications in finding repeats, designing low complexity filter, and locating segments with rich C+G content for biomolecular sequences. The best known algorithm, due to Lin, Jiang, and Chao, runs in O(n) time, based upon a clever technique called left-negative decomposition for A. In the present paper, we present a new O(n)-time algorithm that bypasses the left-negative decomposition. As a result, our algorithm has the capability to handle the input sequence in an online manner, which is clearly an important feature to cope with genome-scale sequences. We also show how to exploit the sparsity in the input sequence: If A is representable in O(k) space in some format, then our algorithm runs in O(k) time. Moreover, practical implementation of our algorithm running on the rice genome helps us to identify a very long repeat structure in rice chromosome 1 that is previously unknown.

Original languageEnglish
Title of host publicationLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
EditorsOscar H. Ibarra, Zhe Dang
PublisherSpringer Verlag
Pages239-250
Number of pages12
ISBN (Print)3540405615
StatePublished - 2003
Event8th International Conference on Implementation and Application of Automata, CIAA 2003 - Santa Barbara, United States
Duration: 16 Jul 200318 Jul 2003

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume2759
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference8th International Conference on Implementation and Application of Automata, CIAA 2003
Country/TerritoryUnited States
CitySanta Barbara
Period16/07/0318/07/03

Fingerprint

Dive into the research topics of 'An optimal algorithm for maximum-sum segment and its application in bioinformatics extended abstract'. Together they form a unique fingerprint.

Cite this