Partial words and the critical factorization theorem

Francine Blanchet-Sadri

2005, Journal of Combinatorial Theory, Series A

Abstract

The study of combinatorics on words, or finite sequences of symbols from a finite alphabet, finds applications in several areas of biology, computer science, mathematics, and physics. Molecular biology, in particular, has stimulated considerable interest in the study of combinatorics on partial words, which are sequences that may have a number of "do not know" symbols, also called "holes". This paper is devoted to a fundamental result on periods of words, the Critical Factorization Theorem, which states that the period of a word is always locally detectable in at least one position of the word, resulting in a corresponding critical factorization. Here, we describe precisely the class of partial words w with one hole for which the weak period is locally detectable in at least one position of w. Our proof provides an algorithm which computes a critical factorization when one exists. A World Wide Web server interface at http://www.uncg.edu/mat/cft/ has been established for automated use of the program.

We thank Ajay Chriscoe for very valuable comments and suggestions, and for implementing Algorithm 2 and creating a World Wide Web site for this research. We also thank the referee of a preliminary version of this paper for his/her very valuable comments and suggestions.

Sequences that may have a number of "do not know" symbols are referred to as partial words and appear, for instance, when genes or proteins are compared. Another area of current interest for the study of the combinatorics on partial words is data communication, where some information may be missing, lost, or unknown. While a word can be described by a total function, a partial word can be described by a partial function.
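The classical Critical Factorization Theorem mentioned in the abstract can be illustrated for ordinary words (no holes) with a brute-force sketch. This is not the paper's Algorithm 2, which is not reproduced on this page; the function names `minimal_period`, `local_period`, and `critical_positions` are hypothetical, and the definition of a local period used here (the length of the shortest repetition centered at a position, allowing overhang past either end of the word) is the standard one from the combinatorics-on-words literature rather than something stated in the abstract.

```python
def minimal_period(w: str) -> int:
    """Smallest p >= 1 such that w[i] == w[i + p] for every valid i."""
    n = len(w)
    for p in range(1, n + 1):
        if all(w[i] == w[i + p] for i in range(n - p)):
            return p
    return n  # only reached for the empty word


def local_period(w: str, i: int) -> int:
    """Smallest r >= 1 such that a repetition of length r is centered at
    position i, i.e. at the cut between w[:i] and w[i:].

    Letters that would fall outside the word are unconstrained, so the
    comparison only ranges over indices j where both j and j + r exist.
    """
    n = len(w)
    for r in range(1, n + 1):
        if all(w[j] == w[j + r] for j in range(max(0, i - r), min(i, n - r))):
            return r
    return n


def critical_positions(w: str) -> list[int]:
    """Cut positions i (0 < i < len(w)) whose local period equals the
    minimal period of w; each one gives a critical factorization."""
    p = minimal_period(w)
    return [i for i in range(1, len(w)) if local_period(w, i) == p]


if __name__ == "__main__":
    w = "abaab"
    print(minimal_period(w))
    for i in critical_positions(w):
        print(w[:i], "|", w[i:])
```

For the word `abaab`, whose minimal period is 3, the sketch reports the cuts `ab | aab` and `abaa | b`. The search is quadratic or worse in the word length and is meant only to make the definitions concrete.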


Lemma 4. Let $u, v \in A^+$ and let $z \in W_1(A)$. If $uz \uparrow zu$, then one of the following holds:

Francine Blanchet-Sadri

Theoretical Computer Science, 2004

The study of the combinatorial properties of strings of symbols from a finite alphabet (also referred to as words) is profoundly connected to numerous fields such as biology, computer science, mathematics, and physics. Research in combinatorics on words goes back roughly a century. There is a renewed interest in combinatorics on words as a result of emerging new application areas such as molecular biology. Partial words were recently introduced in this context. The motivation behind the notion of a partial word is the comparison of genes (or proteins). Alignment of two genes (or two proteins) can be viewed as a construction of partial words that are said to be compatible. While a word can be described by a total function, a partial word can be described by a partial function. More precisely, a partial word of length n over a finite alphabet A is a partial function from {1, ..., n} into A. Elements of {1, ..., n} without an image are called holes. A word is just a partial word without holes. The notion of period of a word is central in combinatorics on words. In the case of partial words, there are two notions: one is that of period, the other is that of local period. This paper extends to partial words with one hole the well-known result of Guibas and Odlyzko, which states that for every word u, there exists a word v of the same length as u over the alphabet {0, 1} such that the set of all periods of u coincides with the set of all periods of v. Our result states that for every partial word u with one hole, there exists a partial word v of the same length as u with at most one hole over the alphabet {0, 1} such that the set of all periods of u coincides with the set of all periods of v and the set of all local periods of u coincides with the set of all local periods of v. To prove our result, we use the technique of Halava, Harju and Ilie which they used

* This material is based upon work supported by the National Science Foundation under Grants CCR-9700228 and CCR-0207673. A Research Assignment from the University of North Carolina at Greensboro is gratefully acknowledged. I thank Phuongchi Thi Le for very valuable comments and suggestions. She received a research assistantship from the University of North Carolina at Greensboro to work with me on this project.
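The description of a partial word as a partial function, together with its two notions of period, can be made concrete with a small sketch. The abstract does not spell out the two definitions, so the ones below are assumptions taken from the usual partial-word literature: p is a period if all defined letters in the same residue class modulo p agree, and p is a local period (called a weak period in the 2005 paper above) if u(i) = u(i + p) whenever both positions are defined. The names `parse`, `is_period`, `is_local_period`, and the use of `?` to mark a hole are hypothetical choices, and indices are 0-based instead of the 1-based {1, ..., n} of the text.

```python
from typing import Optional

PartialWord = list[Optional[str]]  # None marks a hole


def parse(text: str, hole: str = "?") -> PartialWord:
    """Turn a string such as 'ab?b' into a partial word (hole -> None)."""
    return [None if c == hole else c for c in text]


def is_period(u: PartialWord, p: int) -> bool:
    """All defined letters at positions congruent modulo p must agree."""
    for r in range(min(p, len(u))):
        letters = {u[i] for i in range(r, len(u), p) if u[i] is not None}
        if len(letters) > 1:
            return False
    return True


def is_local_period(u: PartialWord, p: int) -> bool:
    """u[i] == u[i + p] whenever both positions are defined."""
    return all(
        u[i] is None or u[i + p] is None or u[i] == u[i + p]
        for i in range(len(u) - p)
    )


def periods(u: PartialWord) -> list[int]:
    return [p for p in range(1, len(u) + 1) if is_period(u, p)]


def local_periods(u: PartialWord) -> list[int]:
    return [p for p in range(1, len(u) + 1) if is_local_period(u, p)]


if __name__ == "__main__":
    u = parse("abb?bbcbb")
    print(periods(u))
    print(local_periods(u))
```

The one-hole word `abb?bbcbb` shows why the two notions differ: 3 is a local period, because the hole at index 3 separates the `a` at index 0 from the `c` at index 6, but 3 is not a period, because those two letters lie in the same residue class modulo 3 and disagree. For a word without holes the two functions return the same answer.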
