Academia.edu no longer supports Internet Explorer.
To browse Academia.edu and the wider internet faster and more securely, please take a few seconds to upgrade your browser.
2001, Festschrift in Honor of Gabriel Thierrin
Ciliates (an ancient group of single cell organisms) have two sorts of nuclei with different functionalities: the micronucleus and the macronucleus. After the cell mating the micronuclear genes are converted into the macronuclear genes in the process called gene assembly. This is one of the most complex examples of DNA processing known in any organisms, and it is fascinating from the computational point of view. This paper continues the investigation of gene assembly in the framework of three molecular operations: ld-excision, hi-excision/reinsertion, and dlad-excision/reinsertion. In general, for a given micronuclear gene there exists many strategies for using these three operations to accomplish gene assembly. Since it is not known yet which strategies are actually used by ciliates, it is important to study the invariants of gene assembly, i.e., those properties of gene assembly that are common to all these strategies. A macronuclear gene (before its excision and capping with telomeres) can be assembled either in a linear or in a circular molecule. We prove in this paper that the circularity property (whether or not a given gene will be assembled in a circular molecule) is an invariant. We give a simple decision algorithm for the circularity property, and discuss a number of other related invariants.
Gene assembly in stichotrichous ciliates happening during sexual reproduction is one of the most involved DNA manipulation processes occurring in biology. This biological process is of high interest from the computational and mathematical points of view due to its close analogy with such concepts and notions in theoretical computer science as permutation and linked list sorting and string rewriting. Studies on computational properties of gene assembly in ciliates represent a good example of interdisciplinary research contributing to both computer science and biology. We review here a number of general results related both to the development of different computational methods enhancing our understanding on the nature of gene assembly, as well as to the development of new biologically motivated computational and mathematical models and paradigms. Those paradigms contribute in particular to combinatorics, formal languages and computability theories.
2001
The way that ciliates transform genes from their micronuclear to the macronuclear form is very interesting (and unique), also from a computational point of view. In this paper, we describe the model of gene assembly in ciliates presented in (2), (3), and (4). Moreover, we prove that the set of three operations underlying this model is universal, in the sense
Lecture Notes in Computer Science, 2004
Two models for gene assembly in ciliates have been proposed and investigated in the last few years. The DNA manipulations postulated in the two models are very different: one model is intramolecular -a single DNA molecule is involved here, folding on itself according to various patterns, while the other is intermolecular -two DNA molecules may be involved here, hybridizing with each other. Consequently, the assembly strategies predicted by the two models are completely different. Interestingly however, the final result of the assembly (including the assembled gene) is always the same. We compare in this paper the two models for gene assembly, formalizing both in terms of pointer reductions. We also discuss invariants and universality results for both models.
Mathematical Structures in Computer Science, 2002
Ciliates have d e v eloped a unique nuclear dualism -two n uclei of di erent functionality: the germline micronucleus and the somatic macronucleus. The way that ciliates assemble the macronuclear genes after cell mating constitutes one of the most intricate DNA processings in living organisms. This processing is also very interesting from the computational point of view. In this paper, we i n vestigate the operations of loop excision and hairpin excision/reinsertion used in the assembly process. In particular, we consider three levels of formalization of this process, culminating in graph reduction systems.
The intramolecular model for gene assembly in ciliates considers three operations, ld, hi, and dlad that can assemble any gene pattern through folding and recombination: the molecule is folded so that two occurrences of a pointer (short nucleotide sequence) get aligned and then the sequence is rearranged through recombination of pointers. In general, the sequence rearranged by one operation can be arbitrarily long and consist of many coding and non-coding blocks. We consider in this paper simple variants of the three operations, where only one coding block is rearranged at a time. We characterize in this paper the gene patterns that can be assembled through these variants. Our characterization is in terms of signed permutations and dependency graphs. Interestingly, we show that simple assemblies possess rather involved properties: a gene pattern may have both successful and unsuccessful assemblies and also more than one successful assembling strategy.
Micronuclear genes in stichotrichous ciliates are broken into blocks separated by noncoding sequences, sometimes with the blocks in a shuffled order, some even inverted. During reproduction, all blocks are assembled in the correct order and orientation. This process is possible due to the special structure of micronuclear genes: each coding block M ends with a short nucleotide sequence (called pointer) that is repeated at the beginning of the coding block that should follow M in the assembled gene. Many of the pointers have multiple occurrences along both strands of the gene. This yields a very high number of pointer-induced possible divisions into coding and noncoding blocks.
We study parallel complexity of signed graphs motivated by the highly 8 complex genetic recombination processes in ciliates. The molecular gene 9 assembly operations have been modeled by operations of signed graphs, 10 i.e., graphs where the edges have a sign + or −. In the optimization prob-11 lem for signed graphs one wishes to find the parallel complexity by which 12 the graph can be reduced to the empty graph. We relate parallel com-13 plexity to matchings in graphs for some natural graph classes, especially 14 bipartite graphs. It is shown, for instance, that a bipartite graph G has 15 parallel complexity one if and only if G has a unique perfect matching. 16 We also formulate some open problems of this research topic.
Int'l J. of Communications, Network and …, 2012
Lecture Notes in Computer Science, 2007
The process of gene assembly in ciliates is a fascinating example of programmed DNA manipulations in living cells. Macronuclear genes are split into coding blocks (called MDSs), shuffled and separated by non-coding sequences to form micronuclear genes. Assembling the coding blocks from micronuclear genes to form functional macronuclear genes is facilitated by an impressive in-vivo implementation of the linked list data structure of computer science. Complexity measures for genes may be defined in many ways, including the number of MDSs, the number of loci, etc. We take a different approach in this paper and propose four complexity measures for genes in ciliates, based on the 'effort' required to assemble the gene. We consider: (a) the types of operations used in the assembly, (b) the number of operations used in the assembly, (c) the length of the molecular folds involved, and (d) the length of the shortest possible parallel assembly for that gene.
New Generation Computing, 2002
We define three operations on strings and languages suggested by the process of gene assembly in hypotrichous ciliates. This process is considered to be a prime example of DNA computing in vivo. This paper is devoted to some computational aspects of these operations from a formal language point of view. The closure of the classes of regular and context-free languages under these operations is settled. Then, we consider the ld-macronuclear language of a given language L, which consists of all ldmacronuclear strings obtained from the strings of L by iteratively applying the loop-direct repeat-excision. Finally, we discuss some open problems and further directions of research.
2003
This is the second part of a review of the research on formal modelling of gene assembly in ciliates. In the first part, we provided the basic biological background and introduced molecular operations in the process of gene assembly. In this part, we shall represent these operations within three formal frameworks: MDS descriptors, legal strings, and overlap graphs, corresponding to different abstraction levels which however turn out to be equivalent as far as the operational ability of gene assembly is concerned.
Lecture Notes in Computer Science, 2006
The intramolecular model for gene assembly in ciliates considers three operations, ld, hi, and dlad that can assemble any gene pattern through folding and recombination: the molecule is folded so that two occurrences of a pointer (short nucleotide sequence) get aligned and then the sequence is rearranged through recombination of pointers. In general, the sequence rearranged by one operation can be arbitrarily long and consist of many coding and non-coding blocks. We consider in this paper some simpler variants of the three operations, where only one coding block is rearranged at a time. We characterize in this paper the gene patterns that can be assembled through these variants. Our characterization is in terms of signed permutations and dependency graphs. Interestingly, we show that simple assemblies possess rather involved properties: a gene pattern may have both successful and unsuccessful assemblies and also more than one successful assembling strategy.
Natural Computing, 2008
Gene assembly in ciliates is an impressive computational process. Ciliates have a unique way of storing their genetic information in two fundamentally different forms within their two types of nuclei. Micronuclear genes are broken into blocks (called MDSs), with MDSs shuffled and separated by non-coding material; some of the MDSs may even be inverted. During gene assembly, all MDSs are sorted in the correct order to yield the transcription-able macronuclear gene. Based on the intramolecular model for gene assembly, we prove in this paper that gene assembly may be used in principle to solve computational problems. We prove that any given instance of the hamiltonian path problem may be encoded in a suitable way in the form of an 'artificial' gene so that gene assembly is successful on that gene-like pattern if and only if the given problem has an affirmative answer. ⋆ A. Alhazov ([email protected]) and V. Rogojin are on leave
We survey in this paper the main differences among three variants of an intramolecular model for gene assembly: the general, the simple, and the elementary models. We formalize all of them in terms of sorting signed permutations and compare their behavior with respect to: (i) completeness, (ii) confluence (with the notion defined in three different setups), (iii) decidability, (iv) characterization of the sortable permutations in each model, (v) sequential complexity, and (vi) experimental validation.
Vol 1: Algorithms and Complexity & Vol 2: Formal Models and Semantics, 2004
Natural Computing Series, 2006
The intramolecular model for gene assembly in ciliates considers three operations, ld, hi, and dlad that can assemble any micronuclear gene pattern through folding and recombination: the molecule is folded so that two occurrences of a pointer (short nucleotide sequence) get aligned and then the sequence is rearranged through recombination of pointers. In general, the sequence rearranged by one operation can be arbitrarily long and may consist of many coding and non-coding blocks. We consider in this paper some restricted variants of the three operations, where only one coding block is rearranged at a time. We present in this paper the molecular model of these simple operations. We also introduce a mathematical model for the simple operations, on three levels of abstractions: MDS descriptors, signed permutations, and signed double occurrence strings. Interestingly, we show that simple assemblies possess rather involved properties: a gene pattern may have both successful and unsuccessful assemblies and also more than one successful strategy.
DNA computing, or more generally Molecular computing is an exciting research area at the intersection of mathematics, computer science and molecular biology. Research in DNA computing can be roughly divided in two streams: DNA computing in vitro and in vivo. The former is concerned with building (specialized) DNA-based computers in test tubes, while the latter is concerned with implementing some computational components in living cells, as well as with studying the computational processes taking place in the living cells.
International Journal of Foundations of Computer Science, 2007
The process of gene assembly in ciliates, an ancient group of organisms, is one of the most complex instances of DNA manipulation known in any organism. Three molecular operations ld, hi, and dlad have been postulated for the gene assembly process. We propose in this paper a mathematical model for contextual variants of ld and dlad on strings: recombinations can be done only if certain contexts are present. We prove that the proposed model is Turing-universal.
Loading Preview
Sorry, preview is currently unavailable. You can download the paper by clicking the button above.