Physical Sciences and Mathematics | Open Access Articles

A 2-2/3 Approximation For The Shortest Superstring Problem, Chris Armen, Clifford Stein

Computer Science Technical Reports

Given a collection of strings S = {s_1, ..., s_n} over an alphabet \Sigma, a superstring \alpha of S is a string containing each s_i as a substring; that is, for each i, 1 <= i <= n, \alpha contains a block of |s_i| consecutive characters that match s_i exactly. The shortest superstring problem is the problem of finding a superstring \alpha of minimum length. The problem has applications in both data compression and computational biology. In data compression, it is part of a general model of string compression proposed by Gallant, Maier and Storer (JCSS '80). Much of the recent interest in the problem is due to its application to DNA sequence assembly. The problem has been shown to be NP-hard; in fact, Blum et al. (JACM '94) showed it to be MAX SNP-hard. The first O(1)-approximation was also due to Blum et al., who gave an algorithm that always returns a superstring no more than 3 times the length of an optimal solution. Several researchers have published results that improve on this approximation ratio; of these, the best previous result is our algorithm ShortString, which achieves a 2 3/4-approximation (WADS '95). We present our new algorithm, G-ShortString, which achieves a ratio of 2 2/3. It generalizes the ShortString algorithm, but its analysis differs substantially from that of ShortString. Our previous work identified classes of strings that have a nested periodic structure and that must be present in the worst case for our algorithms. We introduced machinery to describe these strings and proved strong structural properties about them. In this paper we extend this study to strings that exhibit a more relaxed form of the same structure, and we use this understanding to obtain our improved result.
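To make the definition in the abstract concrete, the following is a minimal Python sketch. It checks the superstring property and builds a superstring with the textbook greedy merge heuristic (repeatedly merging the pair of strings with the largest overlap). This is not the G-ShortString or ShortString algorithm, whose details the abstract does not give, and no approximation ratio is claimed for it here. The function names `is_superstring`, `overlap` and `greedy_superstring` are illustrative assumptions.

```python
def is_superstring(alpha, strings):
    """True if every string in `strings` occurs as a substring of `alpha`."""
    return all(s in alpha for s in strings)


def overlap(a, b):
    """Length of the longest suffix of `a` that is also a prefix of `b`."""
    for k in range(min(len(a), len(b)), 0, -1):
        if a.endswith(b[:k]):
            return k
    return 0


def greedy_superstring(strings):
    """Greedy merge heuristic (illustrative only, not G-ShortString)."""
    unique = set(strings)
    # Strings contained in another input string never need their own slot.
    items = sorted(s for s in unique
                   if not any(s != t and s in t for t in unique))
    while len(items) > 1:
        best_k, best_i, best_j = -1, 0, 1
        for i, a in enumerate(items):
            for j, b in enumerate(items):
                if i != j:
                    k = overlap(a, b)
                    if k > best_k:
                        best_k, best_i, best_j = k, i, j
        # Merge the best pair, writing the shared characters only once.
        merged = items[best_i] + items[best_j][best_k:]
        items = [s for idx, s in enumerate(items)
                 if idx not in (best_i, best_j)] + [merged]
    return items[0] if items else ""


if __name__ == "__main__":
    S = ["abc", "bcd", "cde"]
    alpha = greedy_superstring(S)
    print(alpha, is_superstring(alpha, S))
```

On the small input `["abc", "bcd", "cde"]` the sketch merges `abc` with `bcd` and then the result with `cde`, giving a superstring of length 5. The greedy output is not guaranteed to be a shortest superstring in general.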

Full-Text Articles in Physical Sciences and Mathematics

Information Retrieval, Information Structure, And Information Agents, Daniela Rus, Devika Subramanian

Computer Science Technical Reports

A ROSAT HRI Observation Of The Supernova Remnant G109.1−1.0, Alan P. Hurford, Robert A. Fesen

Dartmouth Scholarship

An API For Choreographing Data Accesses, Elizabeth A.M. Shriver, Leonard F. Wisniewski

Computer Science Technical Reports

Complexity Analysis Of Two Permutations Used By Fast Cosine Transform Algorithms, Sean S.B. Moore, Leonard F. Wisniewski

Computer Science Technical Reports

Finding Real-Valued Single-Source Shortest Paths In O(N^3) Expected Time, Stavros G. Kolliopoulos, Clifford Stein

Computer Science Technical Reports

Expanding The Potential For Disk-Directed I/O, David Kotz

Dartmouth Scholarship

ENWRICH: A Compute-Processor Write Caching Scheme For Parallel File Systems, Apratim Purakayastha, Carla Schlatter Ellis, David Kotz

Dartmouth Scholarship

Interfaces For Disk-Directed I/O, David Kotz

Computer Science Technical Reports

Structured Permuting In Place On Parallel Disk Systems, Leonard F. Wisniewski

Computer Science Technical Reports

Process Migration For Heterogeneous Distributed Systems, Matt Bishop, Mark Valence, Leonard F. Wisniewski

Computer Science Technical Reports

Fast Spherical Transforms On Distance Transitive Graphs, J R. Driscoll, D M. Healy Jr, D Rockmore

Computer Science Technical Reports

Determination Of Malmquist Bias And Selection Effects From Monte Carlo Simulations, Wolfram Freudling, Luiz N. Da Costa, Gary Wegner, Riccardo Giovanelli

Dartmouth Scholarship

Disk-Directed I/O For An Out-Of-Core Computation, David Kotz

Dartmouth Scholarship

Discovery Of Extreme-Ultraviolet Radiation From The Seyfert Galaxy Ton S180 (=EUVE J0057−223), Stéphane Vennes, Elisha Polomski, Stuart Bowyer, John R. Thorstensen

Dartmouth Scholarship

Deciding Finiteness For Matrix Groups Over Function Fields, Robert Beals, Daniel N. Rockmore, Ki-Seng Tan

Computer Science Technical Reports

Oscillons: Resonant Configurations During Bubble Collapse, E J. Copeland, M Gleiser, H R. Müller

Dartmouth Scholarship

A Multiple Discrete Pass Algorithm On A DEC Alpha 2100, Scott R. Cushman

Dartmouth College Undergraduate Theses

Simulation Of A Video-On-Demand System, Song Bac Toh

Dartmouth College Undergraduate Theses

TIAS: A Transportable Intelligent Agent System, Kenneth Harker

Dartmouth College Undergraduate Theses

Ph.D. Thesis Proposal: Transportable Agents, Robert S. Gray

Computer Science Technical Reports

Low-Level Interfaces For High-Level Parallel I/O, Nils Nieuwejaar, David Kotz

Dartmouth Scholarship

Exploring The Use Of I/O Nodes For Computation In A MIMD Multiprocessor, David Kotz, Ting Cai

Dartmouth Scholarship

Expanding The Potential For Disk-Directed I/O, David Kotz

Computer Science Technical Reports

Content-Based Image Retrieval: Color And Edges, Robert S. Gray

Computer Science Technical Reports

Exploring The Use Of I/O Nodes For Computation In A MIMD Multiprocessor, David Kotz, Ting Cai

Computer Science Technical Reports

The Orbital Period Of The Pre-Cataclysmic Binary RE 2013+400 And A Study Of The Atmosphere Of The DAO White Dwarf Primary, M. A. Barstow, M. R. Burleigh, T. A. Fleming, J. B. Holberg, D. Koester, M. C. Marsh, S. R. Rosen, R. G.M. Rutten, S. Sakai, R. W. Tweedy, G. Wegner

Dartmouth Scholarship

Disk-Directed I/O For An Out-Of-Core Computation, David Kotz

Computer Science Technical Reports

Disk-Directed I/O For MIMD Multiprocessors, David Kotz

Dartmouth Scholarship

DartCVL: The Dartmouth C Vector Library, Thomas H. Cormen, Sumit Chawla, Preston Crow, Melissa Hirschl, Roberto Hoyle, Keith D. Kotay, Rolf H. Nelson, Nils Nieuwejaar, Scott M. Silver, Michael B. Taylor, Rajiv Wickremesinghe

Computer Science Technical Reports