Catalogue of Molecular Biology Programs       Release 6.1       23 Jul 1999
All thanks are in the file biocatal.thanks.
=====================================================================
AC            BC00402
NAME          DBCOMP
DOMAIN        Databases tools
DESCRIPTION   DBCOMP (and its counterpart DBUNCOMP) is based on the idea
DESCRIPTION   of substitutional compression. Whereas
DESCRIPTION   conventional substitutional compressors reduce
DESCRIPTION   redundant phrases of rather small word-size, they do
DESCRIPTION   not eliminate identical lines of consecutive entries,
DESCRIPTION   which is a typical source of redundancy in
DESCRIPTION   automatically acquired biological sequence data. In
DESCRIPTION   contrast, DBCOMP is a line-oriented substitutional
DESCRIPTION   compressor. By using a flexible word-size of one line,
DESCRIPTION   the program replaces identical lines with a pointer to
DESCRIPTION   the previous occurrence of those lines. The
DESCRIPTION   compression scheme of DBCOMP is particularly suited to
DESCRIPTION   compress databases that are highly redundant in the
DESCRIPTION   annotation section as is typical of databases derived
DESCRIPTION   from genome projects.
AUTHOR        R. Doelz and F. Eggenberger
RA            Doelz R., Eggenberger F.;
RT            "A compression mechanism for sequence databases
RT            to improve the efficiency of conventional tools.";
RL            Comput. Appl. Biosci. 11(2):219-223(1995).
RX            Medline; 95346420.
RX            SeqAnalRef; DOER9501.
ADDRESS       Biocomputing, Basel University, Biozentrum,
ADDRESS       Klingelbergstrasse 70, CH-4056 Basel, Switzerland
CONTACT       embnet@comp.bioz.unibas.ch
SITE          ftp anonymous nic.switch.ch
SITE          Directory /mirror/embnet-ch/dbcomp
SITE-CONTACT  -
OS            Irix, Ultrix, OSF/1, AIX, SunOS, VAX/VMS, AXP/VMS, DOS,
OS            DOS/Windows and MacOS
LANGUAGE      ANSI C
VOLUME        -
REQUIRES      -
COMMENTS      Both a command-line version and a graphical user
COMMENTS      interface (GUI) are available. The GUI was written with the Software
COMMENTS      Development ToolKit from the National Center for Biotechnology
COMMENTS      Information (NCBI).
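
The line-oriented substitution described in the DESCRIPTION field can be illustrated with a minimal sketch. This is not DBCOMP's actual file format or its ANSI C implementation; the token layout (`"L"` for a literal line, `"P"` for a pointer) and the function names `compress_lines` and `decompress_lines` are assumptions made only for illustration. The idea shown is the one the entry states: a line that has already occurred is replaced by a pointer to its previous occurrence.

```python
from typing import Dict, List, Tuple, Union

# Hypothetical token layout (not DBCOMP's real format):
#   ("L", text)  -> a literal line stored as-is
#   ("P", index) -> a pointer to the 0-based number of an earlier line
Token = Tuple[str, Union[str, int]]


def compress_lines(lines: List[str]) -> List[Token]:
    """Replace each repeated line with a pointer to its previous occurrence."""
    last_seen: Dict[str, int] = {}  # line text -> most recent line number
    tokens: List[Token] = []
    for number, line in enumerate(lines):
        if line in last_seen:
            tokens.append(("P", last_seen[line]))
        else:
            tokens.append(("L", line))
        # Remember this position so a later repeat points at the nearest copy.
        last_seen[line] = number
    return tokens


def decompress_lines(tokens: List[Token]) -> List[str]:
    """Rebuild the original lines by following pointers into the output so far."""
    lines: List[str] = []
    for kind, value in tokens:
        if kind == "L":
            lines.append(value)
        else:
            # Pointers always refer to an earlier, already rebuilt line.
            lines.append(lines[value])
    return lines


# Two made-up entries that share their annotation lines, as is typical of
# consecutive records from a genome project.
entries = [
    "ID   SEQ1",
    "OS   Homo sapiens",
    "RA   Doe J.",
    "//",
    "ID   SEQ2",
    "OS   Homo sapiens",
    "RA   Doe J.",
    "//",
]

tokens = compress_lines(entries)
restored = decompress_lines(tokens)
pointer_count = sum(1 for kind, _ in tokens if kind == "P")
```

In this sketch only the `ID` line of the second entry is stored literally; the three repeated annotation lines become pointers. A real compressor would also need a compact on-disk encoding for literals and pointers, which the sketch leaves out.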