Next: , Previous: Refining clusters, Up: Running wcd


3.10 Format of input files

The format of input files is:

What is meant by FASTA format? Each sequence MUST be preceded by an identification line. Each sequence itself may be on one line, or it may be on several lines. If it is on several lines, each line should terminate with a carriage return and there must be NO spaces on each line.

The identification line starts with a `greater-than' sign (>). This is all that is required. IF there is an alphanumeric sequences (string with no blanks) IMMEDIATELY following the greater than sign then that is treated as a sequence ID that is used by a few of the options for display purposes. The rest of the identification line is completely ignored.

Format of clustering input

The merge and add options require as input files that specify a clustering. These files must use the compressed format described below.

Format of constraint file

Constraint files consist of a sequence of constraints, each on a line by itself. Each line in the constraint file is a directive followed by a list of indices, terminated by a full stop `.'. There are three directives and their semantics are described below.