Sequence clustering
The software used is given below. It performs a censored c-means clustering of DNA sequences and handles both possible reading directions of each sequence.
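As an illustration of what reading a sequence in the other direction can involve, here is a minimal C++ sketch that computes a reverse complement. It assumes that "both directions" refers to the two strands of the DNA, which the text above does not state explicitly. The function names complement and reverse_complement are hypothetical and are not taken from the distributed sources.

```cpp
#include <algorithm>
#include <string>

// Hypothetical helper: complement of one upper-case DNA base.
// Anything other than A, C, G or T is mapped to 'N' (an assumption).
char complement(char base) {
    switch (base) {
        case 'A': return 'T';
        case 'T': return 'A';
        case 'C': return 'G';
        case 'G': return 'C';
        default:  return 'N';
    }
}

// Hypothetical helper: the sequence as read from the opposite strand.
std::string reverse_complement(const std::string& seq) {
    std::string out(seq.rbegin(), seq.rend());  // reverse the order
    std::transform(out.begin(), out.end(), out.begin(), complement);
    return out;
}
```

A clustering program could compare each sequence to a cluster centre in both orientations and keep the better match; whether the distributed code does it this way is not confirmed here.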
The sources compile on Linux with GNU g++. To install, download the tar file, extract it into a directory, and type make. Example inputs can be found in the files *.in. The program fasta_to_profile.cpp shows how to make an input file from a FASTA-format sequence file.
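For readers unfamiliar with the FASTA format, the following C++ sketch shows one way to read its records (a header line starting with '>' followed by one or more sequence lines). It is not the code of fasta_to_profile.cpp and does not produce the profile input format, which is defined only by the distributed sources. The names FastaRecord and read_fasta are hypothetical.

```cpp
#include <istream>
#include <sstream>
#include <string>
#include <vector>

// Hypothetical record type: one FASTA entry.
struct FastaRecord {
    std::string header;    // text after the leading '>'
    std::string sequence;  // all sequence lines joined together
};

// Hypothetical reader: collects every record found in the stream.
std::vector<FastaRecord> read_fasta(std::istream& in) {
    std::vector<FastaRecord> records;
    std::string line;
    while (std::getline(in, line)) {
        // Tolerate Windows line endings.
        if (!line.empty() && line.back() == '\r') line.pop_back();
        if (line.empty()) continue;
        if (line[0] == '>') {
            records.push_back(FastaRecord{line.substr(1), ""});
        } else if (!records.empty()) {
            // Sequence data may be wrapped over several lines.
            records.back().sequence += line;
        }
    }
    return records;
}
```

Sequence lines that appear before the first header are ignored in this sketch; a stricter reader might report them as an error instead.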
The files are tar files or gzip'ed tar files. To unpack a plain tar
file use the tar -xvf command; for a gzip'ed tar file use the
tar -xvzf command. If need be, contact the maintainer for a
different archive format.