What is required to run the dereplicator?
The dereplicator takes two files, an alignment file and a cluster file. Both should come from the RDP Pyrosequence Pipeline.
In a dereplicate request, maximum distance is mentioned. What is meant by this and how is it calculated?
Dereplication identifies the centroid of each cluster, i.e. the sequence located at the center of the cluster, by choosing the sequence with the minimum sum of squared distances to the other sequences in that cluster. The tool does this for clusters at different distance cutoffs, and the maximum distance specifies the highest cluster cutoff at which you want dereplication performed.
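As a rough illustration of the criterion described above, here is a minimal Python sketch. It is not the tool's actual code: the function names (centroid_index, dereplicate), the distance matrix, and the cluster layout are all hypothetical, and it assumes pairwise distances are already available as a symmetric matrix.

```python
def centroid_index(dist, members):
    """Return the member with the smallest sum of squared distances
    to the members of its cluster (ties go to the first member listed)."""
    return min(members, key=lambda i: sum(dist[i][j] ** 2 for j in members))


def dereplicate(dist, clusters_by_cutoff, max_distance):
    """Pick one centroid per cluster for every cutoff up to max_distance."""
    reps = {}
    for cutoff, clusters in sorted(clusters_by_cutoff.items()):
        if cutoff > max_distance:
            continue  # cutoffs above the requested maximum are skipped
        reps[cutoff] = [centroid_index(dist, c) for c in clusters]
    return reps


# Hypothetical symmetric distance matrix for four sequences (indices 0-3).
dist = [
    [0.00, 0.02, 0.03, 0.10],
    [0.02, 0.00, 0.01, 0.09],
    [0.03, 0.01, 0.00, 0.08],
    [0.10, 0.09, 0.08, 0.00],
]

# Hypothetical clusters at three distance cutoffs.
clusters_by_cutoff = {
    0.01: [[0], [1, 2], [3]],
    0.03: [[0, 1, 2], [3]],
    0.10: [[0, 1, 2, 3]],
}

reps = dereplicate(dist, clusters_by_cutoff, max_distance=0.03)
```

With a maximum distance of 0.03 in this sketch, centroids are chosen for the 0.01 and 0.03 cutoffs only; the clusters at 0.10 are left alone.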
What files are acceptable for upload?
Only FASTA files may be uploaded. File names can only contain letters, numbers, underscores, and periods.
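If you want to check file names before uploading, the naming rule above can be approximated with a short Python sketch. The function name is hypothetical, and it assumes "letters" and "numbers" mean ASCII letters and digits, which the site may or may not enforce in exactly this way.

```python
import re

# Assumption: only ASCII letters, digits, underscores, and periods are allowed.
_ALLOWED_NAME = re.compile(r"[A-Za-z0-9_.]+")


def is_valid_upload_name(name):
    """Return True if the whole file name uses only the allowed characters."""
    return _ALLOWED_NAME.fullmatch(name) is not None
```

For example, my_seqs.v2.fasta passes this check, while names containing spaces or hyphens, such as "my seqs.fasta" or reads-1.fa, do not.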