Sequencing Files
Last updated
Please use this file naming template tool for raw sequences as a guide when creating file names.
Each number in the example above corresponds to a field in the file name. Fields are separated by "_" to enhance computer readability. Shortened column names used in the template above are provided in parentheses next to the appropriate field definition.
Dataset Number (Dataset_No):
All C-CoMP datasets will be assigned an internal dataset number. Please request this number on the #dataset_number_requests Slack channel, following the instructions provided above.
Metadata about the dataset (including Dataset number, method type, and data storage location) will be recorded in the C-CoMP Data Catalog.
Approach (Approach):
The type of method used for this specific project and sample (see examples and abbreviations below).
Sample type (Sample_Type):
Use this field to distinguish sample types. Sample type should fall into one of these categories: quality control (QC) or biological sample (SA). QC includes samples run as DNA extraction or sequencing controls to check for contamination during sample preparation.
File number (File_No)
Forward or Reverse Reads (Forward_Reverse):
Either the forward reads (R1) or reverse reads (R2) if applicable to the file type. Use "noR" if this is not applicable.
Sequencing number (Seq_no):
Used when there is additional sequencing data for the same sample and data type. Change this field only if the sample is a technical replicate. A biological replicate, or a sample from a separate extraction process, is assigned a different sample_ID instead. The default number is 001.
File-type extension
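As a rough illustration of how the fields above combine, the following Python sketch joins them with "_" and appends the file-type extension. It assumes that the fields appear in the file name in the same order as they are defined here and that the extension is attached with a dot; neither is confirmed by this page, so check the template before relying on it. The function name `build_file_name` and the example values are hypothetical.

```python
def build_file_name(dataset_no, approach, sample_type, file_no,
                    forward_reverse, extension, seq_no="001"):
    """Join the naming fields with "_" and append the extension.

    Assumption: fields are ordered as they are defined in this guide.
    """
    fields = [str(f) for f in (dataset_no, approach, sample_type,
                               file_no, forward_reverse, seq_no)]
    for field in fields:
        # "_" is the field separator, so it must not occur inside a field.
        if not field or "_" in field:
            raise ValueError(f"invalid field value: {field!r}")
    return "_".join(fields) + "." + extension.lstrip(".")


# Hypothetical values, for illustration only.
name = build_file_name("0001", "AMP", "SA", "01", "R1", "fastq.gz")
print(name)
```

Rejecting values that contain "_" keeps each file name splittable back into its fields with a single `split("_")` on the part before the extension.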
Approach Abbreviations:
WMX - Whole Metagenomic Sequencing (environmental metagenomics)
WQX - Whole Genome Sequencing
AMP - Amplicon Sequencing (e.g., 16S rRNA)
TXX - Transcriptomics
MTX - Environmental Metatranscriptomics
Sample Type Abbreviations:
QC - Quality Control
SA - Biological Sample
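The abbreviation lists above can also be used to check codes before files are named. The sketch below is one possible approach, not an official C-CoMP tool: the function name `check_codes` is hypothetical, and the lookup tables simply copy the abbreviations and read-direction values given on this page.

```python
# Lookup tables copied from the abbreviation lists in this guide.
APPROACHES = {
    "WMX": "Whole Metagenomic Sequencing",
    "WQX": "Whole Genome Sequencing",
    "AMP": "Amplicon Sequencing",
    "TXX": "Transcriptomics",
    "MTX": "Environmental Metatranscriptomics",
}
SAMPLE_TYPES = {"QC": "Quality Control", "SA": "Biological Sample"}
READ_DIRECTIONS = {"R1", "R2", "noR"}


def check_codes(approach, sample_type, forward_reverse):
    """Return a list of problems; an empty list means all codes are known."""
    problems = []
    if approach not in APPROACHES:
        problems.append(f"unknown approach: {approach!r}")
    if sample_type not in SAMPLE_TYPES:
        problems.append(f"unknown sample type: {sample_type!r}")
    if forward_reverse not in READ_DIRECTIONS:
        problems.append(f"unknown read direction: {forward_reverse!r}")
    return problems


print(check_codes("AMP", "SA", "R1"))
print(check_codes("XYZ", "SA", "R3"))
```

The check is case-sensitive, so "amp" or "NOR" would be reported as unknown.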