Glossary of Terms

C-CoMP Data Roadmap - Detailed directions for how and when to share research products in the context of the research life cycle. A detailed description is provided here.

Internal C-CoMP Dataset Numbers - Internal tracking number (format CMP###) assigned to every C-CoMP dataset for organization and sharing purposes. A detailed description and directions for how to request an internal dataset number are provided here.

C-CoMP Data Catalog - A spreadsheet record of all C-CoMP datasets. The C-CoMP Data Catalog is an internal document but an example template is provided here.

Biological and Chemical Oceanography Data Management Office (BCO-DMO) - NSF-funded repository for storing biological and chemical oceanography datasets. Read more about BCO-DMO on their website here.

Domain repository - specialized repository for storing ‘omics files and associated metadata. Example repositories include MetaboLights, Metabolomics Workbench, PRIDE (ProteomeXchange), and NCBI’s Sequence Read Archive.

Metadata - Descriptive qualitative and quantitative data that provide contextual information for other data. A detailed description is provided here.

Raw data - Raw data refers to the initial files that have not been modified, corrected, compressed, or filtered. A detailed description is provided here.

Derived data - Derived data files include files that have been converted into a different format from the original version. A detailed description is provided here.

Data products - Any files generated from data analysis of raw or derived files. A detailed description is provided here.

Research products - All files generated through the research process. Research products include: protocols, code, data (raw, derived), data products, and research articles.

FAIR principles - A set of principles that govern the management and stewardship of data in an open science context. FAIR is the acronym and each letter refers to a separate principle. F = Finadable, A = Accessible, I = Interoperable, R = Reusable. Detailed information about FAIR can be found here.

Tabular data - Datasets that are recorded in a spreadsheet using the rows and columns convention. Rows can be samples and columns can be measurements or conditions. The intersection between a column and row reflects a quantitative or qualitative measure for that sample X measurement type or condition. Most datasets will include tabular data in some form.

Last updated