🔢
C-CoMP Data Management Handbook
  • C-CoMP Data Management Handbook
  • Table of Contents
  • Executive Summary
  • Glossary of Terms
  • Overview
  • C-CoMP Data Roadmap
  • Internal C-CoMP Dataset Numbers
  • Sending samples to other labs
  • Data Group Definitions
  • Data Deposition Instructions
    • Metadata and Tabular Data Files
    • Raw and Derived Data Files
      • LC-MS Metabolomics
      • LC-MS Proteomics
      • NMR Metabolomics
      • Genomics/Sequencing Data
  • Numerical Models
  • Software & Tools
  • Data Products
  • File Naming Conventions
    • LC-MS Metabolomics
    • LC-MS Proteomics
    • NMR Metabolomics
    • Sequencing Files
    • Sequencing Products
    • Numerical Models & Products
    • Derived Files
    • Metadata & Tabular Data
  • File Naming and Data Deposition Example
  • Digital Coordinator Role
  • FAQ
  • Appendix
    • Quick Links
    • Spreadsheet Templates
Powered by GitBook
On this page

Glossary of Terms

PreviousExecutive SummaryNextOverview

Last updated 2 years ago

C-CoMP Data Roadmap - Detailed directions for how and when to share research products in the context of the research life cycle. A detailed description is provided here.

Internal C-CoMP Dataset Numbers - Internal tracking number (format CMP###) assigned to every C-CoMP dataset for organization and sharing purposes. A detailed description and directions for how to request an internal dataset number are provided here.

C-CoMP Data Catalog - A spreadsheet record of all C-CoMP datasets. The C-CoMP Data Catalog is an internal document but an example template is provided here.

Biological and Chemical Oceanography Data Management Office (BCO-DMO) - NSF-funded repository for storing biological and chemical oceanography datasets. Read more about BCO-DMO on their website .

Domain repository - specialized repository for storing ‘omics files and associated metadata. Example repositories include MetaboLights, Metabolomics Workbench, PRIDE (ProteomeXchange), and NCBI’s Sequence Read Archive.

Metadata - Descriptive qualitative and quantitative data that provide contextual information for other data. A detailed description is provided .

Raw data - Raw data refers to the initial files that have not been modified, corrected, compressed, or filtered. A detailed description is provided .

Derived data - Derived data files include files that have been converted into a different format from the original version. A detailed description is provided .

Data products - Any files generated from data analysis of raw or derived files. A detailed description is provided .

Research products - All files generated through the research process. Research products include: protocols, code, data (raw, derived), data products, and research articles.

FAIR principles - A set of principles that govern the management and stewardship of data in an open science context. FAIR is the acronym and each letter refers to a separate principle. F = Finadable, A = Accessible, I = Interoperable, R = Reusable. Detailed information about FAIR can be found .

Tabular data - Datasets that are recorded in a spreadsheet using the rows and columns convention. Rows can be samples and columns can be measurements or conditions. The intersection between a column and row reflects a quantitative or qualitative measure for that sample X measurement type or condition. Most datasets will include tabular data in some form.

here
here
here
here
here
here