ISA Best Practice Guide

This guide refers to SEEK, but is also relevant for FAIRDOMHUB, which is an instance of SEEK.

The ISA Infrastructure (Investigation, Study, Assay) is a general purpose framework for describing how experiments relate to one another. It describes both metadata (samples, characteristics, technologies, etc) and data (transcriptomics, proteomics etc), but the data itself is stored separately, and can therefore be as public or private as required. The ISA descriptions are visible to the rest of SEEK users.
The original purpose of the ISA infrastructure was to provide a common framework for relating multiple omics data and to provide a single mechanism for submission to omics data repositories (such as ArrayExpress for Microarray data, or PRIDE, for proteomics data). SEEK data often involves omics data and generally relies on the integration of multiple data types, so the ISA infrastructure will provide a mechanism for SEEK data export as well as providing a common framework for navigation.
For more information on the ISA community work, please see:
http://isa-tools.org/

Examples

There are many ways to describe your projects and the relationships between the different work-packages and research topics. The following guidelines provide a general description of the ISA Infrastructure and examples of how this may fit with your SEEK project. We use examples from the Bioinvestigation Index (BII), which is a public database of ISA compliant data. We also use examples already available in SEEK

  • Investigation: a high level description of the area and the main aims of a project
    • In SEEK, this may be the overall aims as stated on your grant.
      If your project has several subprojects that do not share any data, you should define an investigation for each.
    • BII Example: Growth control of the eukaryote cell: a systems biology study in yeast
    • SEEK Example Analysis of Central Carbon Metabolism of Sulfolobus solfataricus under varying temperatures
  • Study: a particular biological hypothesis or analysis
    • In SEEK, these studies may contain only experimental assays, only modelling analyses, only informatics analyses,
      or a mixture of all
    • BII Example: Study of the impact of changes in flux on the transcriptome, proteome, endometabolome and exometabolome
      of the yeast Saccharomyces cerevisiae under different nutrient limitations
    • SEEK Example: Comparison of S. solfataricus grown at 70 and 80 degrees
  • Assay: specific, individual experiments required to be undertaken together in order to address the study hypotheses.
    • In SEEK, this may be one single microarray experiment, or one flux balance analysis, for example
    • BII Example:
      • Transcriptional profiling
      • DNA Microarray
    • SEEK Example:
      • Comparison of transcriptome 70 and 80c (Cdna microarray)
      • Comparison of proteome at 70 and 80c (Protein expression profiling)
      • Intracellular metabolomics of s. solfataricus at 70 and 80c (Metabolomics)

Required Fields in the SEEK ISA

A short guide to creating a new ISA file

Investigation

Title – The name of the investigation. All assets should have a title
Description – free text describing the purpose of the investigation
Project – The project that the investigation belongs to. All assets should be associated with a project

Study

Title – The name of the Study. All assets should have a title
Description – free text describing the purpose of the study
Person Responsible – the creator of the study description
Experimentalists – other scientists involved in any of the work in the study. If they are in your instance of SEEK, please give their SEEK ID, so we can link to their profiles in SEEK. If they are not in SEEK, their names will still be displayed, but not linked. People’s involvement can also be inferred from anyone named in included assays.
Project – The project that the study belongs to. All assets should be associated with a project
Investigation title – The name of the investigation the study belongs to.

Assay

Assay Title – The name of the assay that the asset links to.
Assay type – The controlled vocabulary term describing the assay classification
If you cannot find a suitable Assay Type term, then you can generate your own.
Technology type – The controlled vocabulary term describing the technology used in the assay
If you cannot find a suitable Technology Type term then you can generate your own.
Study title – The name of the study the assay belongs to. An assay can belong to more than one study
Organism The name of the organism being studied with this asset. If uploading through SEEK, organism names are provided as a drop-down list
Strain The name of the particular strain being studied
WT/Mutant – Specifies whether this strain is wild-type or mutant
Genotype – a brief text description of the genotype
Phenotype – a brief text description of the phenotype
Culture Growth – This describes how the culture was grown for this assay. You can choose values from a drop-down list when adding new assays directly to SEEK
Data File Titles – the title of one or many data files produced during this assay
SOP Titles – the title of one or many SOPs used to execute this assay

Contributing

SEEK documentation is a community driven activity. If you have any modifications you want to make to the guidelines please send requests, or feedback to community@fair-dom.org.