

Clinical Analysis

Overview

Created by pfurio on 05/06/17.

Summary

Field
Create
Update
Unique
Required

id

description

type

disorder

files

proband

family

panels

panelLock

locked

interpretation

secondaryInterpretations

consent

analyst

priority

flags

dueDate

qualityControl

comments

audit

attributes

status

uuid

creationDate

modificationDate

release

internal

Data Model

ClinicalAnalysis

Field
Description

id String

The ClinicalAnalysis ID is mandatory when creating a new ClinicalAnalysis. It cannot currently be changed.

Tags: required, immutable, unique

uuid String

Globally unique ID across the whole OpenCGA installation. It is created automatically when the ClinicalAnalysis is created and cannot be changed.

Tags: internal, unique, immutable

description String

A string describing the properties of the ClinicalAnalysis.

disorder Disorder

panelLock boolean

locked boolean

analyst ClinicalAnalyst

creationDate String

String representing when the ClinicalAnalysis was created. This is set automatically by OpenCGA.

Tags: internal

modificationDate String

String representing when the ClinicalAnalysis was last modified. This is set automatically by OpenCGA.

Tags: internal

dueDate String

release int

An integer describing the current data release.

Tags: internal

comments List<ClinicalComment>

audit List<ClinicalAudit>

internal ClinicalAnalysisInternal

An object describing the internal information of the ClinicalAnalysis. This is managed by OpenCGA.

Tags: internal

attributes Map<String,Object>

Use this field to store any other information. Note that it is not indexed, so you cannot search by attributes.

status Status

An object describing the status of the ClinicalAnalysis.
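
The tags used in the table above (required, immutable, unique, internal) suggest how a client might sanity-check a payload before sending it. The following is a minimal sketch under stated assumptions, not OpenCGA's own validation: the `FIELD_TAGS` table is copied by hand from the ClinicalAnalysis descriptions above, and the helper names `missing_required` and `rejected_on_update` are hypothetical.

```python
# Tags copied by hand from the ClinicalAnalysis table above (illustrative subset).
FIELD_TAGS = {
    "id": {"required", "immutable", "unique"},
    "uuid": {"internal", "unique", "immutable"},
    "creationDate": {"internal"},
    "modificationDate": {"internal"},
    "release": {"internal"},
}


def missing_required(payload):
    """Return the required fields that are absent or empty in a create payload."""
    return sorted(
        name for name, tags in FIELD_TAGS.items()
        if "required" in tags and not payload.get(name)
    )


def rejected_on_update(payload):
    """Return the fields of an update payload that are tagged immutable or internal."""
    return sorted(
        name for name in payload
        if FIELD_TAGS.get(name, set()) & {"immutable", "internal"}
    )


# Hypothetical payloads, for illustration only.
create_payload = {"id": "analysis-1", "description": "Example case"}
update_payload = {"description": "Updated text", "uuid": "made-up-value"}

print(missing_required(create_payload))    # no required field is missing
print(rejected_on_update(update_payload))  # uuid is internal and immutable
```

Fields that do not appear in `FIELD_TAGS`, such as `description`, are treated as freely editable in this sketch.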

File

Field
Description

id String

The File ID is mandatory when creating a new File. It cannot currently be changed.

Tags: required, immutable, unique

uuid String

Globally unique ID across the whole OpenCGA installation. It is created automatically when the File is created and cannot be changed.

Tags: internal, unique, immutable

name String

type File.Type

Formats: file, folder, index.

format File.Format

Formats: txt, executable, image, ...

bioformat File.Bioformat

BAM, VCF, ...

checksum String

uri URI

path String

release int

An integer describing the current data release.

Tags: internal

creationDate String

String representing when the File was created. This is set automatically by OpenCGA.

Tags: internal

modificationDate String

String representing when the File was last modified. This is set automatically by OpenCGA.

Tags: internal

description String

A string describing the properties of the File.

external boolean

size long

software Software

sampleIds List<String>

jobId String

tags List<String>

stats Map<String,Object>

status CustomStatus

An object describing the status of the File.

internal FileInternal

An object describing the internal information of the File. This is managed by OpenCGA.

Tags: internal

attributes Map<String,Object>

Use this field to store any other information. Note that it is not indexed, so you cannot search by attributes.

FileExperiment

Field
Description

manufacturer String

platform String

library String

date String

center String

lab String

responsible String

description String

attributes Map<String,Object>

Enum FileExperiment.Method

Field
Description

WHOLE_EXOME_SEQUENCING

WHOLE_GENOME_SEQUENCING

TARGETED_DNA_SEQUENCING

AMPLICON_SEQUENCING

GENOTYPING_MICROARRAY

Enum FileExperiment.NucleicAcidType

Field
Description

DNA

RNA

Enum FileExperiment.Technology

Field
Description

SEQUENCING

MICROARRAY

CustomStatus

Field
Description

name String

Tags: required, immutable

description String

Tags: required

date String

Tags: internal, unique, immutable

Enum File.Format

Field
Description

VCF

BCF

GVCF

TBI

BIGWIG

SAM

BAM

BAI

CRAM

CRAI

FASTQ

FASTA

PED

TAB_SEPARATED_VALUES

COMMA_SEPARATED_VALUES

XML

PROTOCOL_BUFFER

JSON

AVRO

PARQUET

IMAGE

PLAIN

BINARY

NONE

UNKNOWN

FileQualityControl

Field
Description

comments List<ClinicalComment>

AlignmentFileQualityControl

Field
Description

fastQcMetrics FastQcMetrics

samtoolsStats SamtoolsStats

samtoolsFlagStats SamtoolsFlagstats

hsMetrics HsMetrics

CoverageFileQualityControl

Field
Description

geneCoverageStats List<GeneCoverageStats>

VariantFileQualityControl

Field
Description

variantSetMetrics VariantSetStats

ascatMetrics AscatMetrics

FileRelatedFile

Field
Description

Enum FileRelatedFile.Relation

Field
Description

PRODUCED_FROM

PART_OF_PAIR

PEDIGREE

REFERENCE_GENOME

Enum File.Type

Field
Description

FILE

DIRECTORY

Enum File.Bioformat

Field
Description

MICROARRAY_EXPRESSION_ONECHANNEL_AGILENT

MICROARRAY_EXPRESSION_ONECHANNEL_AFFYMETRIX

MICROARRAY_EXPRESSION_ONECHANNEL_GENEPIX

MICROARRAY_EXPRESSION_TWOCHANNELS_AGILENT

MICROARRAY_EXPRESSION_TWOCHANNELS_GENEPIX

DATAMATRIX_EXPRESSION

IDLIST

IDLIST_RANKED

ANNOTATION_GENEVSANNOTATION

OTHER_NEWICK

OTHER_BLAST

OTHER_INTERACTION

OTHER_GENOTYPE

OTHER_PLINK

OTHER_VCF

OTHER_PED

VCF4

VARIANT

ALIGNMENT

COVERAGE

SEQUENCE

PEDIGREE

REFERENCE_GENOME

NONE

UNKNOWN

FileInternal

Field
Description

sampleMap Map<String>

FileIndex

Field
Description

userId String

creationDate String

jobId long

release long

attributes Map<String,Object>

FileIndex.TransformedFile

Field
Description

id long

metadataId long

FileIndex.LocalFileIndex

Field
Description

fileId long

indexer String

FileIndex.IndexStatus

Field
Description

NONE String

TRANSFORMING String

TRANSFORMED String

LOADING String

INDEXING String

STATUS_LIST List<String>

FileStatus

Field
Description

TRASHED String

TRASHED means that the object is marked as deleted but is still available in the database.

STAGE String

MISSING String

PENDING_DELETE String

DELETING String

REMOVED String

MISSING_SAMPLES String

STATUS_LIST List<String>

MissingSamples

Field
Description

existing List<String>

nonExisting List<String>

Family

Field
Description

id String

The Family ID is mandatory when creating a new Family. It cannot currently be changed.

Tags: required, immutable, unique

uuid String

Globally unique ID across the whole OpenCGA installation. It is created automatically when the Family is created and cannot be changed.

Tags: internal, unique, immutable

name String

phenotypes List<Phenotype>

disorders List<Disorder>

creationDate String

String representing when the Family was created. This is set automatically by OpenCGA.

Tags: internal

modificationDate String

String representing when the Family was last modified. This is set automatically by OpenCGA.

Tags: internal

expectedSize int

description String

A string describing the properties of the Family.

release int

An integer describing the current data release.

Tags: internal

version int

An integer describing the current version.

Tags: internal

status CustomStatus

An object describing the status of the Family.

internal FamilyInternal

An object describing the internal information of the Family. This is managed by OpenCGA.

Tags: internal

roles Map<String,Map<String,FamiliarRelationship>>

attributes Map<String,Object>

Use this field to store any other information. Note that it is not indexed, so you cannot search by attributes.

FamilyQualityControl

Field
Description

relatedness List<RelatednessReport>

fileIds List<String>

comments List<ClinicalComment>

Individual

Field
Description

id String

Individual ID in the study. It must be unique within the study but can be repeated in different studies. This is a mandatory parameter when creating a new Individual, and the ID cannot currently be changed.

Tags: required, immutable, unique

uuid String

Globally unique ID across the whole OpenCGA installation. It is created automatically when the Individual is created and cannot be changed.

Tags: internal, unique, immutable

name String

familyIds List<String>

sex Sex

karyotypicSex KaryotypicSex

ethnicity String

dateOfBirth String

release int

An integer describing the current data release.

Tags: internal

version int

An integer describing the current version.

Tags: internal

creationDate String

String representing when the Individual was created. This is set automatically by OpenCGA.

Tags: internal

modificationDate String

String representing when the Individual was last modified. This is set automatically by OpenCGA.

Tags: internal

lifeStatus LifeStatus

phenotypes List<Phenotype>

A List with related phenotypes.

disorders List<Disorder>

A List with related disorders.

samples List<Sample>

A List with related samples.

parentalConsanguinity boolean

attributes Map<String,Object>

Use this field to store any other information. Note that it is not indexed, so you cannot search by attributes.
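
The uniqueness rule for the Individual `id` (unique within a study, repeatable across studies) can be pictured as a key made of the study and the ID together. The following is a toy in-memory sketch of that rule only. The `IndividualRegistry` class and the study and individual names are hypothetical and are not part of OpenCGA.

```python
class IndividualRegistry:
    """Toy registry enforcing that an Individual ID is unique per study."""

    def __init__(self):
        self._keys = set()  # holds (study, individual_id) pairs

    def register(self, study, individual_id):
        if not individual_id:
            raise ValueError("id is required")
        key = (study, individual_id)
        if key in self._keys:
            raise ValueError(
                f"Individual '{individual_id}' already exists in study '{study}'"
            )
        self._keys.add(key)


registry = IndividualRegistry()
registry.register("study-a", "individual-1")
registry.register("study-b", "individual-1")  # allowed: different study

try:
    registry.register("study-a", "individual-1")  # rejected: same study, same ID
except ValueError as err:
    print(err)
```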

IndividualQualityControl

Field
Description

inferredSexReports List<InferredSexReport>

List of inferred sex reports. The content depends on the inference method (currently coverage ratio).

sampleRelatednessReport SampleRelatednessReport

mendelianErrorReports List<MendelianErrorReport>

Mendelian errors

fileIds List<String>

File IDs related to the quality control

comments List<ClinicalComment>

Comments related to the quality control

Sample

Field
Description

id String

Sample ID in the study. It must be unique within the study but can be repeated in different studies. This is a mandatory parameter when creating a new sample, and the ID cannot currently be changed.

Tags: required, immutable, unique

uuid String

Generic: Unique 32-character identifier assigned automatically by OpenCGA.

Tags: immutable, unique

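processing SampleProcessing

Describes how the sample was processed in the lab.

collection SampleCollection since: 2.1

Describes how the sample was collected.

Note: The sample collection is a list of samples

qualityControl SampleQualityControl since: 2.1

Contains different metrics to evaluate the quality of the sample.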

release int

An integer describing the current data release.

Tags: immutable

version int

Generic: Autoincremental version assigned to the registered entry. By default, updates do not create new versions. To enable versioning, users must set the incVersion flag of the /update web service when updating the document.

Tags: immutable

creationDate String

Generic: Autogenerated date following the format YYYYMMDDhhmmss containing the date when the entry was first registered.

Tags: immutable

modificationDate String

Generic: Autogenerated date following the format YYYYMMDDhhmmss containing the date when the entry was last modified.

Tags: immutable

description String

Generic: Users may provide a description for the entry.

somatic boolean

Indicates if the sample is somatic or germline (default)

phenotypes List<Phenotype>

individualId String

fileIds List<String>

cohortIds List<String>

status CustomStatus

Generic: Object to define the status of the entry.

internal SampleInternal

Generic: Field automatically managed by OpenCGA containing relevant information of the entry. This field is used for internal purposes and is visible for users.

Tags: immutable

attributes Map<String,Object>

Dictionary that can be customised by users to store any additional information users may require.

Note: This field is not meant to be queried. It should only contain extra information. To store additional information meant to be queried, please use annotationSets.
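
The `creationDate` and `modificationDate` fields of the Sample are described above as strings in the format YYYYMMDDhhmmss. As a minimal sketch, such a value can be converted with Python's standard `datetime` module. The helper names and the example value are hypothetical, and since the text does not state a time zone, the result is a naive `datetime`.

```python
from datetime import datetime

# strptime/strftime pattern equivalent to YYYYMMDDhhmmss
OPENCGA_DATE_FORMAT = "%Y%m%d%H%M%S"


def parse_opencga_date(value):
    """Parse a YYYYMMDDhhmmss string into a naive datetime."""
    return datetime.strptime(value, OPENCGA_DATE_FORMAT)


def format_opencga_date(moment):
    """Format a datetime back into a YYYYMMDDhhmmss string."""
    return moment.strftime(OPENCGA_DATE_FORMAT)


created = parse_opencga_date("20170605101530")  # made-up example value
print(created.isoformat())           # 2017-06-05T10:15:30
print(format_opencga_date(created))  # 20170605101530
```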

SampleCollection

Field
Description

tissue String

organ String

quantity String

method String

date String

attributes Map<String,Object>

SampleQualityControl

Field
Description

fileIds List<String>

comments List<ClinicalComment>

SampleVariantQualityControlMetrics

Field
Description

variantStats List<SampleQcVariantStats>

signatures List<Signature>

genomePlots List<GenomePlot>

SampleProcessing

Field
Description

product String

preparationMethod String

extractionMethod String

labSampleId String

quantity String

date String

attributes Map<String,Object>

SampleInternal

Field
Description

RgaIndex

Field
Description

status RgaIndex.Status

Tags: internal, unique, immutable

date String

Tags: internal, unique, immutable

Enum RgaIndex.Status

Field
Description

NOT_INDEXED

INDEXED

INVALID_PERMISSIONS

INVALID_METADATA

INVALID

Location

Field
Description

address String

postalCode String

city String

state String

country String

IndividualPopulation

Field
Description

name String

subpopulation String

description String

IndividualInternal

FamilyInternal

Field
Description

FamilyStatus

Field
Description

INCOMPLETE String

STATUS_LIST List<String>

ClinicalAnalysisQualityControl

Field
Description

comments List<ClinicalComment>

Enum ClinicalAnalysisQualityControl.QualityControlSummary

Field
Description

HIGH

MEDIUM

LOW

DISCARD

NEEDS_REVIEW

UNKNOWN

Enum ClinicalAnalysis.Type

Field
Description

SINGLE

FAMILY

CANCER

COHORT

AUTOCOMPARATIVE

ClinicalConsentAnnotation

Field
Description

date String

ClinicalConsentParam

Field
Description

id String

name String

description String

Enum ClinicalConsentParam.Value

Field
Description

YES

NO

UNKNOWN

FlagAnnotation

Field
Description

id String

description String

date String

ClinicalPriorityAnnotation

Field
Description

id String

description String

rank int

date String

Interpretation

Field
Description

studyUid long

uid long

release int

InterpretationStats

Field
Description

InterpretationFindingStats

Field
Description

numVariants int

tierCount Map<String,Integer>

variantStatusCount Map<ClinicalVariant.Status,Integer>

geneCount Map<String,Integer>

Panel

Field
Description

uuid String

release int

version int

OpenCGA version of this panel. It is incremented each time the panel is updated.

author String

studyUid long

uid long

Status

Field
Description

name String

date String

description String

message String

READY String

READY means that the object is in use.

DELETED String

DELETED means that the object is marked as removed, so it can be completely removed from the database with a clean action.

STATUS_LIST List<String>

InterpretationInternal

Field
Description

InterpretationStatus

Field
Description

NOT_REVIEWED String

UNDER_REVIEW String

REVIEWED String

REJECTED String

STATUS_LIST List<String>

ClinicalAnalysisInternal

Field
Description

ClinicalAnalysisStatus

Field
Description

INCOMPLETE String

READY_FOR_VALIDATION String

READY_FOR_INTERPRETATION String

INTERPRETATION_IN_PROGRESS String

READY_FOR_INTEPRETATION_REVIEW String

INTERPRETATION_REVIEW_IN_PROGRESS String

READY_FOR_REPORT String

REPORT_IN_PROGRESS String

DONE String

REVIEW_IN_PROGRESS String

CLOSED String

REJECTED String

STATUS_LIST List<String>
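
The status names listed above can be used client-side to reject an unknown status before calling a web service. This is a minimal sketch, assuming that `STATUS_LIST` holds exactly the names shown in this table (the text does not confirm this, and statuses inherited from the generic Status model may also apply). The names are copied verbatim, including the spelling `READY_FOR_INTEPRETATION_REVIEW`. The helper `is_valid_status` is hypothetical.

```python
# Status names copied verbatim from the ClinicalAnalysisStatus table above.
CLINICAL_ANALYSIS_STATUS_LIST = [
    "INCOMPLETE",
    "READY_FOR_VALIDATION",
    "READY_FOR_INTERPRETATION",
    "INTERPRETATION_IN_PROGRESS",
    "READY_FOR_INTEPRETATION_REVIEW",  # spelled as documented
    "INTERPRETATION_REVIEW_IN_PROGRESS",
    "READY_FOR_REPORT",
    "REPORT_IN_PROGRESS",
    "DONE",
    "REVIEW_IN_PROGRESS",
    "CLOSED",
    "REJECTED",
]

_STATUS_SET = frozenset(CLINICAL_ANALYSIS_STATUS_LIST)


def is_valid_status(name):
    """Return True if name is one of the documented status names (case-sensitive)."""
    return name in _STATUS_SET


print(is_valid_status("READY_FOR_REPORT"))  # True
print(is_valid_status("ready_for_report"))  # False: the check is case-sensitive
```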


Last updated 3 years ago


Fields whose types are other data models, listed by model:

ClinicalAnalysis: type ClinicalAnalysis.Type; files List<File>; proband Individual; family Family; panels List<Panel>; interpretation Interpretation; secondaryInterpretations List<Interpretation>; consent ClinicalConsentAnnotation; priority ClinicalPriorityAnnotation; flags List<FlagAnnotation>; qualityControl ClinicalAnalysisQualityControl; internal ClinicalAnalysisInternal

File: type File.Type; format File.Format; bioformat File.Bioformat; experiment FileExperiment; relatedFiles List<FileRelatedFile>; qualityControl FileQualityControl; status CustomStatus; internal FileInternal

FileExperiment: technology FileExperiment.Technology; method FileExperiment.Method; nucleicAcidType FileExperiment.NucleicAcidType

FileQualityControl: variant VariantFileQualityControl; alignment AlignmentFileQualityControl; coverage CoverageFileQualityControl

FileRelatedFile: file File; relation FileRelatedFile.Relation

FileInternal: status FileStatus; index FileIndex; missingSamples MissingSamples

FileIndex: status FileIndex.IndexStatus; transformedFile FileIndex.TransformedFile; localFileIndex FileIndex.LocalFileIndex

Family: members List<Individual>; qualityControl FamilyQualityControl; status CustomStatus; internal FamilyInternal

Individual: father Individual; mother Individual; location Location; qualityControl IndividualQualityControl; population IndividualPopulation; samples List<Sample>; status CustomStatus; internal IndividualInternal

Sample: processing SampleProcessing; collection SampleCollection (since: 2.1); qualityControl SampleQualityControl (since: 2.1); status CustomStatus; internal SampleInternal

SampleQualityControl: variant SampleVariantQualityControlMetrics

SampleInternal: rga RgaIndex

RgaIndex: status RgaIndex.Status

FamilyInternal: status FamilyStatus

ClinicalAnalysisQualityControl: summary ClinicalAnalysisQualityControl.QualityControlSummary

ClinicalConsentAnnotation: consents List<ClinicalConsentParam>

ClinicalConsentParam: value ClinicalConsentParam.Value

Interpretation: panels List<Panel>; stats InterpretationStats; internal InterpretationInternal

InterpretationStats: primaryFindings InterpretationFindingStats; secondaryFindings InterpretationFindingStats

Panel: status Status

InterpretationInternal: status InterpretationStatus

ClinicalAnalysisInternal: status ClinicalAnalysisStatus