package google.genomics.v1

Mouse Melon logoGet desktop application:
View/edit binary Protocol Buffers messages

service AnnotationServiceV1

annotations.proto:34

This service provides storage and positional retrieval of genomic reference annotations, including variant annotations.

service DatasetServiceV1

datasets.proto:33

This service manages datasets, which are collections of genomic data.

service ReadServiceV1

reads.proto:45

The Readstore. A data store for DNA sequencing Reads.

service ReferenceServiceV1

references.proto:27

service StreamingReadService

reads.proto:33

service StreamingVariantService

variants.proto:31

service VariantServiceV1

variants.proto:43

message Annotation

annotations.proto:220

An annotation describes a region of reference genome. The value of an annotation may be one of several canonical types, supplemented by arbitrary info tags. An annotation is not inherently associated with a specific sample or individual (though a client could choose to use annotations in this way). Example canonical annotation types are `GENE` and `VARIANT`.

Used as response type in: AnnotationServiceV1.CreateAnnotation, AnnotationServiceV1.GetAnnotation, AnnotationServiceV1.UpdateAnnotation

Used as field type in: BatchCreateAnnotationsRequest, BatchCreateAnnotationsResponse.Entry, CreateAnnotationRequest, SearchAnnotationsResponse, UpdateAnnotationRequest

message AnnotationSet

annotations.proto:188

An annotation set is a logical grouping of annotations that share consistent type information and provenance. Examples of annotation sets include 'all genes from refseq', and 'all variant annotations from ClinVar'.

Used as response type in: AnnotationServiceV1.CreateAnnotationSet, AnnotationServiceV1.GetAnnotationSet, AnnotationServiceV1.UpdateAnnotationSet

Used as field type in: CreateAnnotationSetRequest, SearchAnnotationSetsResponse, UpdateAnnotationSetRequest

enum AnnotationType

annotations.proto:685

When an [Annotation][google.genomics.v1.Annotation] or [AnnotationSet][google.genomics.v1.AnnotationSet] is created, if `type` is not specified it will be set to `GENERIC`.

Used in: Annotation, AnnotationSet, SearchAnnotationSetsRequest

message BatchCreateAnnotationsResponse.Entry

annotations.proto:594

Used in: BatchCreateAnnotationsResponse

message CallSet

variants.proto:574

A call set is a collection of variant calls, typically for one sample. It belongs to a variant set. For more genomics resource definitions, see [Fundamentals of Google Genomics](https://cloud.google.com/genomics/fundamentals-of-google-genomics)

Used as response type in: VariantServiceV1.CreateCallSet, VariantServiceV1.GetCallSet, VariantServiceV1.UpdateCallSet

Used as field type in: CreateCallSetRequest, SearchCallSetsResponse, UpdateCallSetRequest

message CigarUnit

cigar.proto:28

A single CIGAR operation.

Used in: LinearAlignment

enum CigarUnit.Operation

cigar.proto:31

Describes the different types of CIGAR alignment operations that exist. Used wherever CIGAR alignments are used.

Used in: CigarUnit

message CoverageBucket

reads.proto:369

A bucket over which read coverage has been precomputed. A bucket corresponds to a specific range of the reference sequence.

Used in: ListCoverageBucketsResponse

message Dataset

datasets.proto:166

A Dataset is a collection of genomic data. For more genomics resource definitions, see [Fundamentals of Google Genomics](https://cloud.google.com/genomics/fundamentals-of-google-genomics)

Used as response type in: DatasetServiceV1.CreateDataset, DatasetServiceV1.GetDataset, DatasetServiceV1.UndeleteDataset, DatasetServiceV1.UpdateDataset

Used as field type in: CreateDatasetRequest, ListDatasetsResponse, UpdateDatasetRequest

enum ExportVariantSetRequest.Format

variants.proto:667

Used in: ExportVariantSetRequest

message ExternalId

annotations.proto:498

Used in: VariantAnnotation.ClinicalCondition

enum ImportReadGroupSetsRequest.PartitionStrategy

reads.proto:228

Used in: ImportReadGroupSetsRequest

message ImportReadGroupSetsResponse

reads.proto:275

The read group set import response.

enum ImportVariantsRequest.Format

variants.proto:612

Used in: ImportVariantsRequest

message ImportVariantsResponse

variants.proto:654

The variant data import response.

enum InfoMergeOperation

variants.proto:946

Operations to be performed during import on Variant info fields. These operations are set for each info field in the info_merge_config map of ImportVariantsRequest, which is plumbed down to the MergeVariantRequests generated by the import job.

Used in: ImportVariantsRequest, MergeVariantsRequest

message LinearAlignment

readalignment.proto:32

A linear alignment can be represented by one CIGAR string. Describes the mapped position and local alignment of the read to the reference.

Used in: Read

message OperationEvent

operations.proto:66

An event that occurred during an [Operation][google.longrunning.Operation].

Used in: OperationMetadata

message OperationMetadata

operations.proto:30

Metadata describing an [Operation][google.longrunning.Operation].

message Position

position.proto:31

An abstraction for referring to a genomic position, in relation to some already known reference. For now, represents a genomic position as a reference name, a base number on that reference (0-based), and a determination of forward or reverse strand.

Used in: LinearAlignment, Read

message Range

range.proto:28

A 0-based half-open genomic coordinate range for search requests.

Used in: CoverageBucket

message Read

readalignment.proto:130

A read alignment describes a linear alignment of a string of DNA to a [reference sequence][google.genomics.v1.Reference], in addition to metadata about the fragment (the molecule of DNA sequenced) and the read (the bases which were read by the sequencer). A read is equivalent to a line in a SAM file. A read belongs to exactly one read group and exactly one [read group set][google.genomics.v1.ReadGroupSet]. For more genomics resource definitions, see [Fundamentals of Google Genomics](https://cloud.google.com/genomics/fundamentals-of-google-genomics) ### Reverse-stranded reads Mapped reads (reads having a non-null `alignment`) can be aligned to either the forward or the reverse strand of their associated reference. Strandedness of a mapped read is encoded by `alignment.position.reverseStrand`. If we consider the reference to be a forward-stranded coordinate space of `[0, reference.length)` with `0` as the left-most position and `reference.length` as the right-most position, reads are always aligned left to right. That is, `alignment.position.position` always refers to the left-most reference coordinate and `alignment.cigar` describes the alignment of this read to the reference from left to right. All per-base fields such as `alignedSequence` and `alignedQuality` share this same left-to-right orientation; this is true of reads which are aligned to either strand. For reverse-stranded reads, this means that `alignedSequence` is the reverse complement of the bases that were originally reported by the sequencing machine. ### Generating a reference-aligned sequence string When interacting with mapped reads, it's often useful to produce a string representing the local alignment of the read to reference. The following pseudocode demonstrates one way of doing this: out = "" offset = 0 for c in read.alignment.cigar { switch c.operation { case "ALIGNMENT_MATCH", "SEQUENCE_MATCH", "SEQUENCE_MISMATCH": out += read.alignedSequence[offset:offset+c.operationLength] offset += c.operationLength break case "CLIP_SOFT", "INSERT": offset += c.operationLength break case "PAD": out += repeat("*", c.operationLength) break case "DELETE": out += repeat("-", c.operationLength) break case "SKIP": out += repeat(" ", c.operationLength) break case "CLIP_HARD": break } } return out ### Converting to SAM's CIGAR string The following pseudocode generates a SAM CIGAR string from the `cigar` field. Note that this is a lossy conversion (`cigar.referenceSequence` is lost). cigarMap = { "ALIGNMENT_MATCH": "M", "INSERT": "I", "DELETE": "D", "SKIP": "N", "CLIP_SOFT": "S", "CLIP_HARD": "H", "PAD": "P", "SEQUENCE_MATCH": "=", "SEQUENCE_MISMATCH": "X", } cigarStr = "" for c in read.alignment.cigar { cigarStr += c.operationLength + cigarMap[c.operation] } return cigarStr

Used in: SearchReadsResponse, StreamReadsResponse

message ReadGroup

readgroup.proto:29

A read group is all the data that's processed the same way by the sequencer.

Used in: ReadGroupSet

message ReadGroup.Experiment

readgroup.proto:30

Used in: ReadGroup

message ReadGroup.Program

readgroup.proto:50

Used in: ReadGroup

message ReadGroupSet

readgroupset.proto:40

A read group set is a logical collection of read groups, which are collections of reads produced by a sequencer. A read group set typically models reads corresponding to one sample, sequenced one way, and aligned one way. * A read group set belongs to one dataset. * A read group belongs to one read group set. * A read belongs to one read group. For more genomics resource definitions, see [Fundamentals of Google Genomics](https://cloud.google.com/genomics/fundamentals-of-google-genomics)

Used as response type in: ReadServiceV1.GetReadGroupSet, ReadServiceV1.UpdateReadGroupSet

Used as field type in: SearchReadGroupSetsResponse, UpdateReadGroupSetRequest

message Reference

references.proto:110

A reference is a canonical assembled DNA sequence, intended to act as a reference coordinate space for other genomic annotations. A single reference might represent the human chromosome 1 or mitochandrial DNA, for instance. A reference belongs to one or more reference sets. For more genomics resource definitions, see [Fundamentals of Google Genomics](https://cloud.google.com/genomics/fundamentals-of-google-genomics)

Used as response type in: ReferenceServiceV1.GetReference

Used as field type in: SearchReferencesResponse

message ReferenceBound

variants.proto:601

ReferenceBound records an upper bound for the starting coordinate of variants in a particular reference.

Used in: VariantSet

message ReferenceSet

references.proto:145

A reference set is a set of references which typically comprise a reference assembly for a species, such as `GRCh38` which is representative of the human genome. A reference set defines a common coordinate space for comparing reference-aligned experimental data. A reference set contains 1 or more references. For more genomics resource definitions, see [Fundamentals of Google Genomics](https://cloud.google.com/genomics/fundamentals-of-google-genomics)

Used as response type in: ReferenceServiceV1.GetReferenceSet

Used as field type in: SearchReferenceSetsResponse

message Transcript

annotations.proto:423

A transcript represents the assertion that a particular region of the reference genome may be transcribed as RNA.

Used in: Annotation

message Transcript.CodingSequence

annotations.proto:451

Used in: Transcript

message Transcript.Exon

annotations.proto:424

Used in: Transcript

message Variant

variants.proto:473

A variant represents a change in DNA sequence relative to a reference sequence. For example, a variant could represent a SNP or an insertion. Variants belong to a variant set. For more genomics resource definitions, see [Fundamentals of Google Genomics](https://cloud.google.com/genomics/fundamentals-of-google-genomics) Each of the calls on a variant represent a determination of genotype with respect to that variant. For example, a call might assign probability of 0.32 to the occurrence of a SNP named rs1234 in a sample named NA12345. A call belongs to a call set, which contains related calls typically from one sample.

Used as response type in: VariantServiceV1.CreateVariant, VariantServiceV1.GetVariant, VariantServiceV1.UpdateVariant

Used as field type in: CreateVariantRequest, MergeVariantsRequest, SearchVariantsResponse, StreamVariantsResponse, UpdateVariantRequest

message VariantAnnotation

annotations.proto:271

Used in: Annotation

message VariantAnnotation.ClinicalCondition

annotations.proto:272

Used in: VariantAnnotation

enum VariantAnnotation.ClinicalSignificance

annotations.proto:356

Used in: VariantAnnotation

enum VariantAnnotation.Effect

annotations.proto:317

Used in: VariantAnnotation

enum VariantAnnotation.Type

annotations.proto:288

Used in: VariantAnnotation

message VariantCall

variants.proto:528

A call represents the determination of genotype with respect to a particular variant. It may include associated information such as quality and phasing. For example, a call might assign a probability of 0.32 to the occurrence of a SNP named rs1234 in a call set with the name NA12345.

Used in: Variant

message VariantSet

variants.proto:428

A variant set is a collection of call sets and variants. It contains summary statistics of those contents. A variant set belongs to a dataset. For more genomics resource definitions, see [Fundamentals of Google Genomics](https://cloud.google.com/genomics/fundamentals-of-google-genomics)

Used as response type in: VariantServiceV1.CreateVariantSet, VariantServiceV1.GetVariantSet, VariantServiceV1.UpdateVariantSet

Used as field type in: CreateVariantSetRequest, SearchVariantSetsResponse, UpdateVariantSetRequest

message VariantSetMetadata

variants.proto:381

Metadata describes a single piece of variant call metadata. These data include a top level key and either a single value string (value) or a list of key-value pairs (info.) Value and info are mutually exclusive.

Used in: VariantSet

enum VariantSetMetadata.Type

variants.proto:382

Used in: VariantSetMetadata