Page tree
Skip to end of metadata
Go to start of metadata

You are viewing an old version of this page. View the current version.

Compare with Current View Page History

« Previous Version 3 Next »


id

String

The read alignment ID. This ID is unique within the read group this alignment belongs to. For performance reasons, this field may be omitted by a backend. If provided, its intended use is to make caching and UI display easier for genome browsers and other lightweight clients.



read_group_id

String

The ID of the read group this read belongs to. Every read must belong to exactly one read group.



fragment_name

String

The fragment name. Equivalent to QNAME (query template name) in SAM.



improper_placement

boolean

The orientation and the distance between reads from the fragment are inconsistent with the sequencing protocol (inverse of SAM flag 0x2).



duplicate_fragment 

boolean

The fragment is a PCR or optical duplicate (SAM flag 0x400).



number_reads

int

The number of reads in the fragment (extension to SAM flag 0x1).



fragment_length

int

The observed length of the fragment, equivalent to TLEN in SAM.



read_number

int

he read ordinal in the fragment, 0-based and less than numberReads. This field replaces SAM flag 0x40 and 0x80 and is intended to more cleanly represent multiple reads per fragment.



failed_vendor_quality_checks

boolean

The read fails platform or vendor quality checks (SAM flag 0x200).



alignment

LinearAlignment

A linear alignment describes the alignment of a read to a Reference, using a position and CIGAR array

position

Position

The position of this alignment: an unoriented base in some Reference. A Position is represented by a reference name and a base number on that reference (0-based).",

referenceName

String

The name of the Reference on which the Position is located.

position

long

The 0-based offset from the start of the forward strand for that Reference. Genomic positions are non-negative integers less than Reference length.

strand

Strand

Indicates the DNA strand associate for some data item.

NEG_STRANDThe negative (-) strand.
POS_STRANDThe postive (+) strand.

mappingQuality

int

The mapping quality of this alignment, meaning the likelihood that the read maps to this position. Specifically, this is -10 log10 Pr(mapping position is wrong), rounded to the nearest integer.

cigar

List<CigarUnit>

A list of structures for an instance of a CIGAR operation.

operation

CigarOperation

An enum for the different types of CIGAR alignment operations that exist.\\nUsed wherever CIGAR alignments are used. The different enumerated values\\nhave the following usage:

ALIGNMENT_MATCH
INSERT
DELETE
SKIP
CLIP_SOFT
CLIP_HARD
PAD
SEQUENCE_MATCH
SEQUENCE_MISMATCH



* `ALIGNMENT_MATCH`: An alignment match indicates that a sequence can be\\n aligned to the reference without evidence of an INDEL. Unlike the\\n `SEQUENCE_MATCH` and `SEQUENCE_MISMATCH` operators, the `ALIGNMENT_MATCH`\\n operator does not indicate whether the reference and read sequences are an\\n exact match. This operator is equivalent to SAM's `M`.\\n* `INSERT`: The insert operator indicates that the read contains evidence of\\n bases being inserted into the reference. This operator is equivalent to\\n SAM's `I`.\\n* `DELETE`: The delete operator indicates that the read contains evidence of\\n bases being deleted from the reference. This operator is equivalent to\\n SAM's `D`.\\n* `SKIP`: The skip operator indicates that this read skips a long segment of\\n the reference, but the bases have not been deleted. This operator is\\n commonly used when working with RNA-seq data, where reads may skip long\\n segments of the reference between exons. This operator is equivalent to\\n SAM's 'N'.\\n* `CLIP_SOFT`: The soft clip operator indicates that bases at the start/end\\n of a read have not been considered during alignment. This may occur if the\\n majority of a read maps, except for low quality bases at the start/end of\\n a read. This operator is equivalent to SAM's 'S'. Bases that are soft clipped\\n will still be stored in the read.\\n* `CLIP_HARD`: The hard clip operator indicates that bases at the start/end of\\n a read have been omitted from this alignment. This may occur if this linear\\n alignment is part of a chimeric alignment, or if the read has been trimmed\\n (e.g., during error correction, or to trim poly-A tails for RNA-seq). This\\n operator is equivalent to SAM's 'H'.\\n* `PAD`: The pad operator indicates that there is padding in an alignment.\\n This operator is equivalent to SAM's 'P'.\\n* `SEQUENCE_MATCH`: This operator indicates that this portion of the aligned\\n sequence exactly matches the reference (e.g., all bases are equal to the\\n reference bases). This operator is equivalent to SAM's '='.\\n* `SEQUENCE_MISMATCH`: This operator indicates that this portion of the\\n aligned sequence is an alignment match to the reference, but a sequence\\n mismatch (e.g., the bases are not equal to the reference). This can\\n indicate a SNP or a read error. This operator is equivalent to SAM's 'X'.\",\"symbols\":[\"ALIGNMENT_MATCH\",\"INSERT\",\"DELETE\",\"SKIP\",\"CLIP_SOFT\",\"CLIP_HARD\",\"PAD\",\"SEQUENCE_MATCH\",\"SEQUENCE_MISMATCH\"]},\"doc\":\"The operation type.\"},

operationLength

long


referenceSequence

String




f

boolean




f

boolean




f

boolean




f

boolean




type

VariantType

Type of variant, the accepted types and Sequence Ontology (SO) terms are:

SNVSO:0001483
SNPSO:0000694
MNVSO:0002007
MNPSO:0001013
INDELSO:1000032
INSERTIONSO:0000667
DELETIONSO:0000159
TRANSLOCATIONSO:0000199
INVERSIONSO:1000036
CNVSO:0001019
DUPLICATIONSO:1000035
BREAKENDNA
SYMBOLICNA


  • No labels