Top

Mutations can be stated as a protein change (HGVS) or as a chromosomal change (HGVS, GTF or VCF). Currently, the system only accepts HG19 coordinates. Copy number changes need to state the type of alteration (CNA), and translocations need to state both partners (TRA).

HGVS - protein

Transcript identifier (Ensembl or RefSeq) or name of the gene (HUGO symbol) plus the aminoacid change following HGVS format.

 

missense
NM_005157:p.T315I
stop
TSC2:p.Q1178*
in-frame insertion
ENST00000326724:p.P1331_A1332insTP
in-frame deletion
TP53:p.I254_T256delIIT
TP53:p.I254_T256del3
TP53:p.I254_T256del
frameshift
APC:p.I1557fs*30

The longitude of the frameshift (till the new reading frame ends in a stop codon) needs to be stated to retrieve the corresponding nucleotide change, which is used to calculate certain metrics used by the CGI; if not available, APC:p.I1557fs is also allowed)

If you upload a file, the protein change column needs to be labeled as protein in the header. The sample identifier can be also included in a column labeled sample separated using a <tab> character. Any other column will be ignored but included in the output.

HGVS - nucleotide

The chromosome, genome position and reference/alternative alleles (annotated in the positive strand) following HGVS format.

 

point mutations
chr3:g.178936091G>A
block substitution
chr3:g.41266066TG>AA
chr3:g.41266066_41266067delinsAA
insertions
chr5:g.170837546_170837547insCTGT
chr5:g.170837545C>CTCTG
duplications
chr3:g.30732988_30732989dupTG
chr3:g.30732988_30732989delinsTGTG
chr3:30732989_30732990insTG
deletions
chr2:g.234183368_234183372delACTCA
chr2:g.234183368_234183372del
chr2:g.234183368_234183372del5
complex indels
chr10:g.52595929_52595931delGGGinsTA
chr10:g.52595929_52595931delinsTA

If you upload a file, the nucleotide change column needs to be labeled as gdna in the header. The sample identifier can be also included in a column labeled sample separated using a <tab> character. Any other column will be ignored but included in the output. Only hg19 annotation is currenlty supported.

Genomic tabular format

The chromosome, genomic position and reference/alternative alleles (annotated in the positive strand) separated in different columns.

 

point mutation
chr17 7578412 A G
block substitution
chr11 533873 CT AC
insertion
chr17 37881002 G GGGCTCCCCA
chr17 37881003 - GGCTCCCCA
deletion
chr17 37880218 GTTGAGGGAAAACACA G
chr17 37880219 TTGAGGGAAAACACA -
complex indel
chr7 140453155 CA TCC

If you upload a file, the columns need to be labeled as chr, pos, ref and alt and separated using a <tab> character. The sample identifier can be also included in a column labeled sample. Any other column will be ignored but included in the output. Only hg19 annotation is currenlty supported.

VCF - Variant Call Format

The VCF format is accepted only if you upload a file. Check the format details in this link

 

Only hg19 annotation is currenlty supported.

CNA - Copy Number Alteration

The gene plus the alteration type separated by the colon character.

 

amplification
ERBB2:amp
deletion
TP53:del

If you upload a file, this information needs to be stated in two different columns labeled as gene and cna in the header, respectively, separated using a <tab> character. The valid entries for the cna column are etiher amp or del (case insensitive). The sample identifier can be also included in a column labeled sample. Any other column will be ignored but included in the output.

TRA - Translocations

Translocations are stated by the two partners separated with the '__' character (two underscores). The translocations are processed regardless of the order of the partners.

 

translocation
BCR__ABL1 (equivalent to ABL1__BCR)
translocation
PML__RARA (equivalent to RARA__PML)

If you upload a file, the fusion partners column need to be labeled as fus in the header of a tab-separated file. The sample identifier can be also included in a column labeled sample. Any other column will be ignored but included in the output.