Recommendations for the description of DNA sequence variants


Last modified June 15, 2007

Since references to WWW-sites are not yet acknowledged as citations, please mention den Dunnen JT and Antonarakis SE (2000). Hum.Mutat. 15: 7-12 when referring to these pages.


Contents


DNA level

(suggestions extending the published recommendations are in italics)



Substitutions

Single nucleotide substitutions are designated by a ">"-character (indicating "changes to"). Changes of two or more consecutive nucleotides are described as deletion/insertions (indels, see Deletion/insetions).

NOTE: it is not correct to describe polymorphic variants as c.76A/G (see Discussion)


Deletions

Deletions are designated by "del" after an indication of the first and last nucleotide(s) deleted.


Duplications, triplications, ...

Duplications are designated by "dup" after an indication of the first and last nucleotide(s) duplicated.


Insertions

Insertions are designated by "ins" after an indication of the nucleotides flanking the insertion site, followed by a description of the nucleotides inserted. Duplicating insertions should be described as duplications (see Discussion), not as insertion. For large insertions the number of inserted nucleotides should be mentioned, together with an accession.version number referring to a sequence database file containing the complete inserted sequence. 


Variability of short sequence repeats

Variability of short sequence repeats (e.g. ATGCGATGTGTGCC) are described as c.123+74TG(3_6); c.123+74 indicates the start of the first nucelotide of the variable repeat (not the end like c.123+79TG) and TG indicates the sequence of the repeat unit.
NOTE: the underscore is used to indicate the range (3 to 6 times); when the repeat-sequence becomes too large its size is indicated, not its range.


Deletion / insertions (indels)

Deletion/insertions (indels) are described as a deletion followed by an insertion after an indication of the nucleotides flanking the site of the deletion/insertion (see Discussion). Changes of two or more consecutive nucleotides are described as deletion/insertions (indels).


Inversions

Inversions are designated by "inv" after an indication of the first and last nucleotides affected by the inversion.


Conversions

Conversions are designated by "con" after an indication of the first and last nucleotides affected by the conversion, followed by a description of the origin of the new nucleotides (see Discussion).


Translocations

Translocations are described at the molecular level using the format "t(X;4)(p21.2;q34)", followed by the usual numbering, indicating the position translocation breakpoint. The sequences of the translocation breakpoints need to be submitted to a sequence database (Genbank, EMBL, DDJB) and the accession.version numbers should be given (see Discussion).


More changes in one individual

Two or more changes in one individual are described by combining the changes, per allele (chromosome) between square brackets ("[]").

Changes in different alleles (e.g. in recessive diseases) are described as "[change allele 1]+[change allele 2]". Mixed descriptions like c.[76A>C]+g.[91C>G] should not be used. Two variations in one allele, separated by at least one nucleotide, are described as "[first change; second change]". Consequently, the description c.76_77delinsTT is preferred over c.[76A>T; 77G>T]. For the description of haplotypes see Discussion. Mosaic cases - two different nucleotides in one position are described as "[=, nucleotide 2]" (see FAQ). Two sequence changes with alleles unknown are described as "[change allele 1(+)change allele 2]" (see FAQ).

Complex changes

Sequence changes can be very complex, involving several changes at a specific location. The description of such changes using the recommendations given above can become rather complicated and at some point, although literally correct, effectively meaningless. In such cases the recommendation is to submit the sequence that has been determined to GenBank and to use the accession.version number in the description.


| Top of page | MutNomen homepage | Check-list |
| Recommendations: generalRNAprotein, uncertain |
| Discussions | FAQ's | Codons / amino acids | History |
| Example descriptions:  QuickRef / symbolsDNARNAprotein |

Copyright © HGVS 2007 All Rights Reserved
Website Created by Rania Horaitis, Nomenclature by J.T. Den Dunnen - Disclaimer