Recommendations for the description of protein sequence variants


Last modified May 12, 2007

Since references to WWW-sites are not yet acknowledged as citations, please mention den Dunnen JT and Antonarakis SE (2000). Hum.Mutat. 15:7-12 when referring to these pages.


Contents


Protein level

(suggestions extending the published recommendations in italics)


Designations at protein level describe the consequence of a change, the origin lies at DNA-level, and they will be rarely experimentally verified. Sequence changes at protein level are described as those at the DNA level with the following modifications / additions;


Silent changes

Description of so called "silent" changes in the format p.Leu54Leu (or p.L54L) is not allowed; descriptions should be given at DNA level. The description at protein level is not informative and not unequivocal (there are five possibilities at DNA level which may underlie p.Leu54Leu). A correct description has the format c.162C>G (p.=), with "p.=" indicating that there is no effect on protein level expected (see Discussion).


Substitutions

Substitutions are designated by a ">"-character (indicating "changes to").


Deletions

Deletions are designated by "del" after an indication of the first and last amino acid(s) deleted.


Duplications

Duplications are designated by "dup" after an indication of the first and last amino acid(s) duplicated.


Insertions

Insertions are designated by "ins" after an indication of the amino acids flanking the insertion site, followed by a description of the amino acid(s) inserted. Duplicating insertions should be described as duplications (see Discussion), not as insertion. For large insertions the number of inserted amino acids should be mentioned, together with an accession.version number referring to a sequence database file containing the complete inserted sequence. 


Variability of short sequence repeats

Variability of short sequence repeats are described as p.Gln6(3_6); the description indicates that a stretch of Glutamines (Gln, Q) is present, starting at amino acid position 6 (e.g. in MKMGHQQQCC), which is found with a variable length from 3 to 6 in the population.
NOTE: the underscore is used to indicate the range (3 to 6 times).


Insertion-deletions (indels)

Insertion/deletions (indels) are described as a deletion followed by an insertion after an indication of the amino acid(s) flanking the site of the insertion/deletion (see Discussion).


Frame shifts

Frame shifting mutations are designated by "fs" after the amino acid(s) affected by the change. Descriptions either use a short ("fs") or long ("fsX#") description. The description of frame shifts does not include the deletion at protein level from the site of the frame shift to the natural end of the protein (stop codon).
NOTE: typing error in den "Dunnen & Antonarakis (2000)". The suggestion to use ">" to indicate "delins" in frame shift descriptions has been retracted.
NOTE:
also for frame shifts the changes observed should be described on protein level and not try to incorporate any knowledge regarding the change at DNA-level (see above). Thus, p.His150HisfsX10 is not correct, but p.Gln151ThrfsX9 is.


More changes in one individual

Two or more changes in one individual are described by combining the changes, per allele (chromosome) between brackets ("[]").

Changes in different alleles (e.g. in recessive diseases) are described as "[change allele 1]+[change allele 2]" (see Discussion). Two variations in one allele Two sequence changes with alleles unknown are described as "[change allele 1(+)change allele 2]" (see Disucssion).

| Top of page | MutNomen homepage | Check-list |
| Recommendations:  generalDNARNA, uncertain |
| Discussions | FAQ's | Codons / amino acids | History |
| Example descriptions:  QuickRef / symbolsDNARNAprotein |

Copyright © HGVS 2007 All Rights Reserved
Website Created by Rania Horaitis, Nomenclature by J.T. Den Dunnen - Disclaimer