Description
NOTE:
This track is intended for use primarily by physicians and other
professionals concerned with genetic disorders, by genetics researchers, and
by advanced students in science and medicine. While the genome browser database
is open to the public, users seeking information about a personal medical or
genetic condition are urged to consult with a qualified physician for
diagnosis and for answers to personal questions.
This track shows the genomic positions of natural and artifical amino acid variants
in the UniProt/SwissProt database.
The data has been curated from scientific publications by the UniProt staff.
Display Conventions and Configuration
Genomic locations of UniProt/SwissProt variants are labeled with the amino acid
change at a given position and, if known, the abbreviated disease name. A
"?" is used if there is no disease annotated at this location, but the
protein is described as being linked to only a single disease in UniProt.
Mouse over a mutation to see the UniProt comments.
Artificially-introduced mutations are colored green and naturally-occurring variants are colored
red. For full information about a particular variant, click the "UniProt variant" linkout.
The "UniProt record" linkout lists all variants of a particular protein sequence.
The "Source articles" linkout lists the articles in PubMed that originally described
the variant(s) and were used as evidence by the UniProt curators.
Methods
UniProt sequences were aligned to RefSeq sequences first with BLAT, then lifted
to genome positions with pslMap. UniProt variants were parsed from the UniProt
XML file. The variants were then mapped to the genome through the alignment
using the pslMap program. This mapping approach
draws heavily on the LS-SNP pipeline by Mark Diekhans. The complete script is
part of the kent source tree and is located in src/hg/utils/uniprotMutations.
Data Access
The raw data can be explored interactively with the
Table Browser, or the
Data Integrator.
For automated analysis, the genome annotation is stored in a bigBed file that
can be downloaded from the
download server.
The underlying data file for this track is called spMut.bb. Individual
regions or the whole genome annotation can be obtained using our tool bigBedToBed
which can be compiled from the source code or downloaded as a precompiled binary
for your system. Instructions for downloading source code and binaries can be found
here.
The tool can also be used to obtain only features within a given range, for example:
bigBedToBed http://hgdownload.soe.ucsc.edu/gbdb/hg38/bbi/uniprot/spMut.bb -chrom=chr6 -start=0 -end=1000000 stdout
Please refer to our
mailing list archives
for questions, or our
Data Access FAQ
for more information.
Credits
This track was created by Maximilian Haeussler, with advice from Mark Diekhans and Brian Raney.
References
UniProt Consortium.
Reorganizing the protein space at the Universal Protein Resource (UniProt).
Nucleic Acids Res. 2012 Jan;40(Database issue):D71-5.
PMID: 22102590; PMC: PMC3245120
Yip YL, Scheib H, Diemand AV, Gattiker A, Famiglietti LM, Gasteiger E, Bairoch A.
The Swiss-Prot variant page and the ModSNP database: a resource for sequence and structure
information on human protein variants.
Hum Mutat. 2004 May;23(5):464-70.
PMID: 15108278
|