Introduction

Number of entries
New entries 1,479,097
Updated entries 19,802,872
Unchanged entries 94,396,842
Total 115,678,811
Entries with updated sequences 4,598
With a fragmented AA sequence 10,797,121
With known alternative products 0
Protein Existence (PE) Number of entries
1 Evidence at protein level 142,928
2 Evidence at transcript level 1,180,723
3 Inferred from homology 29,227,113
4 Predicted 85,128,047
5 Uncertain 0
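
The protein existence counts above can be checked against the total number of entries, and each level's share follows by simple division. The sketch below is illustrative only: the names `pe_counts`, `TOTAL_ENTRIES` and `pe_shares` are hypothetical and are not part of any UniProt tooling; the numbers are copied from the tables above.

```python
# Counts copied from the Protein Existence (PE) table above.
pe_counts = {
    "Evidence at protein level": 142_928,
    "Evidence at transcript level": 1_180_723,
    "Inferred from homology": 29_227_113,
    "Predicted": 85_128_047,
    "Uncertain": 0,
}

# Total copied from the "Number of entries" table above.
TOTAL_ENTRIES = 115_678_811


def pe_shares(counts):
    """Return each PE level's share of all entries, in percent."""
    total = sum(counts.values())
    return {level: 100.0 * n / total for level, n in counts.items()}


if __name__ == "__main__":
    # The five PE levels should partition the full set of entries.
    assert sum(pe_counts.values()) == TOTAL_ENTRIES
    for level, pct in pe_shares(pe_counts).items():
        print(f"{level:<30} {pct:6.2f}%")
```

Under these figures, predicted proteins make up roughly three quarters of all entries.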

Taxonomic Origin


Statistics on the number of species

Number of species in
New entries 31,826
Updated entries 396,083
Unchanged entries 360,812
Total 642,203

Sequence data

The shortest sequence is C4PYW0 at 2 AA, while the longest sequence is A0A1V4K6M4 at 36,991 AA.

Some annotation statistics

General Annotation (comments)

Annotations Entries
Allergenic properties 0 0
Alternative products 0 0
Biophysicochemical properties 0 0
Biotechnological use 0 0
Catalytic activity 13,707,102 12,455,861
Caution 63,215,657 61,738,658
Cofactor 9,787,031 0
Developmental stage 0 0
Involvement in disease 0 0
Disruption phenotype 0 0
Domain 1,199,457 917,269
Enzyme regulation 330,948 330,946
Function 16,024,944 15,126,438
Induction 84,605 84,605
Mass spectrometry 0 0
Miscellaneous 701,792 632,250
Pathway 6,969,173 6,268,730
Pharmaceutical use 0 0
Polymorphism 0 0
Post-translational modification 1,192,853 889,261
RNA Editing 0 0
Sequence caution 0 0
Sequence similarities 29,375,998 28,972,865
Subcellular Location 0 0
Subunit structure 8,351,688 8,245,142
Tissue specificity 0 0
Toxic dose 0 0

Sequence Annotation (features)

Annotations Entries
Molecule processing 17,308,940 8,683,875
Chain 8,630,770 8,619,536
Initiator methionine 53,114 53,114
Peptide 694 429
Propeptide 16,705 16,705
Signal peptide 8,607,528 8,607,518
Transit peptide 129 129
Regions 228,749,101 78,426,574
Calcium binding 254,383 125,947
Coiled-coil 17,475,498 11,657,162
Compositional bias 4,516 4,516
DNA binding 3,002,420 2,660,736
Domain 83,811,578 60,454,686
Motif 1,531,174 1,037,683
Nucleotide binding 6,708,646 4,248,048
Repeat 4,582,786 1,098,429
Region 5,342,775 2,742,637
Topological domain 318,157 148,431
Transmembrane 105,303,032 23,132,017
Zinc finger 413,019 326,152
Sites 37,171,240 8,146,188
Active site 7,248,099 4,355,583
Metal binding 12,507,727 3,329,990
Binding site 15,400,592 3,975,197
Other 2,014,822 1,218,133
Amino acid modifications 4,918,442 2,649,894
Cross-link 31,671 29,680
Disulfide bond 2,087,605 551,667
Glycosylation 21,133 20,028
Lipidation 355,038 203,595
Modified residue 2,418,297 2,112,182
Non-standard residue 4,698 4,505
Experimental info 16,445,762 10,872,243
Mutagenesis 0 0
Non-adjacent residues 0 0
Non-terminal residue 16,360,471 10,844,864
Sequence conflict 0 0
Sequence uncertainty 85,291 71,125
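
In the table above, several group rows appear to be the sum of the feature rows listed beneath them, at least in the "Annotations" column. The sketch below checks this for three groups; the names `feature_groups` and `group_discrepancy` are hypothetical, and the numbers are copied from the table. The "Entries" column is not expected to add up in the same way, presumably because a single entry can carry several feature types.

```python
# Annotation counts (first numeric column) copied from the table above.
# Each group maps to (listed group total, {feature: annotation count}).
feature_groups = {
    "Molecule processing": (17_308_940, {
        "Chain": 8_630_770,
        "Initiator methionine": 53_114,
        "Peptide": 694,
        "Propeptide": 16_705,
        "Signal peptide": 8_607_528,
        "Transit peptide": 129,
    }),
    "Sites": (37_171_240, {
        "Active site": 7_248_099,
        "Metal binding": 12_507_727,
        "Binding site": 15_400_592,
        "Other": 2_014_822,
    }),
    "Experimental info": (16_445_762, {
        "Mutagenesis": 0,
        "Non-adjacent residues": 0,
        "Non-terminal residue": 16_360_471,
        "Sequence conflict": 0,
        "Sequence uncertainty": 85_291,
    }),
}


def group_discrepancy(listed_total, parts):
    """Listed group total minus the sum of its feature rows (0 if consistent)."""
    return listed_total - sum(parts.values())


if __name__ == "__main__":
    for name, (listed_total, parts) in feature_groups.items():
        print(f"{name}: discrepancy = {group_discrepancy(listed_total, parts)}")
```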

Citation usage

Citation type Citations Entries
Submission 0 0
Journal article 0 0
Book 0 0
Thesis 0 0
Patent 0 0
Unpublished observations 0 0
Online journal article 0 0

Additional automatically mapped literature

Citation type Citations Entries
Journal articles 721,347 452,627

For information about which journals are used in citing or mapping to UniProtKB see the journals section.

Database Cross-Reference Statistics

Database Entities linked to Entries
Sequence databases
EMBL 0 111,935,062
PIR 0 130,479
RefSeq 0 43,622,632
UniGene 0 749,059
3D structure databases
DisProt 0 96
PDB 0 18,031
PDBsum 0 16,797
ProteinModelPortal 0 7,292,512
SMR 0 1,219,907
Protein-protein interaction databases
CORUM 0 114
DIP 0 3,219
ELM 0 110
IntAct 0 26,148
MINT 0 2,463
STRING 0 6,487,134
Chemistry
BindingDB 0 264
ChEMBL 0 882
DrugBank 0 449
GuidetoPHARMACOLOGY 0 4
SwissLipids 0 81
Protein family/group databases
Allergome 0 3,156
CAZy 0 120,976
ESTHER 0 74,750
MEROPS 0 245,684
MoonProt 0 67
PeroxiBase 0 2,468
REBASE 0 31,675
TCDB 0 8,140
mycoCLAP 0 447
PTM databases
CarbonylDB 0 266
GlyConnect 0 13
PhosphoSitePlus 0 2,254
SwissPalm 0 2,058
UniCarbKB 0 17
iPTMnet 0 5,160
Polymorphism and mutation databases
2D gel databases
COMPLUYEAST-2DPAGE 0 4
OGP 0 3
REPRODUCTION-2DPAGE 0 62
SWISS-2DPAGE 0 1
World-2DPAGE 0 311
Proteomic databases
EPD 0 13,095
MaxQB 0 41,701
PRIDE 0 333,174
PaxDb 0 329,928
PeptideAtlas 0 131,317
ProMEX 0 2,526
TopDownProteomics 0 280
Protocols and materials databases
DNASU 0 40,881
Genome annotation databases
Ensembl 0 1,811,222
EnsemblBacteria 0 37,469,046
EnsemblFungi 0 6,084,607
EnsemblMetazoa 0 1,100,202
EnsemblPlants 0 1,984,173
EnsemblProtists 0 1,754,316
GeneDB 0 112,896
GeneID 0 10,569,201
Gramene 0 1,984,173
KEGG 0 15,477,418
PATRIC 0 17,719,854
UCSC 0 93,170
VectorBase 0 559,616
WBParaSite 0 845,705
Organism-specific databases
ArachnoServer 0 201
Araport 0 15,193
CGD 0 20,739
CTD 0 979,432
ConoServer 0 160
EuPathDB 0 678,264
FlyBase 0 211,437
GeneCards 0 1,352
H-InvDB 0 441
HGNC 0 50,479
LegioList 0 2,483
Leproma 0 1,269
MGI 0 61,083
MIM 0 4
MalaCards 0 12
OpenTargets 0 48,542
PharmGKB 0 3,146
PomBase 0 31
PseudoCAP 0 4,445
RGD 0 21,725
SGD 0 7
TAIR 0 11,859
TubercuList 0 1,001
VGNC 0 77,336
WormBase 0 55,504
Xenbase 0 34,259
ZFIN 0 53,081
dictyBase 0 7,765
euHCVdb 0 75,264
Phylogenomic databases
GeneTree 0 1,790,061
HOGENOM 0 3,024,074
HOVERGEN 0 300,484
InParanoid 0 2,377,791
KO 0 6,906,283
OMA 0 6,897,318
OrthoDB 0 14,352,207
PhylomeDB 0 461,627
TreeFam 0 563,509
eggNOG 0 6,997,151
Enzyme and pathway databases
BRENDA 0 9,285
BioCyc 0 6,067,531
Reactome 0 101,809
SABIO-RK 0 614
SIGNOR 0 8
SignaLink 0 3,799
UniPathway 0 6,247,467
Other
ChiTaRS 0 131,701
EvolutionaryTrace 0 5,948
GenomeRNAi 0 30,067
PMAP-CutDB 0 130
PRO 0 2,196
Gene expression databases
Bgee 0 537,235
CollecTF 0 200
ExpressionAtlas 0 679,897
Genevisible 0 15,865
Ontologies
Family and domain databases
CDD 0 18,459,573
Gene3D 0 42,700,717
HAMAP 0 13,010,271
InterPro 0 92,053,642
PANTHER 0 22,605,250
PIRSF 0 10,273,455
PRINTS 0 14,021,983
PROSITE 0 39,578,538
Pfam 0 84,071,820
ProDom 0 1,617,658
SFLD 0 557,642
SMART 0 21,329,379
SUPFAM 0 60,673,144
TIGRFAMs 0 22,726,322

Web resource

0 UniProtKB/TrEMBL entries have at least one link to a webpage of general interest on the protein.

Amino acid distribution statistics

    • Aliphatic
    • Acidic
    • Small hydroxy
    • Basic
    • Amide
    • Aromatic
    • Sulfur

Miscellaneous Statistics

1,808,919 entries are encoded on a mitochondrion, and 733,597 are encoded on a plasmid.

741,887 entries are encoded on a plastid, of which 785 are encoded on apicoplasts, 618,476 on chloroplasts, 1 on organellar chromatophores, 8 on cyanelles, 1,521 on non-photosynthetic plastids and 3,190 on unspecified types of plastid.

    UniProt is an ELIXIR core data resource
    Main funding by: National Institutes of Health