Retrieve / ID mapping

Last modified June 2, 2017


Select the Retrieve/ID mapping tab of the toolbar and enter or upload a list of identifiers (or gene names) to do one of the following:

  • Retrieve the corresponding UniProt entries to download them or work with them on this website.
  • Convert identifiers of a different type to UniProt identifiers, or vice versa, and download the identifier lists.

How to use this tool

  1. Enter identifiers or upload them from a file, separated by a space or a new line, into the form field, for example: P31946 P62258 ALBU_HUMAN
  2. If you need to convert to another identifier type (as performed previously by the “ID mapping” service), select the source and target type from the “From/To” dropdown menus under “Options”. Otherwise, to retrieve or download a list of UniProtKB entries, keep the default selection of these menus (from UniProtKB AC/ID to UniProtKB).
  3. Click the Go button.
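
When the identifiers come from a spreadsheet export or a text file, it can help to normalise them before pasting or uploading. The following is a minimal sketch, not part of the UniProt website: the function name `parse_identifiers` is hypothetical, and removing duplicates is a convenience assumed here rather than a requirement of the tool.

```python
def parse_identifiers(text):
    """Split raw input on spaces or new lines and drop repeated identifiers.

    The order of first appearance is preserved. Deduplication is an
    assumption made for convenience, not documented behaviour of the tool.
    """
    seen = set()
    identifiers = []
    for token in text.split():  # split() handles spaces and new lines alike
        if token not in seen:
            seen.add(token)
            identifiers.append(token)
    return identifiers


raw = "P31946 P62258\nALBU_HUMAN\nP31946"
# One identifier per line is one of the two accepted separators.
print("\n".join(parse_identifiers(raw)))
```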

The following kinds of UniProt identifiers are supported:

Dataset     Example identifier   Description
UniProtKB   P00750               UniProtKB entry
            P00750-2             UniProtKB entry isoform sequence
            P00750[39-81]        UniProtKB sequence range
            A4_HUMAN             UniProtKB entry name
UniParc     UPI0000000001        UniParc entry
UniRef      UniRef100_P00750     UniRef entry
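
To check a list locally before submitting it, the identifier kinds above can be recognised with regular expressions. This is a sketch under stated assumptions: the accession pattern follows the accession number format published in the UniProt help, while the entry name, UniParc and UniRef patterns are simplified approximations inferred from the examples above. The names `ACCESSION`, `PATTERNS` and `classify` are hypothetical.

```python
import re

# UniProtKB accession number pattern (6 or 10 characters).
ACCESSION = (
    r"(?:[OPQ][0-9][A-Z0-9]{3}[0-9]"
    r"|[A-NR-Z][0-9](?:[A-Z][A-Z0-9]{2}[0-9]){1,2})"
)
# Assumed UniParc shape: "UPI" followed by 10 hexadecimal characters.
UNIPARC = r"UPI[0-9A-F]{10}"

# Each identifier is tested against these patterns in order.
PATTERNS = [
    ("UniProtKB entry isoform sequence", re.compile(ACCESSION + r"-[0-9]+")),
    ("UniProtKB sequence range", re.compile(ACCESSION + r"\[[0-9]+-[0-9]+\]")),
    ("UniProtKB entry", re.compile(ACCESSION)),
    # Simplified assumption: alphanumeric prefix, underscore, species code.
    ("UniProtKB entry name", re.compile(r"[A-Z0-9]{1,10}_[A-Z0-9]{1,5}")),
    ("UniParc entry", re.compile(UNIPARC)),
    ("UniRef entry",
     re.compile(r"UniRef(?:100|90|50)_(?:" + ACCESSION + "|" + UNIPARC + ")")),
]


def classify(identifier):
    """Return the description of the first matching kind, or None."""
    for label, pattern in PATTERNS:
        if pattern.fullmatch(identifier):
            return label
    return None


for example in ["P00750", "P00750-2", "P00750[39-81]", "A4_HUMAN",
                "UPI0000000001", "UniRef100_P00750"]:
    print(example, "->", classify(example))
```

An identifier for which `classify` returns `None` is not necessarily invalid: it may belong to an external database, in which case the source type has to be selected in the “From” menu.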

When mapping from a source database external to UniProt, you can submit any identifier as used in the UniProtKB cross-references. If your job is not successful and you are not sure which source database to use, try a text search in UniProtKB with one of your identifiers, and look at an example entry. Check the cross-reference section of that entry to find out which database uses these identifiers.

Further queries involving your UniProtKB data sets

After you have submitted your data, you are forwarded to a query result page showing the correspondence of submitted identifiers (from external databases, or obsolete UniProtKB identifiers) with current UniProtKB accession numbers. You can use the basket, download and align services as in any query result, as well as reconfigure the table layout (“Columns”) or add further constraints to your query.

Jobs have unique identifiers, which (depending on the job type) can be used in queries (e.g. to get the intersection of two sequence similarity searches). Job identifiers and the related data are kept for 7 days, and are then deleted.

Unmapped identifiers

The list of identifiers that could not be mapped can be retrieved for further inspection or analysis.

When mapping popular sequence database identifiers such as RefSeq, gi numbers, EMBL, EMBLCDS to UniProtKB, unmapped identifiers can be further mapped to UniParc. This can be particularly useful for proteins from redundant proteomes.
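
If you have downloaded the mapping as a two-column list, the unmapped identifiers described above can also be recovered locally by comparing it with what you submitted. The sketch below assumes the mapping has already been loaded into a dictionary; the function name `split_mapped` and the placeholder identifiers are hypothetical and do not come from any real result.

```python
def split_mapped(submitted, mapping):
    """Separate submitted identifiers into mapped and unmapped lists.

    `mapping` is assumed to be a dict from source identifier to a list of
    target identifiers, e.g. built from a downloaded two-column file.
    """
    mapped = [i for i in submitted if mapping.get(i)]
    unmapped = [i for i in submitted if not mapping.get(i)]
    return mapped, unmapped


# Placeholder identifiers for illustration only.
submitted = ["SOURCE_ID_1", "SOURCE_ID_2", "SOURCE_ID_3"]
mapping = {"SOURCE_ID_1": ["TARGET_ID_A"], "SOURCE_ID_3": ["TARGET_ID_B"]}

mapped, unmapped = split_mapped(submitted, mapping)
print("Unmapped:", unmapped)  # candidates for a second mapping to UniParc
```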

Programmatic access

Code examples for programmatic access to the database identifier mapping service are available on our help page Programmatic access – Mapping database identifiers.


See also: Related questions from our FAQ

Related terms: batch, bulk