ESM-2 API#

Zeeshan Siddiqui

Oct 18, 2023

6 min read

This page explains how to use ESM-2 to generate embeddings, contacts, attention maps, and logits, and to predict the residues at masked positions, and how to access these capabilities with the BioLM API.

Endpoints#

There are five BioLM endpoints, one for each of five different sizes of ESM-2 model. The model sizes, with M denoting millions of parameters and B denoting billions of parameters, are:

Embedding API Usage#

The encode action produces embeddings, contacts, attention maps, and logits. Appending ‘encode/’ to the model endpoints above gives access to these outputs.

Using the 650M model as an example, the encode endpoint is https://biolm.ai/api/v2/esm2-650m/encode/

Making Requests#

curl --location 'https://biolm.ai/api/v2/esm2-650m/encode/' \
--header "Authorization: Token $BIOLMAI_TOKEN" \
--header 'Content-Type: application/json' \
--data '{
    "params": {
        "include": [
            "mean",
            "contacts",
            "logits",
            "attentions"
        ]
    },
    "items": [
        {
            "sequence": "MAETAVINHKKRKNSPRIVQSNDLTEAAYSLSRDQKRMLYLFVDQIRKSDGTLQEHDGICEIHVAKYAEIFGLTSAEASKDIRQALKSFAGKEVVFYRPEEDAGDEKGYESFPWFIKRAHSPSRGLYSVHINPYLIPFFIGLQNRFTQFRLSETKEITNPYAMRLYESLCRYRKPDGSGIVSLKIDWIIERYQLPQSYQRMPDFRRRFLQVCVNEINSRTPMRLSYIEKKKGRQTTHIVFSFRDITSMTTG"
        }
    ]
}'
import os
import requests
import json

url = "https://biolm.ai/api/v2/esm2-650m/encode/"

payload = json.dumps({
    "params": {
        "include": [
            "mean",
            "contacts",
            "logits",
            "attentions"
        ]
    },
    "items": [
        {
            "sequence": "MAETAVINHKKRKNSPRIVQSNDLTEAAYSLSRDQKRMLYLFVDQIRKSDGTLQEHDGICEIHVAKYAEIFGLTSAEASKDIRQALKSFAGKEVVFYRPEEDAGDEKGYESFPWFIKRAHSPSRGLYSVHINPYLIPFFIGLQNRFTQFRLSETKEITNPYAMRLYESLCRYRKPDGSGIVSLKIDWIIERYQLPQSYQRMPDFRRRFLQVCVNEINSRTPMRLSYIEKKKGRQTTHIVFSFRDITSMTTG"
        }
    ]
})
headers = {
    'Authorization': 'Token {}'.format(os.environ["BIOLMAI_TOKEN"]),
    'Content-Type': 'application/json'
}

response = requests.request("POST", url, headers=headers, data=payload)

print(response.text)
import biolmai
seqs = ["MAETAVINHKKRKNSPRIVQSNDLTEAAYSLSRDQKRMLYLFVDQIRKSDGTLQEHDGICEIHVAKYAEIFGLTSAEASKDIRQALKSFAGKEVVFYRPEEDAGDEKGYESFPWFIKRAHSPSRGLYSVHINPYLIPFFIGLQNRFTQFRLSETKEITNPYAMRLYESLCRYRKPDGSGIVSLKIDWIIERYQLPQSYQRMPDFRRRFLQVCVNEINSRTPMRLSYIEKKKGRQTTHIVFSFRDITSMTTG"]

cls = biolmai.ESM2_650M()
resp = cls.encode(seqs, params={
        "include": [
            "mean",
            "contacts",
            "logits",
            "attentions"
        ]
    })
library(RCurl)
headers = c(
'Authorization' = paste('Token', Sys.getenv('BIOLMAI_TOKEN')),
"Content-Type" = "application/json"
)
payload = "{
    \"params\": {
        \"include\": [
            \"mean\",
            \"contacts\",
            \"logits\",
            \"attentions\"
        ]
    },
    \"items\": [
        {
            \"sequence\": \"MAETAVINHKKRKNSPRIVQSNDLTEAAYSLSRDQKRMLYLFVDQIRKSDGTLQEHDGICEIHVAKYAEIFGLTSAEASKDIRQALKSFAGKEVVFYRPEEDAGDEKGYESFPWFIKRAHSPSRGLYSVHINPYLIPFFIGLQNRFTQFRLSETKEITNPYAMRLYESLCRYRKPDGSGIVSLKIDWIIERYQLPQSYQRMPDFRRRFLQVCVNEINSRTPMRLSYIEKKKGRQTTHIVFSFRDITSMTTG\"
        }
    ]
}"
res <- postForm("https://biolm.ai/api/v2/esm2-650m/encode/", .opts=list(postfields = payload, httpheader = headers, followlocation = TRUE), style = "httppost")
cat(res)

JSON Response#

{
    "results": [
        {
            "sequence_index": 0,
            "mean_representations": {
                "33": [
                    0.005844539031386375,
                    -0.00489774439483881,
                    -0.007498568389564753,
                    ...
                ]
            },
            "contacts": [
                [
                    0.004600186832249165,
                    0.5025275349617004,
                    0.023159209638834,
                    ...
                ],
                ...
            ],
            "logits": [
                [
                    -0.8352559208869934,
                    -0.3333878219127655,
                    -1.3698017597198486,
                    ...
                ],
                ...
            ],
            "attentions": [
                [
                    0.00449674716219306,
                    0.003284697188064456,
                    0.003496115328744054,
                    ...
                ],
                ...
            ]
        }
    ]
}

Note

The above response is only a small snippet of the full JSON response. For every item in include there is a corresponding field in each dictionary in results, and each of these dictionaries corresponds to one of the items submitted.

Request Definitions#

items:

Inside items is a list of dictionaries, with each dictionary corresponding to one model input.

sequence:

The input sequence for the model

params:

These are additional parameters for the endpoint that are applied to every input in items. By default, the ESM-2 encode endpoints return only the mean ESM-2 embeddings extracted from the last layer of the model; modifying params allows other outputs, such as contacts, to be returned, or different representative layers to be selected for the embeddings.

repr_layers:

This parameter specifies the representative layer(s) of the ESM-2 model that embeddings are extracted from. If unspecified, it defaults to [-1] and returns embeddings/representations for that layer (-1 indexes the last layer, -2 the second to last).

include:

For the encode endpoint, the include param in params specifies which outputs to include in the response. These can be any of ‘logits’, ‘attentions’, ‘contacts’, ‘per_token’, ‘bos’, or ‘mean’. ‘per_token’, ‘bos’, and ‘mean’ are types of embeddings: ‘per_token’ returns the full model hidden states for every token at the representative layer(s) (specified with repr_layers); these full representations can be used for additional kinds of pooling, such as min or max pooling. ‘bos’ returns the hidden states for the ‘bos’ (beginning-of-sequence) token at the representative layer(s). ‘mean’ is the average of the ‘per_token’ representations at the representative layer(s), and is the default option if include is unspecified.
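As an illustration, the request below asks for per-token and mean embeddings from the last two layers by setting repr_layers alongside include. This is a minimal sketch following the Python requests pattern shown above; the shortened sequence is only a placeholder.

import os
import json
import requests

url = "https://biolm.ai/api/v2/esm2-650m/encode/"

# Ask for per-token and mean embeddings from the last two layers.
payload = json.dumps({
    "params": {
        "repr_layers": [-1, -2],
        "include": ["per_token", "mean"]
    },
    "items": [
        {"sequence": "MAETAVINHKKRKNSPRI"}  # shortened placeholder sequence
    ]
})

headers = {
    'Authorization': 'Token {}'.format(os.environ["BIOLMAI_TOKEN"]),
    'Content-Type': 'application/json'
}

response = requests.post(url, headers=headers, data=payload)
print(response.json()["results"][0].keys())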

Response Definitions#

results:

This is the main key in the JSON object that contains an array of model results. Each element in the array represents a set of predictions for one input instance.

mean_representations:

This key holds the embeddings generated by the ESM-2 model for the corresponding input sequence. These embeddings are the per-token representations averaged over the tokens of the sequence at the selected representative layer(s).

representations:

This key holds the full per-token hidden states generated by the ESM-2 model for the corresponding input sequence.

bos_representations:

This key holds the embeddings for the ‘bos’ (beginning-of-sequence) token generated by the ESM-2 model for the corresponding input sequence.

33:

The layer numbers corresponding to the selected representative layers in the request appear as sub-keys under the different representation fields.

These keys hold the embeddings for that specific layer. The layer numbering differs by model size: ESM-2 8M has only 6 layers, while ESM-2 650M has 33, so when using the ESM-2 8M endpoint this sub-key would never exceed 6.

logits:

This key contains the model logits for each token in the input sequence. The returned values are of size Sequence Length X 20 (the number of natural amino acids).

attentions:

This key corresponds to the attention maps computed at each layer of the model for the input sequence. Each layer contributes attention weights over the Sequence Length X Sequence Length pairs of positions.

contacts:

This key contains the predicted contacts (residues that are close together in structural space) for the input sequence. These contacts are of shape Sequence Length X Sequence Length.
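Once the encode response has been decoded, these fields can be loaded into arrays for downstream use. The snippet below is a sketch that assumes numpy is available, that the request asked for mean embeddings and contacts from the 650M model (whose last layer is 33), and that the 0.9 cutoff is purely illustrative.

import numpy as np

# `response` is the requests Response from the encode request above.
result = response.json()["results"][0]

# Mean embedding from layer 33 (the last layer of ESM-2 650M): one vector per sequence.
mean_embedding = np.array(result["mean_representations"]["33"])
print(mean_embedding.shape)

# Predicted contacts: a Sequence Length x Sequence Length matrix.
contacts = np.array(result["contacts"])
print(contacts.shape)

# Residue pairs with high predicted contact probability (0.9 is an arbitrary cutoff).
high_confidence_pairs = np.argwhere(contacts > 0.9)
print(high_confidence_pairs[:5])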

Prediction API Usage#

The predict action returns model-computed logits for masked sequences (sequences in which one or more amino acids are masked and unknown to the model). Appending ‘predict/’ to the model endpoints above gives access to these outputs.

Using the 650M model as an example, the predict endpoint is https://biolm.ai/api/v2/esm2-650m/predict/.

Making Requests#

curl --location 'https://biolm.ai/api/v2/esm2-650m/predict/' \
--header "Authorization: Token $BIOLMAI_TOKEN" \
--header 'Content-Type: application/json' \
--data '{
    "items": [
        {
            "sequence": "MAETAVINHKKRKNSPRI<mask>QSNDLTEAAYSLSRDQKRMLYLFVDQIRKSDGTLQEHDGICEIHVAKYAEIFGLTSAEASKDIRQALKSFAGKEVVFYRPEEDAGDEKGYESFPWFIKRAHSPSRGLYSVHINPYLIPFFIGLQNRFTQFRLSETKEITNPYAMRLYESLCQYRKPDGSGIVSLKIDWIIERYQLPQSYQRMPDFRRRFLQVCVNEINSRTPMRLSYIEKKKGRQTTHIVFSFRDITSMTTG"
        }
    ]
}'
import os
import requests
import json

url = "https://biolm.ai/api/v2/esm2-650m/predict/"

payload = json.dumps({
    "params": {
        "include": [
            "mean",
            "logits",
            "attentions"
        ]
    },
    "items": [
        {
            "sequence": "MAETAVINHKKRKNSPRI<mask>QSNDLTEAAYSLSRDQKRMLYLFVDQIRKSDGTLQEHDGICEIHVAKYAEIFGLTSAEASKDIRQALKSFAGKEVVFYRPEEDAGDEKGYESFPWFIKRAHSPSRGLYSVHINPYLIPFFIGLQNRFTQFRLSETKEITNPYAMRLYESLCQYRKPDGSGIVSLKIDWIIERYQLPQSYQRMPDFRRRFLQVCVNEINSRTPMRLSYIEKKKGRQTTHIVFSFRDITSMTTG"
        }
    ]
})
headers = {
    'Authorization': 'Token {}'.format(os.environ["BIOLMAI_TOKEN"]),
    'Content-Type': 'application/json'
}

response = requests.request("POST", url, headers=headers, data=payload)

print(response.text)
import biolmai
seqs = ["MAETAVINHKKRKNSPRI<mask>QSNDLTEAAYSLSRDQKRMLYLFVDQIRKSDGTLQEHDGICEIHVAKYAEIFGLTSAEASKDIRQALKSFAGKEVVFYRPEEDAGDEKGYESFPWFIKRAHSPSRGLYSVHINPYLIPFFIGLQNRFTQFRLSETKEITNPYAMRLYESLCQYRKPDGSGIVSLKIDWIIERYQLPQSYQRMPDFRRRFLQVCVNEINSRTPMRLSYIEKKKGRQTTHIVFSFRDITSMTTG"]

cls = biolmai.ESM2_650M()
resp = cls.predict(seqs)
library(RCurl)
headers = c(
'Authorization' = paste('Token', Sys.getenv('BIOLMAI_TOKEN')),
"Content-Type" = "application/json"
)
payload = "{
    \"items\": [
        {
            \"sequence\": \"MAETAVINHKKRKNSPRI<mask>QSNDLTEAAYSLSRDQKRMLYLFVDQIRKSDGTLQEHDGICEIHVAKYAEIFGLTSAEASKDIRQALKSFAGKEVVFYRPEEDAGDEKGYESFPWFIKRAHSPSRGLYSVHINPYLIPFFIGLQNRFTQFRLSETKEITNPYAMRLYESLCQYRKPDGSGIVSLKIDWIIERYQLPQSYQRMPDFRRRFLQVCVNEINSRTPMRLSYIEKKKGRQTTHIVFSFRDITSMTTG\"
        }
    ]
}"
res <- postForm("https://biolm.ai/api/v2/esm2-650m/predict/", .opts=list(postfields = payload, httpheader = headers, followlocation = TRUE), style = "httppost")
cat(res)

JSON Response#

{
    "results": [
        {
            "logits": [
                [
                    -0.8320964574813843,
                    -0.3259419798851013,
                    -1.3772594928741455,
                    ...
                ],
                ...
            ],
            "sequence_tokens": [
                "M",
                "A",
                "E",
                "T",
                "A",
                "V",
                "I",
                "N",
                "H",
                "K",
                "K",
                "R",
                "K",
                "N",
                "S",
                "P",
                "R",
                "I",
                "<mask>",
                "Q",
                ...
            ],
            "alphabet_tokens": [
                "L",
                "A",
                "G",
                "V",
                "S",
                "E",
                "R",
                "T",
                "I",
                "D",
                "P",
                "K",
                "Q",
                "N",
                "F",
                "Y",
                "M",
                "H",
                "W",
                "C"
            ]
        }
    ]
}

Note

The above response shows only small snippets of the full JSON response. Each of these dictionaries corresponds to one of the items submitted.

Request Definitions#

items:

Inside items is a list of dictionaries, with each dictionary corresponding to one model input.

sequence:

The input sequence for the model. One or more positions may be masked with the <mask> token, as shown in the examples above.

Response Definitions#

results:

This is the main key in the JSON object that contains an array of model results. Each element in the array represents a set of predictions for one input instance.

logits:

This key contains the model's output logits for each position in the input sequence. There are 20 logits for each position, corresponding to the 20 natural amino acids; they reflect the model's confidence in which of the 20 natural amino acids should occupy that specific position. In the case of the mask token, the logits give the model's prediction of which residue most likely occupies the masked position. The logits are of size Sequence Length X 20.

sequence_tokens:

Contains the tokens of the input sequence, of size Sequence Length.

alphabet_tokens:

The 20 amino acids corresponding to the 20 output logits at each position in the sequence.
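Putting these fields together, the highest-scoring amino acid at the masked position can be read off the logits. The snippet below is a minimal sketch assuming numpy is available and that the rows of logits align with sequence_tokens and the columns with alphabet_tokens, as described above.

import numpy as np

# `response` is the requests Response from the predict request above.
result = response.json()["results"][0]

logits = np.array(result["logits"])          # Sequence Length x 20
sequence_tokens = result["sequence_tokens"]  # input tokens, including "<mask>"
alphabet_tokens = result["alphabet_tokens"]  # 20 amino acids, in logit column order

# Locate the masked position and take the amino acid with the largest logit there.
mask_index = sequence_tokens.index("<mask>")
predicted_residue = alphabet_tokens[int(np.argmax(logits[mask_index]))]
print(mask_index, predicted_residue)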