AIUPred

Basics

The main aim of AIUPred is to identify Intrinsically Disordered Protein Regions (IDPRs, i.e. regions that lack a stable monomeric structure under native conditions) based on a biophysics-based model enhanced with deep learning techniques. The user can input any protein sequence and IUPred returns a score between 0 and 1 for each residue, corresponding to the probability of the given residue being part of a disordered region.

The disordered nature of a protein segment can be context dependent: certain protein regions can switch between an ordered and a disordered state depending on various environmental factors. Currently, the AIUPred server is able to detect such context-dependent disorder in the case where the environmental factors are either a change in the redox state or the presence of an ordered binding partner (for more details see here).

The following sections outline the use of AIUPred in various scenarios.

Protein sequence input

There are two basic ways to input protein sequences into AIUPred:

If the protein is deposited in the UniProt database (either in SwissProt or TrEMBL) you can specify the accession code or the Entry of the protein in the "Enter UniProt accession or entry" field. The AIUPred server is always linked to latest version of UniProt. The header of the UniProt entry will be
Type or cut and paste your sequence in the "paste the amino acid sequence" field.

FASTA format

Prediction type

There are two different disorder prediction types offered, each using different parameters optimized for slightly different applications through advanced options. These are: Default smoothing and no smoothing

Default smoothing:
Apply a Savitzky–Golay filter with a window size of 11 and a polynom order of 5 which increases the performance of the method

No smoothing
Show the standard output of the neural network without any smoothing.

Context-dependent predictions

IDPRs often harbor binding regions that are able to specifically interact with a globular domain. During this interaction, in the majority of known cases, the binding disordered region adopts an ordered structure in its bound form. This is probably the most commonly occurring context-dependent protein disorder, where the transition between the unstructured and the structured states is initiated by the presence of an appropriate protein partner. Such disordered binding regions are identified using the AIUPred-binding prediction algorithm. Similarly to AIUPred, AIUPred-binding also assigns to each residue a score between 0 and 1, representing the probability of the given residue to be part of a disordered binding region. Selecting binding as a prediction option, the binding score is provided along with the IUPred score.

Flexible Linker Prediction:
AIUPred is also capable of predicting flexible linkers by mathematically combining the disorder and binding scores. This feature identifies regions that are highly disordered but lack binding sites, functioning as flexible connecting regions between distinct functional domains.

Output

Basic features:
The primary output of AIUPred is a graph showing the disorder tendency of each residue in the given protein, where higher values correspond to a higher probability of disorder. The graph is scalable and can be directly downloaded for presentation/publication purposes. The list of position-specific disorder scores is also downloadable in simple text or JSON format.

Extended features:
If the prediction was run by specifying a UniProt ID/accession, the output of AIUPred also shows additional protein annotations, including Pfam regions; post-translational modifications (PTMs), including phosphorylations (upper line), methylations and acetylations (lower line) taken from PhosphoSitePlus; corresponding structures from the PDB; and regions that were experimentally verified to be disordered, taken from DisProt, DIBS, and MFIB.

If context-dependent predictions were selected, the output graph and the downloadable results incorporate additional data as well.
Regions overlapping with experimentally verified disordered regions are marked with a red background on the plot. Alongside with this notation regions which were categorised as ordered are marked with a grey background. In case of disordered binding region prediction, the graph shows the probability of each residue being part of a binding region in blue.
If flexible linker prediction is selected, the probability of each residue functioning as a linker is displayed as a green curve.
The presence or absence of the AIUPred, binding, and linker scores are switchable by clicking on the legend.

RESTful API

AIUPred can be accessed using RESTful API to analyse proteins programatically. The API can be accessed using a standard GET request at

https://aiupred.elte.hu/rest_api

Available paramaters are:

Parameter		Default	Values
accession	Required		UniProt accession
analysis_type	Optional		'binding' for AIUPred-binding, 'linker' for linkers, and 'redox' for redox
smoothing	Optional	default	'default' or 'False'

import requests
import json

data = {'accession': 'q32p44', 'smoothing': 'default', 'analysis_type': 'linker'}
url = 'https://aiupred.elte.hu/rest_api'
for key, val in json.loads(requests.get(url, params=data).text).items():
    print(key, val)

Programmatic usage

AIUPred can be freely downloaded for academic users. It is provided as a modern, installable Python library with a built-in executable script. It is highly advised to use a virtual environment! You can install AIUPred directly from GitHub:

pip install git+https://github.com/doszilab/AIUPred.git

Once installed, you can use the standalone executable globally by running:

aiupred -i input.fasta

Available options:

usage: aiupred [-h] -i INPUT_FILE [-o OUTPUT_FILE] [-v] [-b] [-l] [-g GPU] [--force-cpu]

options:
  -h, --help            show this help message and exit
  -i INPUT_FILE, --input_file INPUT_FILE
                        Input file in (multi) FASTA format
  -o OUTPUT_FILE, --output_file OUTPUT_FILE
                        Output file
  -v, --verbose         Increase output verbosity
  -b, --binding         Predict binding using AIUPred-binding
  -l, --linker          Predict flexible linkers
  -g GPU, --gpu GPU     Index of GPU to use, default=0
  --force-cpu           Force the network to only utilize the CPU. Calculation will be very slow.

The following section gives an example of how to use the importable Python library.

Because AIUPred is now a standard Python package, there is no need to modify your PYTHONPATH. Simply import the class in your Python scripts:

from aiupred import AIUPred

# Initialize the predictor (loads the models into memory automatically)     
predictor = AIUPred()

sequence = 'THISISATESTSEQUENCE'

# Predict disorder
disorder_scores = predictor.predict_disorder(sequence)

# Predict binding
binding_scores = predictor.predict_binding(sequence)

# Predict flexible linkers
# Tip: Pass the pre-calculated disorder and binding arrays to skip redundant network inference!
linker_scores = predictor.predict_linker(sequence, disorder_pred=disorder_scores, binding_pred=binding_scores)

Primary citation

AIUPred: combining energy estimation with deep learning for the enhanced prediction of protein disorder
Gábor Erdős, Zsuzsanna Dosztányi
Nucleic Acids Research 2024; gkae385

AIUPred – Binding: Energy Embedding to Identify Disordered Binding Regions
Gábor Erdős, Norbert Deutsch, Zsuzsanna Dosztányi
Journal of Molecular Biology 2025