Download the predictions in JSON or CSV format, and visit the API results with the above buttons.
You can select the protein you want to display on the right. Toggle predictions on and off by clicking on their name underneath the plot.
Multiple sequence alignment (MSA) tutorial - Biophysical conservation
The selection and plot below are almost the same as the single sequence predictions, refer to the single sequence tutorial for the main functionality. Hovering over the plot, however, will display the Gaussian Mixture Model (GMM) score for that residue. This score is based on an analysis of the 7-dimensional 'biophysical space' for that residue, in relation to the column it occupies in the MSA.
High scores indicate very normal scores, low (negative) scores indicate unusual scores, meaning this residue is unlike other ones in the alignment. Below the plot, you will also find a breakdown of the residues that are indicated by the GMM as unusual, using 5%, 1% and 0.1% cutoffs over the full MSA.
Click on the above prediction names to toggle them on/off
The above plot shows the following predictions for :
|DynaMine backbone dynamics||Values above 0.8 indicate rigid conformations, values above 1.0 membrane spanning regions, values below 0.69 flexible regions. Values between 0.69-0.80 are 'context' dependent and capable of being either rigid or flexible.|
|DynaMine sidechain dynamics||Higher values mean more likely rigid. These values are highly dependent on the amino acid type (i.e. a Trp will be rigid, an Asp flexible).|
|DynaMine conformational propensities (sheet, helix, coil, ppII (polyproline II))||Higher values indicate higher propensities.|
|EFoldMine earlyFolding propensity||Values above 0.169 indicate residues that are likely to start the protein folding process, based on only local interactions with other amino acids.|
|Disomine disorder||Values above 0.5 indicate that this is likely a disordered residue.|
The plot below shows, for the protein you selected above, the variation in predicted biophysical parameters within the multiple sequence alignment (MSA) that you uploaded. This variation is displayed according to simple box plot statistics, with median, first/third quartile, and outlier range of the distributions shown. Columns in the MSA that are 'gapped' for the selected protein are not shown here.
In other words, what is displayed is how the biophysical prediction for each aligned position varies for all the proteins that are in the MSA. You can select the type of prediction that you want to display in the selection box below the plot, and turn each distribution statistic on and off by clicking on its name. The 'prediction' field corresponds to the same type of prediction shown in the top plot.
The selected prediction below this message is visualized in the context of the input MSA. Therefore, there may be gaps where the selected protein lacks values. You can still visualize all the statistical fields for these gaps in the alignment.