Classification table

The classification table is the main output file StructMAn produces. It is a tab-separated table file. Each row represents the computed results around one amino acid position of one queried protein. In the following, all its columns are explained:

Name	Description
Uniprot-Ac	Contains the Uniprot accession ID of the query protein.
Uniprot Id	Contains the Uniprot entry name of the query protein.
Refseq	Contains the Refseq-protein-ID of the query protein.
PDB-ID (Input)	In the case, PDB identifiers were given, this column contains the PDB identifier and the chain of the query.
Residue-Id	In the case, PDB identifiers were given, this column contains the residue identifier of the queried position.
Amino Acid	Contains the amino acid in one letter code of the queried position.
Position	Contains the number in the protein sequence of the queried position.
Tags	Contains the tags given by the input file regarding the queried position.
Weighted Surface/Core	Contains the annotation of whether the queried position lies on the solvent-accessible surface of the protein or in the solvent-unaccessible core of the protein, based on all mapped structures.
Class	Contains the distance-based structural classification of the queried position.
Simple Class	Contains a simplified version of the distance-based classification.
RIN Class	Contains the residue interaction network (RIN) based classification.
RIN Simple Class	Contains a simplified version of the RIN-based classification.
Confidence Value	Contains a score representing the confidence of the annotated structural classification, based on the amount and quality of mapped structures and of the ambiguity of the structural classifications of the individual mappings.
Secondary Structure	Contains the secondary structure assignment of the queried position performed by DSSP.
Recommended Structure	Contains the PDB-ID and the chain identifier of the recommended structure, which is the structure chosen by StructMAn to best represent the structural neighborhood of the queried position.
Sequence-ID	Contains the percent sequence identity between the sequence of the recommended structure and the query protein.
Coverage	Contains the fraction of the sequence of the query protein covered by the recommended structure when the sequences of the query protein and the structurally resolved protein are aligned.
Resolution	Contains the resolution of the recommended structure.
Max Seq Id Structure	Contains the PDB-ID and the chain identifier of the structure, which has the highest sequence identity of all structures where the queried residue can be mapped.
Max Sequence-ID	Contains the percent sequence identity between the sequence of the maximal sequence identity structure and the query protein.
Max Seq Id Coverage	Contains the fraction of the sequence of the query protein covered by the maximal sequence identity structure.
Max Seq Id Resolution	Contains the resolution of the maximal sequence identity structure.
Amount of mapped structures	Contains the total number of all structures, where the queried position could be mapped to.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Classification table

Clone this wiki locally