-
Notifications
You must be signed in to change notification settings - Fork 0
Classification table
AlexanderGress edited this page Oct 30, 2020
·
13 revisions
The classification table is the main output file StructMAn produces. It is a tab-separated table file. Each row represents the computed results around one amino acid position of one queried protein. In the following, all its columns are explained:
Name | Description |
---|---|
Uniprot-Ac | Contains the Uniprot accession ID of the query protein. |
Uniprot Id | Contains the Uniprot entry name of the query protein. |
Refseq | Contains the Refseq-protein-ID of the query protein. |
PDB-ID (Input) | In the case, PDB identifiers were given, this column contains the PDB identifier and the chain of the query. |
Residue-Id | In the case, PDB identifiers were given, this column contains the residue identifier of the queried position. |
Amino Acid | Contains the amino acid in one letter code of the queried position. |
Position | Contains the number in the protein sequence of the queried position. |
Tags | Contains the tags given by the input file regarding the queried position. |
Weighted Surface/Core | Contains the annotation of whether the queried position lies on the solvent-accessible surface of the protein or in the solvent-unaccessible core of the protein, based on all mapped structures. |
Class | Contains the distance-based structural classification of the queried position. |
Simple Class | Contains a simplified version of the distance-based classification. |
RIN Class | Contains the residue interaction network (RIN) based classification. |
RIN Simple Class | Contains a simplified version of the RIN-based classification. |
Confidence Value | Contains a score representing the confidence of the annotated structural classification, based on the amount and quality of mapped structures and of the ambiguity of the structural classifications of the individual mappings. |
Secondary Structure | Contains the secondary structure assignment of the queried position performed by DSSP. |
Recommended Structure | Contains the PDB-ID and the chain identifier of the recommended structure, which is the structure chosen by StructMAn to best represent the structural neighborhood of the queried position. |
Sequence-ID | Contains the percent sequence identity between the sequence of the recommended structure and the query protein. |
Coverage | Contains the fraction of the sequence of the query protein covered by the recommended structure when the sequences of the query protein and the structurally resolved protein are aligned. |
Resolution | Contains the resolution of the recommended structure. |
Max Seq Id Structure | Contains the PDB-ID and the chain identifier of the structure, which has the highest sequence identity of all structures where the queried residue can be mapped. |
Max Sequence-ID | Contains the percent sequence identity between the sequence of the maximal sequence identity structure and the query protein. |
Max Seq Id Coverage | Contains the fraction of the sequence of the query protein covered by the maximal sequence identity structure. |
Max Seq Id Resolution | Contains the resolution of the maximal sequence identity structure. |
Amount of mapped structures | Contains the total number of all structures, where the queried position could be mapped to. |