-
Notifications
You must be signed in to change notification settings - Fork 0
Classification table
The classification table is the main output file StructMAn produces. It is a tab-separated table file. Each row represents the computed results around one amino acid position of one queried protein. In the following all its columns are explained:
contains the Uniprot accession ID of the query protein.
contains the Uniprot entry name of the query protein.
contains the Refseq-protein-ID of the query protein.
contains the amino acid in one letter code and the number in the protein sequence of the queried position.
contains a short species ID, to which the query protein belongs.
contains the tags given by the input file.
contains the annotation whether the queried position lies on the solvent accessible surface of the protein or in the solvent unaccessible core of the protein based on all mapped structures.
contains the structural classification of the queried position.
contains a simplified version of 8.
contains a score representing the confidence of the annotated structural classification, based on the amount and quality of mapped structures and of the ambiguity of the structural classifications of the individual mappings.
contains the secondary structure assignment of the queried position performed by DSSP.
contains the PDB-ID and the chain identifier of the recommended structure, which is the structure, chosen by StructMAn to best represent the structural neighborhood of the queried position.
contains the sequence identity between the sequence of the recommended structure and the query protein.
contains the the fraction of the sequence of the query protein covered by the recommended structure.
contains the resolution of the recommended structure.
contains the PDB-ID and the chain identifier of the structure, which has the highest sequence identity of all mapped structures.
contains the sequence identity between the sequence of the maximal sequence identity structure and the query protein.
contains the the fraction of the sequence of the query protein covered by the maximal sequence identity structure.
contains the resolution of the maximal sequence identity structure.
contains the total number of all structures, where the queried position could be mapped to.