-
Notifications
You must be signed in to change notification settings - Fork 2
Home
Welcome to the GlyCompare wiki!
GlyCompare is a novel method wherein glycans from glycomic data are decomposed to a minimal set of intermediate substructures, thus incorporating shared intermediate glycan substructures into all comparisons of glycans.
https://glycompare.herokuapp.com
If you run into RAM exceed problems, please contact us. We can run it for you on our lab server.
On the "Update your own dataset" page, the sections with "*" are required inputs. Please rigorously follow the naming instructions of your input files in the following sections.
- This is the name of your project. If your dataset is published, usually name it as <last name of the first author>_<year published> (i.e. Sibille_2016).
-
This is a CSV file named as <Dataset Name>_abundance_table.csv, where <Dataset Name> is exactly the one you filled in the Dataset Name section.
-
The row entries are samples. The columns are glycans. The table is expected to contain column names as glycan names and row names as sample names.
Click to see example
-
This is a CSV file named as <Dataset Name>_variable_annotation.csv, where <Dataset Name> is exactly the one you filled in the Dataset Name section.
-
The annotation file should have a column called "Name" that contains glycan names. Dependent on whether your dataset is compositional or structural, the other required column is either "Glycan Structure" for structural data or "Composition" for compositional data.
-
Structureal data includes IUPAC-extended, glycoCT, WURCS, glytoucan_id, and linear_code.
Click to see example
-
Compositional data is of the form HexNAc(2)Hex(5), where HexNAc and Hex are glycans and (2) and (5) are their occurance times.
Click to see example
On the "Run My Updated Dataset" page, you will need to input some parameters to proceed with the program.
The uploaded datasets are shown in the dropdown menu. Please select yours. If you have run the selected dataset before, the program will remember your input parameters and it will auto-set parameters if you press "Auto parameters".
- Structure: set to Structure if you have the Glycan Structure column in your variable annotation file. The column could be IUPAC-extended, glycoCT, WURCS, glytoucan_id, or linear_code.
- Composition: set to Composition if you have the Composition column in your variable annotation file. The data is of compositional form such as HexNAc(2)Hex(5)
Fill in this section if you set Structure for data type above.
- linkage + structure: if your structural data contains linkage information.
- structure: if your structural data doesn't contain linkage information.
You can choose from IUPAC-extended, glycoCT, WURCS, glytoucan_id, and linear_code. Please select the one that your data contains.
Select Biosynth if you want to restrict the analysis to a specified root
- N-glycan: Specify the root to GlcNAc
- O-glycan: Specify the root to GalNAc
- HMO/glycolipids: Specify the root to Gal(b1-4)Glc (lactose)
- Custom root: Specify the root to a custom monosaccharide(s). You need to input the glycoCT format of your custom root in the Custom core textbox below
Select Epitope if you don't want to specify a single root. Then the analysis will run where every possible monosaccharide is a root.
Check this if you want to draw cluster maps, including pseudo_profile_clustering, motif_cluster, profile_clustering
Click to see pseudo_profile_clustering example
Click to see motif_cluster example
Click to see profile_clustering example
Currently under construction.
- no normalization: Use the raw abundance data
- z-score normalization: standard normalization
- probabilistic quotient normalization: A commonly seen normalization method in biological data described in Dieterle et al. 2006
The reference motif vector is synthesized through past datasets and is used during glyCompare analysis to generate more complete motifs. It is automatically expanded with new datasets.