With the increased simplicity associated with producing MS-based proteomics data, the bottleneck has now shifted to the functional analysis and exploration of large lists of expressed proteins to extract meaningful biological knowledge. Bioinformatics resources are often spread and disseminated under different forms (program/libraries/software/web tools and databases) and their access is rather limited for researchers without programming experience or no in-house bioinformatics support. As a consequence, interpretation of their data by experts remains a tedious and time-consuming process, and potentially error-prone (e.g., due to manual handling or input error).
The ProteoRE (Proteomics Research Environment) aims at fulfilling this need by centrally providing an online research service to assist biologists/clinicians in the interpretation of their proteomics data in a unified framework. Built upon the Galaxy environment, this web-based platform for computational biomedical research, allows researchers to apply a large range of dedicated bioinformatics tools and data analysis workflows on their data, share their analyses with others, and enable tiers to repeat the same analysis while keeping tracks of the overall process.
Currently, ProteoRE implements 21 tools organized into four subsections for: i) data manipulation and visualization; ii) Get features/annotation; iii) functional analysis; and iv) pathway analysis along with graphical representations. Furthermore, we also developed a specialized tool that allow for the management of annotation from external resources upon which some ProteoRE’s tools are based (e.g. Uniprot, Human Protein Atlas, Biogrid, etc.). The ProteoRE platform has been designed in close collaboration with biomedical researchers on the basis of case studies such as functional analysis of a human MS/MS proteomics sample [1] and the selection of candidate proteomics biomarkers of human disease [2,3]. Our platform also provides online support, shared workflows and training materials (shared via the Galaxy Training Network, "proteomics" directory) and is in free access: http://www.proteore.org.
All ProteoRE's tools are publicly developped on GitHub. These tools can be either tested via the ProteoRE main instance or installed on any Galaxy instance through the Galaxy ToolShed
Please, do not hesitate to provide us with feedbacks on tools (by opening an issue or contacting us: [email protected] ). We will be glad to consider any suggestion / feedback you may have. For contributors, please note that any pull request / contribution will be examined for compliancy with the IUC standards.
ProteoRE project is a joint national effort between the French bioinformatics Institute (IFB) and the proteomics infrastructure (ProFI) funded by the French Research Agency (ANR-11-INBS-0013).
David Christiany (software engineer), Florence Combes (bioinformatician), Lien Nguyen (former software engineer), Lisa Perus (former internship), Virginie Brun, Maud Lacombe, Marianne Tardif & Benoit Gilquin (use-case & beta-testing), Valentin Loux (Bioinformatician , IT Manager), Yves Vandenbrouck (Project manager – [email protected]) The Galaxy Project is supported in part by NHGRI, NSF, The Huck Institutes of the Life Sciences, The Institute for CyberScience at Penn State, and Johns Hopkins University.
[1] Lacombe M et al. Proteomic characterization of human exhaled breath condensate. J Breath Res. 12(2):021001, 2018. 10.1088/1752-7163/aa9e71
[2] Nguyen L. et al., Designing an In Silico Strategy to Select Tissue-Leakage Biomarkers Using the Galaxy Framework. Methods Mol Biol. 1959:275-289, 2019. 10.1007/978-1-4939-9164-8_18
[3] Vandenbrouck Y, Christiany D, Combes F, Loux V, Brun V. Bioinformatics Tools and Workflow to Select Blood Biomarkers for Early Cancer Diagnosis: An Application to Pancreatic Cancer. Proteomics. 2019. (21-22):e1800489. [doi:10.1002/pmic.201800489] (https://www.ncbi.nlm.nih.gov/pubmed/31538697)