This repository contains the code for the statistical analyses performed in Chapter 3 of my thesis, "Cross-sectional and longitudinal profiling of PD transcriptomics and metabolomics". The project consists of cross-sectional and longitudinal profiling of whole-blood transcriptomics and blood plasma metabolomics in Parkinson's disease (PD) patients and controls from the PPMI and LuxPARK cohorts, respectively, to identify differential molecular and higher-level functional features in PD.

Content

The code covers the following main tasks and analyses:

Loading, preparation and processing of datasets
Generation of higher-level functional features (mean, median, sd, first principal component "pca", pathifier deregulation scores) for the GOBP, GOCC and CORUM databases (PPMI) and for KEGG (LuxPARK)
Differential expression analysis for PD vs. control (PPMI) and differential abundance analysis for de novo PD vs. control and all PD vs. control (LuxPARK) for single molecules and higher-level functional representations
Longitudinal analyses: association with time, correlation, trend analysis over consecutive timepoints, for single molecules and higher-level functional representations
Pathway enrichment analysis using GSEA, goana and MeSH terms (PPMI)
Post-processing of results, visualizations (boxplots, trajectories)
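
As a rough illustration of the higher-level functional features listed above, the following minimal sketch aggregates the member features of each set into one score per sample using the mean, median, sd or first principal component. It is not the code from this repository: the toy matrix `expr`, the `gene_sets` list and the helper `aggregate_sets()` are hypothetical names chosen for the example, and pathifier deregulation scores are not covered.

```r
# Hypothetical toy data: 6 genes x 8 samples (rows = genes, columns = samples).
set.seed(1)
expr <- matrix(rnorm(48), nrow = 6,
               dimnames = list(paste0("gene", 1:6), paste0("sample", 1:8)))

# Hypothetical gene sets standing in for GOBP/GOCC/CORUM/KEGG annotations.
gene_sets <- list(
  setA = c("gene1", "gene2", "gene3"),
  setB = c("gene4", "gene5", "gene6", "gene_not_measured")
)

# Aggregate the member features of each set into one value per sample.
aggregate_sets <- function(expr, gene_sets,
                           method = c("mean", "median", "sd", "pca")) {
  method <- match.arg(method)
  scores <- lapply(gene_sets, function(members) {
    # Keep only the members that were actually measured.
    sub <- expr[intersect(members, rownames(expr)), , drop = FALSE]
    switch(method,
      mean   = colMeans(sub),
      median = apply(sub, 2, median),
      sd     = apply(sub, 2, sd),
      # Samples are the observations, so transpose before the PCA.
      pca    = prcomp(t(sub), center = TRUE, scale. = FALSE)$x[, 1]
    )
  })
  # One row per set, one column per sample.
  do.call(rbind, scores)
}

set_means <- aggregate_sets(expr, gene_sets, "mean")
set_pc1   <- aggregate_sets(expr, gene_sets, "pca")
```

The resulting set-by-sample matrices can be analysed in the same way as the single-molecule matrices. Note that the sign of a first principal component is arbitrary, so the direction of "pca" scores needs care when interpreting results.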

Each directory contains a README.md that explains each of its scripts:

ppmi_analyses contains the code for the analysis of transcriptomics data from PPMI.

luxpark_analysis contains the code for the analysis of metabolomics data from LuxPARK.

Data

The public transcriptomics data used in this project were derived from the Parkinson’s Progression Markers Initiative (https://www.ppmi-info.org/, RNAseq - IR3). The metabolomics data from LuxPARK are not publicly available because they are linked to the Luxembourg Parkinson’s Study and its internal regulations. Requests for access to the dataset can be directed to request.ncer-pd@uni.lu.

Requirements

The code was mostly written and tested in R (R 4.0.3) on both macOS (Ventura) and Linux (Ubuntu 23.04), relying on multiple R and Bioconductor packages that are listed at the beginning of the corresponding R scripts. The code should be compatible with later versions of R on current macOS, Linux or Windows systems. The R packages loaded at the beginning of each script must be installed before running it. R packages available on CRAN can be installed with the command:

install.packages("PACKAGE_NAME")

R packages from Bioconductor can be installed with the following commands:

if (!require("BiocManager", quietly = TRUE)) { install.packages("BiocManager") }

BiocManager::install("PACKAGE_NAME")
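
To find out which packages still need installing before running a script, a small check along the following lines may help. This is only a sketch: `missing_packages()` is a hypothetical helper, not part of this repository, and the names in `needed` are placeholders for the packages loaded at the top of the script in question.

```r
# Hypothetical helper: return the subset of `pkgs` that is not installed.
missing_packages <- function(pkgs) {
  installed <- vapply(pkgs, requireNamespace, logical(1), quietly = TRUE)
  pkgs[!installed]
}

# Placeholder names; replace with the packages loaded in the script to be run.
needed <- c("stats", "utils")
to_install <- missing_packages(needed)

if (length(to_install) > 0) {
  message("Missing packages: ", paste(to_install, collapse = ", "))
} else {
  message("All listed packages are installed.")
}
```

Any package reported as missing can then be installed with one of the two commands above, depending on whether it is distributed through CRAN or Bioconductor.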

License

The code is available under the MIT License (see LICENSE).
