This repository contains scripts and commands used to perform most analyses for The Role of Sex in Brain Protein Expression and Disease.
The goal of the repository is to provide transparency for critical analysis details of the manuscript. It is not the intention for the commands and scripts in the repository to be directly used to recapitulate the analyses of the manuscript since they were written for the specific computing infrastructure that existed at the time of the analysis. Rather, the code, commands, and notes in this repository are intended to be a guide for the approaches to the analyses performed in the manuscript.
Yue Liu in the Wingo Lab wrote the notes, scripts, and comments in this repository and performed the majority of the analyses for the manuscript. Yue's records were collated with minimal editing to provide details of the analysis: sex_specific_brain_protein_expression_project_analysis_notes.txt
.
The analysis used either a local environment or a high performance computing cluster with a job queue system.
Scripts are divided into either cluster
or local
directories depending on the context of the analysis.
Within the notes, scripts are prefixed with the appropriate directory (i.e., cluster/
or local/
) within this repository.
Scripts were tidied to improve readability using perltidy and tidyall for perl using settings in .perltidyrc
, shfmt for shell using default settings, and vscode-R and format-files for R using default settings.
The analysis shared in the manuscript and code both depend on external software that include:
- Bystro b10.
- COLOC v5.0.0.9002
- DEeq2 version v.1.26.0
- EIGENSTRAT 6.1.4
- FUSION
- GO-Elite version 1.2.5
- KING version 2.2.2.
- MASHR v0.2.38
- Plink version v1.90 beta.
- Proteome Discoverer suite v2.4.1
- qvalue version 2.22.0
- R version 3.5.1
- SEURAT version v3.1.2
- SMR software
- STAR v2.4
- Surrogate variable analysis v3.20.0.
- SusieR