This repository provides the analysis process for the synergy data augmentation. It contains the datasets utilized in the analysis and generates the figures that are presented in the paper to describe the analysis process.
- Pandas 1.1.3
- Numpy 1.19.2
- Scipy 1.7.3
- Matplotlib 3.5.1
- Seaborn 0.11.2
- PIL 9.0.1
The following steps provide a guide to reproduce the analysis results presented in the paper.
Similarity measure for cellular responses to drug treatment
- Run
python Correlation_drug_pair.py
This will calculate the Kendall tau correlation coefficients for the drug pairs. - Run
python Make_pic50_scatter.py
- Run
python Legend.py
- Run
python Concat_figures.py
Relation between drug similarity and pharmacological effects
Run python Fig2_Analysis_Thresholds.py
Heatmap of the fraction of drug pairs with positively correlated pharmacological effects
Run python Fig3_Heatmap_Plot.py
Optimal DACS threshold for data augmentation
- Download the dataset
DACS_score_between_original_drug_and_similar_drugs.csv
from the link https://osf.io/kd9e7/ and put the dataset inData
folder - Run
Pre_process_for_the_plot.py
- Run
Fig4_Analysis_Intersection.m
in MATLAB
Distribution plots of the synergy scores in original and augmented datasets
- Download the dataset
AugmentedData.csv
from link https://osf.io/kd9e7/ and put the dataset inData
folder - Run
Fig5_Histograms.py