Skip to content

Latest commit

 

History

History
59 lines (48 loc) · 1.82 KB

README.md

File metadata and controls

59 lines (48 loc) · 1.82 KB

NDPredict

NDPredict: ASN (N) Deamidation Predictor

Use this program to take a PDB file with ASN residues, and predict the probability that the ASN site will be deamidated.

To install:

  • git clone this repo
git clone https://github.com/darianyang/ndpredict.git
  • cd into the repo
cd ndpredict
  • pip install directly
pip install .
  • eventually I may publish this on PyPI for easier installation

Requirements:

  • numpy
  • pandas
  • matplotlib
  • sklearn
  • mdtraj
  • biopython

Example Usage

For arg info:

ndpredict --help

ndpredict help

To use on an example PDB file (note that first the PDB must be cleaned and ready for input into MDTraj).

ndpredict -i pdb/1gb1_leap.pdb

The output will be a plot of each ASN residue of the input PDB (X axis) and the corresponding deamidation probability (Y axis):

1gb1 ndpredict

Note that there is still some work to be done to make the feature calculation robust enough be able to handle any new PDB input. So not every PDB input will work at the current time.

References & Motivation

  • using reference data and methodology inspiration from: Jia L, Sun Y (2017) Protein asparagine deamidation prediction based on structures with machine learning methods. PLoS ONE 12(7): e0181347

  • reference paper uses Discovery Studio to calculate features, which is not open source or reproducible. Here, I am working on an fully automated feature calculation pipeline from an input PDB file using pure Python and I refined a few of their metrics.

Copyright

Copyright (c) 2024, Darian Yang