PP_dialog_models

Computational methods for evaluating patient-provider communication.
Logistic regression and HMM can be used for prediction.

Author: Jihyun Park [email protected]
Last updated: 6/22/2018

Requirements

Python2 should be installed with packages numpy, nltk, pandas, sklearn, csv, cPickle.

Tutorial iPython Notebook

load_model_and_predict.ipynb
Demo iPython notebook file that loads the pre-trained model and the sample test data and predicts on the sample test data set.

Input training and test file format

Training and test file should be a pipe-delimited file (delimited with "|") with visitid, talkturn, text, topicnumber, topicletter as column names.
For the test data, topicnumber and topicletter columns are not necessary since the test data can be run without labels. However, the scores will not be calculated without those columns.

models.py

Classes for models are in the file. Details of the usage can be found in the demo iPython notebook file and the code docstring.

DialogModel
Base class for dialog model. Used when you have a set of results from another base model (independent model) that is trained somewhere else (e.g. output from RNN). Predictions and output probabilities are loaded using load_model() in this class object and then the object can be plugged into HMM.
LogRegDialogModel
Class for running independent logistic regression model.
fit_model(tr_data) to train data, predict(te_data) to make prediction.
HMMDialogModel
Class for running Hidden Markov Model on top of some base independent model.
fit_model(tr_data) to train data, predict_viterbi(te_data) to make prediction.
DialogResult
Class that stores the results and calculates and prints out the scores.

mhddata.py

Classes for the data. The classes loads the data and pre-processes.

DialogData : Base class for dialog data.
MHDTrainData : Class for MHD training data.
MHDTestData : Class for MHD test data. Preprocessing methods are in preprocess.py file.

hmm.py

Methods that are related to HMM.

utils.py

Utility methods.

Pre-trained models

Saved as model/*.pkl Files

Vocabulary and Label files

Vocabulary and label files from the training data are saved as data/vocab.pkl and data/label*.pkl.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

PP_dialog_models

Requirements

Tutorial iPython Notebook

Input training and test file format

models.py

mhddata.py

hmm.py

utils.py

Pre-trained models

Vocabulary and Label files

About

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 52 Commits
data		data
model		model
stopwordlists		stopwordlists
LICENSE		LICENSE
README.md		README.md
evaluate.py		evaluate.py
hmm.py		hmm.py
load_model_and_predict.ipynb		load_model_and_predict.ipynb
mhddata.py		mhddata.py
models.py		models.py
preprocess.py		preprocess.py
requirements.txt		requirements.txt
utils.py		utils.py

License

UCIDataLab/PP_dialog_models

Folders and files

Latest commit

History

Repository files navigation

PP_dialog_models

Requirements

Tutorial iPython Notebook

Input training and test file format

models.py

mhddata.py

hmm.py

utils.py

Pre-trained models

Vocabulary and Label files

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages