austinlasseter / sagemaker_lda_medical Public

Notifications You must be signed in to change notification settings
Fork 0
Star 0

Using the LDA algorithm on SageMaker to extract topics from medical transcriptions data

0 stars 0 forks Branches Tags Activity

Notifications

Name		Name	Last commit message	Last commit date
Latest commit History 34 Commits
data		data
docs		docs
exploratory		exploratory
images		images
.gitignore		.gitignore
3clusters.ipynb		3clusters.ipynb
3topics.html		3topics.html
README.md		README.md
eda-medical.ipynb		eda-medical.ipynb
finalreport.pdf		finalreport.pdf
proposal.pdf		proposal.pdf
surgery-lda-coherence.ipynb		surgery-lda-coherence.ipynb

Repository files navigation

SageMaker LDA Medical

RESULTS OF THE FINAL MODEL CAN BE VIEWED HERE:

http://sagemaker-lda-medical.s3-website-us-east-1.amazonaws.com/

Data source:

https://www.kaggle.com/tboyle10/medicaltranscriptions
Medical transcription data scraped from mtsamples.com

Python libraries used:

pandas
scikit learn
gensim
boto3
plotly
pyLDAvis

Templates followed:

Link to previous proposal review:

https://review.udacity.com/?utm_campaign=ret_600_auto_ndxxx_project-passednew_global&utm_source=blueshift&utm_medium=email&utm_content=ret_600_auto_ndxxx_project-passednew_global&bsft_clkid=943bc2e5-061c-45f9-b074-1de86f38cd73&bsft_uid=3c530f74-7441-4b91-849d-cb7956207ef1&bsft_mid=eb3afafc-4f57-4e40-8aa1-5afb2322a576&bsft_eid=df4dd510-74b9-5827-929d-e272d8aaf47d&bsft_txnid=81ca0956-e8e3-4fc6-8671-dbacad74e53c&bsft_link_id=2&bsft_mime_type=html&bsft_ek=2020-10-29T15%3A46%3A40Z&bsft_aaid=8d7e276e-4a10-41b2-8868-423fe96dd6b2&bsft_lx=2&bsft_tv=1#!/reviews/2568920

About

Using the LDA algorithm on SageMaker to extract topics from medical transcriptions data

Report repository

Releases

No releases published

Packages

No packages published

Languages