type-null/aiTrain

AI training

Authors: Sahil, Thomas, Ren

Training a deep learning classifier for a sentiment classification task. For the pre-processing library, see here.

1. Pre Processing ETL

  • Modified the pre-processing library so it can load the dictionary word list directly from a zip file without unzipping it first.
  • Added <pad> and <unknown> to the top of our word list "index_arry.txt" in the pre-processing library.
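
The two changes above can be sketched as follows. This is a minimal illustration, not the library's actual code: the function names `load_word_list`, `build_index`, and `encode` are hypothetical, and it assumes the word list holds one word per line with <pad> and <unknown> already on the first two lines.

```python
import io
import zipfile


def load_word_list(zip_path, member="index_arry.txt"):
    """Read the word list from inside a zip archive without extracting it.

    `zip_path` may be a file path or a binary file-like object.
    """
    with zipfile.ZipFile(zip_path) as archive:
        # archive.open() streams the member; nothing is written to disk.
        with archive.open(member) as raw:
            text = io.TextIOWrapper(raw, encoding="utf-8")
            return [line.strip() for line in text if line.strip()]


def build_index(words):
    """Map each word to its line position in the word list."""
    return {word: position for position, word in enumerate(words)}


def encode(tokens, index):
    """Convert tokens to integer ids, using the <unknown> id as fallback."""
    unknown_id = index["<unknown>"]
    return [index.get(token, unknown_id) for token in tokens]
```

Because <pad> and <unknown> sit at the top of the file, they receive ids 0 and 1 under this scheme, so padded positions and out-of-vocabulary words get fixed, predictable ids.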

2. Run the Pre Processing on the dataset

3. TensorFlow model

Forked from https://github.com/pharnoux/columbia-aiops-training

  • Changed the embeddings to match our word list mentioned above. The embedding file is too large for GitHub; see the S3 bucket.

  • Built our model; see the folder model_training.

  • Ran the model locally. The successful result is shown in the highlighted line below. modelLocal
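
One way to keep the embeddings consistent with the word list is to build the embedding rows in word-list order, so that row i belongs to the word with id i. The sketch below is an assumption about how that alignment could be done, not the code in model_training: `align_embeddings` is a hypothetical helper, and giving words without a pretrained vector (including <pad> and <unknown>) an all-zero row is an illustrative choice.

```python
def align_embeddings(words, vectors, dim):
    """Return one embedding row per word, in word-list order.

    words   -- list of words; position in the list is the word id
    vectors -- dict mapping word -> sequence of `dim` floats (pretrained)
    dim     -- embedding dimension
    """
    rows = []
    for word in words:
        vector = vectors.get(word)
        if vector is None:
            # Assumption: words with no pretrained vector get zeros.
            vector = [0.0] * dim
        if len(vector) != dim:
            raise ValueError(f"vector for {word!r} has wrong dimension")
        rows.append([float(value) for value in vector])
    return rows
```

The resulting matrix has shape (len(words), dim) and could be supplied as the initial weights of an embedding layer, for example through a constant initializer in Keras.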

4. SageMaker training

  • Created a notebook on SageMaker. Modified the code to load the data and the dictionary from S3. embeddingFromS3

  • Ran the code from Step #3 there successfully. notebookResult

  • Wrote the output directly to S3. resultInS3
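
Reading inputs from S3 and writing results back might look like the sketch below. It is an illustration under assumptions rather than the notebook's code: the helper names, bucket name, and object keys are hypothetical, and the client is passed in as a parameter (in a SageMaker notebook it would typically be created with `boto3.client("s3")`).

```python
def read_lines_from_s3(s3_client, bucket, key):
    """Download a UTF-8 text object and return its non-empty lines."""
    response = s3_client.get_object(Bucket=bucket, Key=key)
    text = response["Body"].read().decode("utf-8")
    return [line.strip() for line in text.splitlines() if line.strip()]


def write_text_to_s3(s3_client, bucket, key, text):
    """Upload a string as a UTF-8 text object."""
    s3_client.put_object(Bucket=bucket, Key=key, Body=text.encode("utf-8"))


# Example usage (hypothetical bucket and keys, requires boto3 and AWS access):
#   import boto3
#   s3 = boto3.client("s3")
#   words = read_lines_from_s3(s3, "my-bucket", "dictionary/index_arry.txt")
#   write_text_to_s3(s3, "my-bucket", "results/metrics.txt", "accuracy: ...")
```

Passing the client in keeps the helpers easy to exercise locally with a stand-in object before running them against a real bucket.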
