nanogpt

An implementation of Karpathy's nanoGPT. I've adapted the model for text classification by removing the language model head and adding a binary classifier on top.
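To make the head swap concrete, here is a minimal sketch in plain Python of what a binary classification head computes on top of the transformer's hidden states. It is not the code from this repository: the function names, the choice of last-token pooling, and the toy dimensions are all assumptions for illustration. In a PyTorch model the same step would typically be a single linear layer with one output.

```python
import math
import random


def classifier_head(hidden_states, weight, bias):
    """Map per-token hidden states to a probability for the positive class.

    hidden_states: list of T vectors (each of length n_embd), standing in
    for the output of the final transformer block (hypothetical input).
    """
    # Pool the sequence into one vector. Taking the last token is an
    # assumption; mean pooling is another common choice.
    pooled = hidden_states[-1]
    # One linear unit replaces the vocabulary-sized language model head.
    logit = sum(w * h for w, h in zip(weight, pooled)) + bias
    # Sigmoid turns the logit into a probability in (0, 1).
    return 1.0 / (1.0 + math.exp(-logit))


def binary_cross_entropy(prob, label, eps=1e-12):
    """Loss for a single example with label 0 or 1."""
    return -(label * math.log(prob + eps)
             + (1 - label) * math.log(1 - prob + eps))


# Toy example with made-up sizes and random values.
random.seed(0)
n_embd, seq_len = 8, 5
hidden = [[random.gauss(0, 1) for _ in range(n_embd)] for _ in range(seq_len)]
weight = [random.gauss(0, 0.02) for _ in range(n_embd)]

prob = classifier_head(hidden, weight, bias=0.0)
loss = binary_cross_entropy(prob, label=1)
```

The language model head produces one logit per vocabulary entry at every position, whereas this head produces a single logit per sequence, which is why the loss changes from cross-entropy over tokens to binary cross-entropy over examples.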

Development

poetry install

Training

Test run:

python train.py --config-name rotten_tomatoes_binary_classification_fast

Full training run:

python train.py --config-name rotten_tomatoes_binary_classification