This repository provides an implementation of the methods described in NarrowBERT: Accelerating Masked Language Model Pretraining and Inference. Pretrained and MNLI-finetuned models can be downloaded here.
This implementation is mainly based on Huggingface Transformers together with the DeepSpeed optimization package. Use `./requirements.txt` to make sure these packages are installed.
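With a standard Python environment, the requirements file can be installed the usual way:

```bash
pip install -r ./requirements.txt
```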
We provide an example DeepSpeed configuration, `./ds_zero2_1gpu.json`, but feel free to use your own.
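For reference, a minimal ZeRO stage-2 configuration of this kind typically looks like the sketch below. The values here are illustrative only (`"auto"` lets the Huggingface Trainer fill in its own settings); the repository's `./ds_zero2_1gpu.json` is authoritative:

```json
{
  "train_batch_size": "auto",
  "gradient_accumulation_steps": "auto",
  "fp16": { "enabled": "auto" },
  "zero_optimization": {
    "stage": 2,
    "allgather_partitions": true,
    "overlap_comm": true,
    "contiguous_gradients": true
  }
}
```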
Configuration and model implementation can be found in `./narrowbert`; the code is mainly adapted from the BERT implementation provided on Huggingface.
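Since the code follows the Huggingface BERT conventions, usage is likely to look like the sketch below. The class names here are hypothetical; check `./narrowbert` for the actual names and arguments:

```python
# Hypothetical sketch only: the real class names and arguments live in ./narrowbert.
from narrowbert import NarrowBertConfig, NarrowBertForMaskedLM  # hypothetical names

config = NarrowBertConfig()            # BERT-style config object, per the Huggingface convention
model = NarrowBertForMaskedLM(config)  # masked-LM head on top of the encoder
```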
We provide training scripts for all tasks mentioned in the paper, but you can also take the models and train them with your own scripts. Model classes are provided for MLM pretraining, sequence classification, and token classification, which cover the experiments in the paper.
For the tokenizer, we reuse the BERT tokenizer provided on Huggingface; all of our experiments use the pretrained tokenizer from `bert-base-uncased`.
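Loading it follows the standard Transformers pattern:

```python
from transformers import AutoTokenizer

# The pretrained tokenizer used in all experiments.
tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")
encoded = tokenizer("NarrowBERT reuses the standard BERT vocabulary.", return_tensors="pt")
```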
`./run_mlm_narrowbert.py` is the pretraining script, adapted from the Huggingface example `run_mlm.py`. You can run it with

```
python ./run_mlm_narrowbert.py ./run_mlm_config.json
```

where `./run_mlm_config.json` contains the hyperparameters that were used.
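The script takes a single JSON file whose fields mirror `run_mlm.py`'s command-line arguments (the usual Huggingface example-script convention, assuming the adapted script keeps those argument names). A purely illustrative sketch with placeholder values, not the paper's hyperparameters:

```json
{
  "model_type": "bert",
  "tokenizer_name": "bert-base-uncased",
  "dataset_name": "wikitext",
  "dataset_config_name": "wikitext-103-raw-v1",
  "max_seq_length": 128,
  "per_device_train_batch_size": 32,
  "learning_rate": 1e-4,
  "num_train_epochs": 3,
  "do_train": true,
  "output_dir": "./mlm_output",
  "deepspeed": "./ds_zero2_1gpu.json"
}
```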
For sequence classification, we adapt the Huggingface example `run_glue.py` and provide `./run_glue_narrowbert.py` with the corresponding configuration. To run the script:

```
python ./run_glue_narrowbert.py [config]
```

where you can replace `[config]` with `./config_glue.json` or `./config_imdb.json`. Again, you can modify hyperparameters or choose different GLUE tasks using these JSON files.
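The finetuning configs follow the same convention, with fields mirroring `run_glue.py`'s arguments. A purely illustrative sketch with placeholder values; the provided `./config_glue.json` is authoritative:

```json
{
  "model_name_or_path": "./mlm_output",
  "task_name": "mnli",
  "max_seq_length": 128,
  "per_device_train_batch_size": 32,
  "learning_rate": 2e-5,
  "num_train_epochs": 3,
  "do_train": true,
  "do_eval": true,
  "output_dir": "./glue_output"
}
```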
Note that Amazon-2/5 requires some data preprocessing; we provide the script we used, `./amazon.py`. To run preprocessing:

```
python ./amazon.py [cache_path] [amazon2_save_path] [amazon5_save_path]
```
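Purely as an illustration of the general shape of such a step, a download-and-save flow with Huggingface `datasets` might look like the sketch below. The dataset identifier is an assumption, not taken from the repo; `./amazon.py` is the authoritative preprocessing script:

```python
# Illustrative only: the repository's ./amazon.py is authoritative.
import sys
from datasets import load_dataset

cache_path, amazon2_save_path, amazon5_save_path = sys.argv[1], sys.argv[2], sys.argv[3]

# Amazon-2 (binary polarity) is available on the Huggingface hub as "amazon_polarity".
amazon2 = load_dataset("amazon_polarity", cache_dir=cache_path)
amazon2.save_to_disk(amazon2_save_path)

# Amazon-5 (five-star ratings) would be prepared analogously from the 5-class review data;
# the exact source dataset used by ./amazon.py is not shown here.
```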
For token classification (NER), we use `./run_ner_narrowbert.py`, which is adapted from the Huggingface example `run_ner.py`. To run it:

```
python ./run_ner_narrowbert.py ./config_ner.json
```
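The NER config follows the same pattern, with fields mirroring `run_ner.py`'s arguments. An illustrative sketch with placeholder values (the provided `./config_ner.json` is authoritative, and `conll2003` is an assumption about the dataset):

```json
{
  "model_name_or_path": "./mlm_output",
  "dataset_name": "conll2003",
  "per_device_train_batch_size": 32,
  "learning_rate": 2e-5,
  "num_train_epochs": 3,
  "do_train": true,
  "do_eval": true,
  "output_dir": "./ner_output"
}
```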