Alexander Selivanov, Kristina Ivanova, Lucy Airapetyan, Vitaly Protasov.
- Python 3.6+
- PyTorch 1.4+
- transformers
- We reimplemented the original article: code2vec by U. Alon et al.
- We improved the F1-score on the test split of the java14m dataset; here you can find the dataset (the metric we report is sketched right after this list).
- The weights of both models can be found here.
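The F1 we report is presumably the subtoken-level F1 used in the original code2vec evaluation; below is a minimal sketch of that metric, assuming predicted and target names are given as `|`-separated subtokens (the function names and aggregation details are our own, not code from this repository):

```python
def subtoken_counts(predicted: str, target: str):
    """True positives, false positives and false negatives over method-name subtokens,
    e.g. "get|file|name" vs "get|name" -> tp=2, fp=1, fn=0."""
    pred = set(predicted.split('|'))
    true = set(target.split('|'))
    tp = len(pred & true)
    return tp, len(pred) - tp, len(true) - tp


def dataset_f1(pairs):
    """Aggregate the counts over the whole dataset, then compute precision/recall/F1."""
    tp = fp = fn = 0
    for predicted, target in pairs:
        p_tp, p_fp, p_fn = subtoken_counts(predicted, target)
        tp, fp, fn = tp + p_tp, fp + p_fp, fn + p_fn
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    return 2 * precision * recall / (precision + recall) if precision + recall else 0.0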
| Best F1-scores | Our implementation | U. Alon et al. | With BERT |
|---|---|---|---|
| Batch size 128, Test | 0.17671 | 0.1752 | 0.1689 |
| Batch size 128, Validation | 0.20213 | - | 0.17341 |
| Batch size 1024, Test | 0.16372 | - | - |
| Batch size 1024, Validation | 0.1887 | - | - |
- We also applied a BERT architecture in place of the attention layer from the original article; the results are shown in the "With BERT" column of the table above.
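A minimal sketch of this idea, using `BertModel` from the `transformers` library as an encoder over the path-context embeddings; the class name, dimensions, layer counts, and mean-pooling choice below are assumptions rather than the exact configuration used in this repository:

```python
import torch
from torch import nn
from transformers import BertConfig, BertModel

class BertOverPathContexts(nn.Module):
    """Encode a bag of path-context embeddings with a small BERT encoder
    instead of a single attention layer, then predict the method name."""
    def __init__(self, context_dim=384, hidden_size=384, num_names=10_000):
        super().__init__()
        config = BertConfig(hidden_size=hidden_size,
                            num_hidden_layers=2,
                            num_attention_heads=6,
                            intermediate_size=4 * hidden_size)
        self.encoder = BertModel(config)
        self.project = nn.Linear(context_dim, hidden_size)
        self.classifier = nn.Linear(hidden_size, num_names)

    def forward(self, context_embeddings, attention_mask):
        # context_embeddings: (batch, max_contexts, context_dim)
        # attention_mask:     (batch, max_contexts), 1 for real contexts, 0 for padding
        hidden = self.encoder(inputs_embeds=self.project(context_embeddings),
                              attention_mask=attention_mask).last_hidden_state
        # mean-pool the non-padded context representations into a single code vector
        mask = attention_mask.unsqueeze(-1).float()
        code_vector = (hidden * mask).sum(dim=1) / mask.sum(dim=1).clamp(min=1)
        return self.classifier(code_vector)  # logits over the method-name vocabulary
```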
- First of all, you can open the IPython notebook in Colab via the button above: just run all cells.
- Alternatively, without the notebook, from the console:
```bash
git clone https://github.com/Vitaly-Protasov/DL_project_skoltech
cd DL_project_skoltech
./download_data.sh
python3 to_train_article_model.py
```
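For context, here is a rough sketch of the code2vec-style attention model from the original article that this script reimplements; the class name, embedding sizes, and vocabulary sizes are placeholders, not the exact implementation in the repository:

```python
import torch
from torch import nn

class Code2VecAttention(nn.Module):
    """code2vec-style model: embed (source, path, target) triples, combine them into
    context vectors, attend over all contexts, and classify the method name."""
    def __init__(self, token_vocab, path_vocab, name_vocab, dim=128):
        super().__init__()
        self.token_emb = nn.Embedding(token_vocab, dim)
        self.path_emb = nn.Embedding(path_vocab, dim)
        self.combine = nn.Linear(3 * dim, dim)          # fully connected layer + tanh
        self.attention = nn.Linear(dim, 1, bias=False)  # one attention score per context
        self.out = nn.Linear(dim, name_vocab)

    def forward(self, source, path, target, mask):
        # source/path/target: (batch, max_contexts) index tensors; mask: 1 for real contexts
        ctx = torch.cat([self.token_emb(source),
                         self.path_emb(path),
                         self.token_emb(target)], dim=-1)
        ctx = torch.tanh(self.combine(ctx))                        # (batch, contexts, dim)
        scores = self.attention(ctx).squeeze(-1)                   # (batch, contexts)
        scores = scores.masked_fill(mask == 0, float('-inf'))      # ignore padded contexts
        weights = torch.softmax(scores, dim=-1).unsqueeze(-1)
        code_vector = (weights * ctx).sum(dim=1)                   # (batch, dim)
        return self.out(code_vector)                               # logits over method names
```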
- Install the transformers library, which we use for the BERT model:
```bash
pip3 install transformers
```
- Run the Python training script:
```bash
python3 to_train_bert.py
```
The main parameters to vary are the batch size of the training and validation datasets, and the learning rate and weight decay of the optimization algorithm.
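As an illustration, these hyperparameters would typically be wired into PyTorch data loaders and an Adam optimizer roughly as follows; the function and its arguments are placeholders, not the actual interface of the training scripts:

```python
from torch.optim import Adam
from torch.utils.data import DataLoader

def make_training_objects(model, train_dataset, valid_dataset,
                          batch_size=128, lr=1e-3, weight_decay=1e-4):
    """Build the loaders and optimizer from the hyperparameters mentioned above."""
    train_loader = DataLoader(train_dataset, batch_size=batch_size, shuffle=True)
    valid_loader = DataLoader(valid_dataset, batch_size=batch_size)
    optimizer = Adam(model.parameters(), lr=lr, weight_decay=weight_decay)
    return train_loader, valid_loader, optimizer
```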
Here you can see how our models predict method names.
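For reference, a minimal sketch of how the top-k predicted names can be read off the output logits of either model; the `index_to_name` vocabulary mapping is hypothetical:

```python
import torch

def predict_names(logits, index_to_name, k=5):
    """Return the k most probable method names (with probabilities) for each example."""
    probabilities = torch.softmax(logits, dim=-1)
    top_probs, top_indices = probabilities.topk(k, dim=-1)
    return [[(index_to_name[i.item()], p.item()) for i, p in zip(idxs, probs)]
            for idxs, probs in zip(top_indices, top_probs)]
```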