The main repo can be found at https://github.com/PUSSYMIPT/bert-distillation.
- distributed training
- logging with TensorBoard, wandb, Neptune, Alchemy, and others
- fp16 training
- various losses and loss aggregation (see the sketch after this list)
- initialization with the teacher's layers
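The distillation loss here is typically a weighted mix of terms. A minimal sketch, not the repo's exact code (the function name, weights, and temperature are illustrative), of a DistilBERT-style combination of soft-target KL divergence and hard-target masked-LM cross-entropy:

```python
import torch.nn.functional as F


def distillation_loss(student_logits, teacher_logits, labels,
                      temperature=2.0, alpha_kl=5.0, alpha_mlm=2.0):
    # Soft targets: KL divergence between temperature-scaled distributions,
    # rescaled by T^2 to keep gradient magnitudes comparable.
    kl = F.kl_div(
        F.log_softmax(student_logits / temperature, dim=-1),
        F.softmax(teacher_logits / temperature, dim=-1),
        reduction="batchmean",
    ) * temperature ** 2
    # Hard targets: masked-LM cross-entropy (positions labeled -100 are ignored).
    mlm = F.cross_entropy(
        student_logits.view(-1, student_logits.size(-1)),
        labels.view(-1),
        ignore_index=-100,
    )
    return alpha_kl * kl + alpha_mlm * mlm
```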
I initialize the student with encoder layers [0, 2, 4, 7, 9, 11] of the teacher model, as shown in the sketch below.
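A minimal sketch of that initialization, assuming HuggingFace transformers' `BertModel` and the `DeepPavlov/rubert-base-cased` checkpoint for RuBERT (the repo's own code may differ):

```python
from transformers import BertConfig, BertModel

teacher = BertModel.from_pretrained("DeepPavlov/rubert-base-cased")

# Build a 6-layer student with the same vocab/hidden sizes as the teacher.
student_config = BertConfig.from_pretrained(
    "DeepPavlov/rubert-base-cased", num_hidden_layers=6
)
student = BertModel(student_config)

# Copy the embeddings and the selected teacher encoder layers.
layers_to_copy = [0, 2, 4, 7, 9, 11]
student.embeddings.load_state_dict(teacher.embeddings.state_dict())
for student_idx, teacher_idx in enumerate(layers_to_copy):
    student.encoder.layer[student_idx].load_state_dict(
        teacher.encoder.layer[teacher_idx].state_dict()
    )
```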
I ran my script for 100 hours on 4x1080Ti GPUs with the RuBERT model as the teacher. Logs can be found here. I distilled it on the Lenta Russian news dataset.
Then I ran a classification task on the Mokoron Twitter dataset.
Here are my results:
My models can be found here.
Also, probably soon, I will publish a post about this project on Medium (in the PyTorch blog). Here is a draft link. Thanks to Sergey Kolesnikov from the catalyst-team for the promotion.
Feel free to propose something new for this project.
bin
- bash files for running pipelines
configs
- just place configs here
docker
- project Docker files for pure reproducibility
presets
- datasets, notebooks, etc - all you don't need to push to git
requirements
- different project python requirements for docker, tests, CI, etc
scripts
- data preprocessing scripts, utils, everything like `python scripts/*.py`
serving
- microservices, etc - production
src
- model, experiment, etc - research
```bash
git clone https://github.com/PUSSYMIPT/bert-distillation.git
cd bert-distillation
pip install -r requirements/requirements.txt
bin/download_lenta.sh
python scripts/split_dataset.py --small
catalyst-dl run -C configs/config_ru_ranger.yml --verbose --distributed
```
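When training finishes, the student can be loaded back for downstream tasks. A hedged sketch, assuming catalyst's default checkpoint layout (a `model_state_dict` key), a hypothetical `logs/` logdir, and a plain 6-layer `BertModel` student:

```python
import torch
from transformers import BertConfig, BertModel

config = BertConfig.from_pretrained(
    "DeepPavlov/rubert-base-cased", num_hidden_layers=6
)
student = BertModel(config)

# The path and state-dict key follow catalyst conventions; adjust to your logdir.
checkpoint = torch.load("logs/checkpoints/best.pth", map_location="cpu")
student.load_state_dict(checkpoint["model_state_dict"])
student.eval()
```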
It will take a lot of time. "Let's go get some drinks."