[GKD] add ULD type loss to GKD Trainer #2263

kashif · 2024-10-22T18:08:48Z

What does this PR do?

Add ULD loss to GKD trainer that doesn't require the student/teacher to have the same vocab by optimal transporting the prob from the teacher to student

docs/source/gkd_trainer.md

Co-authored-by: Quentin Gallouédec <[email protected]>

HuggingFaceDocBuilderDev · 2024-10-24T13:49:27Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

qgallouedec · 2024-11-23T12:48:02Z

trl/trainer/gkd_trainer.py

        with torch.no_grad():
            outputs_teacher = self.teacher_model(
-                input_ids=inputs["input_ids"],


Why do you use a different input ids here?

qgallouedec · 2024-11-23T12:50:57Z

Looks good overall. Feel free to request a final review from me when you think it's ready to be merged

qgallouedec · 2024-11-23T12:52:44Z

examples/scripts/gkd.py

@@ -42,6 +42,21 @@
    --use_peft \
    --lora_r 64 \
    --lora_alpha 16
+
+# ULD
+python examples/scripts/gkd.py \


Where is the argument related to ULD?

kashif added 6 commits October 14, 2024 12:43

initial uld loss

0cbbbe7

Merge remote-tracking branch 'upstream/main' into uld

c8d49ba

remove the beta from the loss

34520c3

fix comments

7c5502c

align masks

4eb781f

add doc

3b22fd7

qgallouedec reviewed Oct 24, 2024

View reviewed changes

docs/source/gkd_trainer.md Outdated Show resolved Hide resolved

kashif and others added 2 commits October 24, 2024 15:40

Update docs/source/gkd_trainer.md

e0b006a

Co-authored-by: Quentin Gallouédec <[email protected]>

Merge branch 'main' into uld

f8438ea

kashif added 4 commits October 25, 2024 17:32

Merge branch 'main' into uld

14418e7

Merge branch 'main' into uld

82e3568

Update gkd_trainer.md

bb6ccc8

fix docs

69415e0

kashif mentioned this pull request Nov 5, 2024

Add Sequence-Level KD #2220

Merged

5 tasks

kashif added 7 commits November 5, 2024 11:25

Merge branch 'main' into uld

bda5922

use teacher prompts and mask for generate_on_policy_outputs

34d532b

Merge remote-tracking branch 'refs/remotes/origin/uld' into uld

dc064f3

labels are not optional

cc26568

Merge branch 'main' into uld

2d359cd

Merge branch 'main' into uld

34011b7

add tests

f3f633a

qgallouedec reviewed Nov 23, 2024

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[GKD] add ULD type loss to GKD Trainer #2263

[GKD] add ULD type loss to GKD Trainer #2263

kashif commented Oct 22, 2024

HuggingFaceDocBuilderDev commented Oct 24, 2024

qgallouedec Nov 23, 2024

qgallouedec commented Nov 23, 2024

qgallouedec Nov 23, 2024

[GKD] add ULD type loss to GKD Trainer #2263

Are you sure you want to change the base?

[GKD] add ULD type loss to GKD Trainer #2263

Conversation

kashif commented Oct 22, 2024

What does this PR do?

HuggingFaceDocBuilderDev commented Oct 24, 2024

qgallouedec Nov 23, 2024

Choose a reason for hiding this comment

qgallouedec commented Nov 23, 2024

qgallouedec Nov 23, 2024

Choose a reason for hiding this comment