Skip to content

Actions: stanford-crfm/levanter

CI with GCP TPU

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
675 workflow runs
675 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

fix for token bug that skips EOS
CI with GCP TPU #584: Pull request #815 synchronize by ahmeda14960
November 20, 2024 23:03 22m 39s fix_sft_eos
November 20, 2024 23:03 22m 39s
jit and batch supervised data loading to speed it up (a lot)
CI with GCP TPU #583: Pull request #816 opened by dlwh
November 20, 2024 22:54 21m 55s batched_supervised
November 20, 2024 22:54 21m 55s
fix for token bug that skips EOS
CI with GCP TPU #582: Pull request #815 opened by ahmeda14960
November 20, 2024 22:17 22m 42s fix_sft_eos
November 20, 2024 22:17 22m 42s
Update test_optimizer_config.py
CI with GCP TPU #581: Pull request #814 synchronize by blahBlahhhJ
November 20, 2024 18:46 21m 54s blahBlahhhJ-patch-1
November 20, 2024 18:46 21m 54s
Update test_optimizer_config.py
CI with GCP TPU #580: Pull request #814 opened by blahBlahhhJ
November 20, 2024 18:45 22m 14s blahBlahhhJ-patch-1
November 20, 2024 18:45 22m 14s
Fix lr schedule
CI with GCP TPU #579: Pull request #813 opened by blahBlahhhJ
November 20, 2024 17:22 21m 53s blahBlahhhJ-patch-1
November 20, 2024 17:22 21m 53s
Merging DiVA to Levanter Main
CI with GCP TPU #578: Pull request #779 synchronize by Helw150
November 20, 2024 02:22 21m 29s will/diva-merge
November 20, 2024 02:22 21m 29s
Adds Short-Context Qwen Support
CI with GCP TPU #577: Pull request #812 synchronize by Helw150
November 20, 2024 02:17 22m 4s will/qwen
November 20, 2024 02:17 22m 4s
Adds Short-Context Qwen Support
CI with GCP TPU #576: Pull request #812 opened by Helw150
November 20, 2024 02:08 10m 2s will/qwen
November 20, 2024 02:08 10m 2s
document WSD-S stuff, add cycles
CI with GCP TPU #575: Pull request #811 synchronize by dlwh
November 20, 2024 01:37 21m 54s wsds_redo
November 20, 2024 01:37 21m 54s
document WSD-S stuff, add cycles
CI with GCP TPU #574: Pull request #811 synchronize by dlwh
November 19, 2024 05:54 22m 3s wsds_redo
November 19, 2024 05:54 22m 3s
document WSD-S stuff, add cycles
CI with GCP TPU #573: Pull request #811 synchronize by dlwh
November 19, 2024 05:27 21m 49s wsds_redo
November 19, 2024 05:27 21m 49s
document WSD-S stuff, add cycles
CI with GCP TPU #572: Pull request #811 synchronize by dlwh
November 19, 2024 00:28 22m 0s wsds_redo
November 19, 2024 00:28 22m 0s
document WSD-S stuff, add cycles
CI with GCP TPU #571: Pull request #811 opened by dlwh
November 19, 2024 00:26 22m 49s wsds_redo
November 19, 2024 00:26 22m 49s
cache fixes
CI with GCP TPU #570: Pull request #810 opened by dlwh
November 19, 2024 00:25 21m 8s last_cache_fixes
November 19, 2024 00:25 21m 8s
Wsds redo
CI with GCP TPU #569: Pull request #809 synchronize by dlwh
November 18, 2024 19:18 22m 28s wsds_redo
November 18, 2024 19:18 22m 28s
Wsds redo
CI with GCP TPU #568: Pull request #809 opened by dlwh
November 18, 2024 19:17 21m 11s wsds_redo
November 18, 2024 19:17 21m 11s
remove unnecessary assertion in tokenization
CI with GCP TPU #566: Pull request #807 opened by dlwh
November 18, 2024 05:25 20m 31s wsds_redo
November 18, 2024 05:25 20m 31s
deprecate LMSupervisedDataConfig, but keep it working for now
CI with GCP TPU #565: Pull request #806 opened by dlwh
November 17, 2024 19:18 20m 53s old_supervised
November 17, 2024 19:18 20m 53s
Use haliax state dict
CI with GCP TPU #564: Pull request #805 synchronize by dlwh
November 17, 2024 03:56 22m 17s use_haliax_state_dict
November 17, 2024 03:56 22m 17s
Use haliax state dict
CI with GCP TPU #563: Pull request #805 synchronize by dlwh
November 17, 2024 03:43 10m 29s use_haliax_state_dict
November 17, 2024 03:43 10m 29s
Use haliax state dict
CI with GCP TPU #562: Pull request #805 synchronize by dlwh
November 16, 2024 08:34 20m 48s use_haliax_state_dict
November 16, 2024 08:34 20m 48s
Use haliax state dict
CI with GCP TPU #561: Pull request #805 synchronize by dlwh
November 16, 2024 08:29 21m 6s use_haliax_state_dict
November 16, 2024 08:29 21m 6s
Use haliax state dict
CI with GCP TPU #560: Pull request #805 synchronize by dlwh
November 16, 2024 08:26 21m 0s use_haliax_state_dict
November 16, 2024 08:26 21m 0s