generated from fastai/nbdev_template
-
Notifications
You must be signed in to change notification settings - Fork 1.2k
Pull requests: huggingface/trl
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
allow masking on consecutive messages with same roles
#2000
opened Aug 31, 2024 by
lsy641
Loading…
4 of 5 tasks
feat: add support for packing tokenized datasets
#2011
opened Sep 3, 2024 by
kmehant
Loading…
3 of 5 tasks
DPO trainer supports num_logits_to_keep to save memory
🏋 DPO
Related to DPO
#2129
opened Sep 26, 2024 by
xyangk
Loading…
3 of 5 tasks
Remove graph breaks for torch.compile() in padding free branch in DataCollatorForCompletionOnlyLM
🐛 bug
Something isn't working
🏋 SFT
Related to SFT
#2158
opened Oct 3, 2024 by
Abhishek-TAMU
Loading…
1 of 5 tasks
[CGPO] Mixture of judges
👨⚖️ judge
Related to judges
#2159
opened Oct 3, 2024 by
gaetanlop
Loading…
4 tasks done
Change KTO tokenization to use DPO's
🏋 KTO
Related to KTO
#2187
opened Oct 6, 2024 by
kawine
Loading…
fixed: OverflowError: out of range integral type conversion attempted
#2206
opened Oct 9, 2024 by
himanshushukla12
Loading…
1 of 5 tasks
Remove ds_config scheuduler params to prevent deepseed from creating scheduler for ref_model
#2224
opened Oct 11, 2024 by
Ben-Schneider-code
Loading…
2 of 5 tasks
[SFT VLM] Added support for Molmo models via standalone script
sft_vlm_molmo
#2236
opened Oct 15, 2024 by
sergiopaniego
Loading…
2 of 5 tasks
Previous Next
ProTip!
Type g p on any issue or pull request to go back to the pull request listing page.