Skip to content

Navigation Menu

Explore
By size
By industry
By use case
Topics
- AI
- DevOps
- Security
- Software Development
- View all
Explore
- GitHub Sponsors
  Fund open source developers
- The ReadME Project
  GitHub community articles
Repositories
- Enterprise platform
  AI-powered developer platform
Available add-ons
Pricing

Search code, repositories, users, issues, pull requests...

Search

Clear

Search syntax tips

Provide feedback

We read every piece of feedback, and take your input very seriously.

Include my email address so I can be contacted

Saved searches

Use saved searches to filter your results more quickly

Name

Query

To see all available qualifiers, see our documentation.

You signed in with another tab or window. Reload to refresh your session. You signed out in another tab or window. Reload to refresh your session. You switched accounts on another tab or window. Reload to refresh your session.

Dismiss alert

huggingface / trl Public

generated from fastai/nbdev_template

Notifications You must be signed in to change notification settings
Fork 1.2k
Star 9.5k

Code
Issues 78
Pull requests 17
Discussions
Actions
Projects
Security
Insights

Additional navigation options

Code
Issues
Pull requests
Discussions
Actions
Projects
Security
Insights

Pull requests: huggingface/trl

Labels 17 Milestones 0

Labels 17 Milestones 0

New pull request New

17 Open 985 Closed

17 Open 985 Closed

Author

Filter by author

Loading

Label

Filter by label

Loading

Use alt click/return to exclude labels

or ⇧ click/return for logical OR

Projects

Filter by project

Loading

Milestones

Filter by milestone

Loading

Reviews

Filter by reviews

No reviews Review required Approved review Changes requested

Assignee

Filter by who’s assigned

Assigned to nobody

Loading

Sort

Sort by

Newest Oldest Most commented Least commented Recently updated Least recently updated Best match

Most reactions

Pull requests list

Conversational dataset support for DPOTrainer

#2131 opened Sep 26, 2024 by qgallouedec • Draft

5 tasks

1

DPO trainer supports num_logits_to_keep to save memory

#2129 opened Sep 26, 2024 by xyangk

Loading…

3 of 5 tasks

3

[DRAFT] Process-supervised RM Trainer

#2127 opened Sep 26, 2024 by gaetanlop

Loading…

5 tasks done

3

[SCoRE] initial score stage 1

#2115 opened Sep 24, 2024 by kashif • Draft

1

Fix RLOO checkpointing

#2114 opened Sep 24, 2024 by bartoszzuk

Loading…

7

Default dataset_text_field to "text"

#2078 opened Sep 18, 2024 by qgallouedec • Draft

5 tasks

1

Remove deprecated args in trainers

#2036 opened Sep 8, 2024 by qgallouedec • Draft

5 tasks

4

feat: add support for packing tokenized datasets

#2011 opened Sep 3, 2024 by kmehant

Loading…

2 of 5 tasks

1

5

allow masking on consecutive messages with same roles

#2000 opened Aug 31, 2024 by lsy641

Loading…

4 of 5 tasks

feat: start working on revisions instead of preferences

#1999 opened Aug 30, 2024 by KarelDO • Draft

3 tasks

added initial TPO implementation

#1965 opened Aug 24, 2024 by sahsaeedi

Loading…

4 of 5 tasks

7

[GRPO] initial GRPO trainer

#1954 opened Aug 21, 2024 by saisurbehera • Draft

3

Add SRPO algorithm.

#1772 opened Jun 25, 2024 by frasermince

Loading…

1 of 7 tasks

23

Add simplified version of BCO loss

#1731 opened Jun 13, 2024 by Trangle

Loading…

1

Adding SimPO to TRL

#1725 opened Jun 11, 2024 by yumeng5

Loading…

23

Prototype Dataset Processor

#1646 opened May 16, 2024 by vwxyzjn • Draft

8

[DRAFT] Vllm integration

#1628 opened May 7, 2024 by vwxyzjn • Draft

4

ProTip! no:milestone will show everything without a milestone.

Footer

© 2024 GitHub, Inc.

Footer navigation

Terms
Privacy
Security
Status
Docs
Contact

You can’t perform that action at this time.