Issues: allenai/RL4LMs

Issues list

Top-K and Top-p sampling
#7 opened Oct 19, 2022 by boblee22
BART supervised
#10 opened Nov 4, 2022 by talent404
100% likely that two function parameters have been merged by accident (labels: code enhancement, good first issue)
#16 opened Nov 29, 2022 by JulesGM
Implementing self-play
#18 opened Dec 11, 2022 by eublefar
Off-policy RL algorithms support (labels: enhancement, help wanted)
#23 opened Dec 20, 2022 by Div99
Reproducing IMDB results
#28 opened Dec 30, 2022 by mnoukhov
Mix-Precision training
#29 opened Dec 31, 2022 by lovodkin93
Problem with BLEURT reward function
#34 opened Jan 18, 2023 by eublefar
Persistent Variance in IMDB
#37 opened Feb 2, 2023 by mnoukhov
Metric version incompatible
#42 opened Mar 6, 2023 by c-box
Bloom Supporting
#44 opened Mar 13, 2023 by c-box
[Question] End-to-end example
#51 opened Apr 4, 2023 by farrokhsiar