Subscribe
Sign in
Share this post
Reinforced
Reward Modeling for RLHF
Copy link
Facebook
Email
Notes
More
Reward Modeling for RLHF
Alex Nikulkov
Jan 10, 2024
1
Share this post
Reinforced
Reward Modeling for RLHF
Copy link
Facebook
Email
Notes
More
An introduction to reward models
Read →
Comments
Share
Copy link
Facebook
Email
Notes
More
This site requires JavaScript to run correctly. Please
turn on JavaScript
or unblock scripts
Share this post
Reward Modeling for RLHF
Share this post
An introduction to reward models