New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

Sign up for GitHub

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Jump to bottom

Chapter Plans #6

Open

natolambert opened this issue Jun 1, 2024 · 0 comments

Labels

Owner

natolambert commented Jun 1, 2024 •

edited

Loading

Here is a rough outline of what I would like to see in the book, and who will be writing it.

Introductions & History

Introduction
Economics, Psychology, Philosophy of preference, etc.: VNM Theory, Bradley Terry, Impossibility theorems, social choice, etc
Optimal Control, Deep RL, ML etc.
RLHF for LLM lit (pre chatgpt stuff), maybe summarize instrugpt

Links:

https://arxiv.org/abs/2310.13595

Problem Specification

Definitions, basic stuff, math
Preference data collection
Preference model training
KL constraints and other penalties

Policy Optimization

IFT / SFT / Chat Templates
Rejection Sampling / Best of N
PPO, REINFORCE, Policy Gradient
DPO (Eric, Archit, Rafael)
Other variants (short)

Advanced (optional)

CAI
Synthetic vs human data
Evaluation

Open Questions (TBD / optional)

Reward model over-optimization

natolambert added the documentation label

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment