Add loss_fn_config #156

maitchison · 2025-12-08T21:22:41Z

Some of the loss functions have configuration settings that are currently not accessible.

This PR adds an optional loss_fn_config the RL and distillation training steps and forward_backward calls.

SFT is hardcoded to cross_entropy and so was not updated.

Added optional loss_fn_config to rl.train.Config
For any training script that have a loss_fn, I've also added loss_fn_config.
Added explicit argument names to some function calls.

Matthew Aitchison added 3 commits December 9, 2025 09:57

add loss_fn_config

73a2428

explicit argument names

8acf967

added loss_fn_config to any CLIConfigs that had loss_fn

4253706

maitchison changed the title ~~Matthew/add loss fn~~ Add loss_fn_config Dec 8, 2025

Provide feedback