Sean13
/
llama-8b-instruct-rdpo-full-multipref-0.90

Model card Files Files and versions
xet
Metrics Training metrics Community