Post by @leading • Hey
The safety RLHF improved Llama 2 according to reward model score distributions
Stats
Actions: 0
Comments: 0
Likes: 17
Mirrors: 0
Quotes: 0
Comments