Post by @sunsetjesus • Hey

...and then finetune on top of that general-purpose initialization with very efficient online RL that can learn at real-time speeds by leveraging an effect

Stats

Comments