Post by @sunsetjesus • Hey
Performance on training games is very good, even from highly suboptimal data. With near optimal data, this outperforms non-Q-learning methods (e.g., BC, d
Stats
Actions: 0
Comments: 0
Likes: 2
Mirrors: 3
Quotes: 0
Comments