www.reddit.com/r/reinforcementlearning/comments/nc2gx9/what_is_currently_the_best_sota_rl_algorithm_for/
Top Highlights
PPO is a solid choice for most discrete action settings.