Spyke
DeepSearch: Overcome the Bottleneck of Reinforcement Learning with Verifiable Rewards via Monte Carlo Tree Search