1 min readfrom Machine Learning

[P] gumbel-mcts, a high-performance Gumbel MCTS implementation

Hi folks,

Over the past few months, I built an efficient MCTS implementation in Python/numba.

https://github.com/olivkoch/gumbel-mcts

As I was building a self-play environment from scratch (for learning purposes), I realized that there were few efficient implementation of this algorithm.

I spent a lot of time validating it against a golden standard baseline.

My PUCT implementation is 2-15X faster than the baseline while providing the exact same policy.

I also implemented a Gumbel MCTS, both dense and sparse. The sparse version is useful for games with large action spaces such as chess.

Gumbel makes much better usage of low simulation budgets than PUCT.

Overall, I think this could be useful for the community. I used coding agents to help me along the way, but spent a significant amount of manual work to validate everything myself.

Feedback welcome.

submitted by /u/randomwalkin
[link] [comments]

Want to read more?

Check out the full article on the original site

View original article

Tagged with

#natural language processing for spreadsheets
#generative AI for data analysis
#rows.com
#Excel alternatives for data analysis
#self-service analytics tools
#machine learning in spreadsheet applications
#large dataset processing
#real-time data collaboration
#financial modeling with spreadsheets
#real-time collaboration
#self-service analytics
#big data performance
#Gumbel MCTS
#MCTS
#implementation
#Python
#numba
#self-play
#PUCT
#sparse