Comments/Ratings for a Single Item

I thought AlphaZero used the output of its NN for evaluating leaf nodes. That makes it different from 'normal' MCTS, which would randomly play out games until they satisfy a win or draw condition, and uses the statistics of such 'rollouts' as a measure for the winning probability in the leaf.
The NN outputs a probability distribution over all the possible moves (illegal moves are set to 0 and the probabilities sum to 1). The MTCS call for this distribution, combine it with an exploration coefficient and Dirichlet noise, to form a score, and choses a move to expand until a certain number (in chess is 6000) of nodes have been visited. Nodes are expanded until leaf nodes are explored. This link explains it better than I:

That describes the 'policy head' of the NN, which is used to bias the move choice (which is otherwise based on the number of visits of the move and that of the total for the node, and the move scores) when walking the tree from root to leaf for finding the next leaf to expand. But my understanding was that when the leaf is chosen and expanded, all daughters should receive a score from the 'evaluation head' of the NN in the position after the move, rather than just inheriting their policy weight from the position before the move. These scores are then back-propagated towards the root, by including them in the average score of all nodes in the path to the expanded leaf.
I do not understand "tgat". You probably meant that.

Indeed. Hard to avoid typos on these virtual keyboards of Android devices... I corrected it.
I'm not sure I understand what you say, HG!
What is not understandable? Typo! What else.
By the way, details about programming are not clear for most people. How to deal with it?
I did not meant the typo!

By the way, details about programming are not clear for most people. How to deal with it?
I don't think there is any "fix" to this issue. I am not sure there is any issue at all. Some conversations are going to involve things other people don't understand. That said, the talkchess forums are the usual place for these kinds of discussions, but I am happy to have some discussion here as well. Some people who are not chess programmers may still be interested in whether the new neural-network techniques being applied to orthodox chess can be applied to chess variants.
That sounds plausible.
11 comments displayed
Permalink to the exact comments currently displayed.
Thanks Greg, My conundrum comes from the definition of leaf nodes. In the traditional way you apply the evaluation function, but in the MCTS of Alpha zero are only when the endgame conditions apply.