Git Product home page Git Product logo

Comments (11)

CezCz avatar CezCz commented on July 29, 2024

I found in previous commits that output used to be, but later due to log_likelihood_cost refactor got deleted.

output = tf.nn.softmax(tf.reshape(h_conv_final, [-1, go.N ** 2]) + b_conv_final)

from mugo.

brilee avatar brilee commented on July 29, 2024

Hm. Sorry about that - work on this repo is continuing at https://github.com/tensorflow/minigo. I'll update the README.md

from mugo.

CezCz avatar CezCz commented on July 29, 2024

from mugo.

JoeyQWu avatar JoeyQWu commented on July 29, 2024

@CezCz yeah, thanks for your kind answer,
actually, I fixed the line 88 with
" log_likelihood_cost = tf.reduce_sum(tf.nn.softmax_cross_entropy_with_logits(logits=logits, labels=y))" and it could work,
but I could not understand the output of mcts, why it often choose the bigger value even if it is negative ?
I am confused about the result , I would appreciate if you can tell me the reason @CezCz

from mugo.

CezCz avatar CezCz commented on July 29, 2024

from mugo.

JoeyQWu avatar JoeyQWu commented on July 29, 2024

1 just like the first picture , the location of white is R4, and I get the value is -7.5, just as the second image,
2
the another position is Q3,
3
and its value is 8.5,
4
and why do the white chose the R4 rather than Q3, the latter value is greater than the former value , I am just very confused about this, perhaps I do not understand the code ,or maybe this is a silly question , but I want strongly to know the reason and I am very grateful to you @CezCz , you are a very kind person and thank you very much !

from mugo.

CezCz avatar CezCz commented on July 29, 2024

from mugo.

JoeyQWu avatar JoeyQWu commented on July 29, 2024

Hi , @CezCz
So the next move is chosen just because the algorithm chooses the most visited move , and the value network backpropagated the visit count and the winner predicted, the positive value represents the current player wins this game , the next move is selected is not related to the value of value network, just related to the visit count , right ?

from mugo.

CezCz avatar CezCz commented on July 29, 2024

@JoeyQWu
The move that is chosen to be played in the actual game yes. Not to confuse with move chosen within selection phase - this one is chosen based on some sophisticated heuristic with exploration taken into consideration.
You may want to read:
https://jeffbradberry.com/posts/2015/09/intro-to-monte-carlo-tree-search/ - nice mcts introduction with examples
http://www.baeldung.com/java-monte-carlo-tree-search - simple monte carlo tree search implementation
https://deepmind.com/documents/119/agz_unformatted_nature.pdf - page 25-27 MCTS implementation within alphago zero (don't be confused about temperature parameter and parent visit count, these are just another parameters to promote exploration during training, but the core is visit count)

from mugo.

JoeyQWu avatar JoeyQWu commented on July 29, 2024

@CezCz
okay, I will read more to understand , thank you very much, you are so nice , very grateful to you for your help!

from mugo.

brilee avatar brilee commented on July 29, 2024

I also wrote http://www.moderndescartes.com/essays/deep_dive_mcts/ recently

from mugo.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.