In othello.tensorflow.OthelloNNet.py:
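The pi output of the model is defined as
self.pi = Dense(s_fc2, self.action_size) (line 36)
which is incorrect: with no activation, this appears to yield unnormalized logits rather than a probability distribution over actions. It should instead be
self.pi = tf.nn.softmax(Dense(s_fc2, self.action_size)).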
By contrast, the Keras implementation is correct, since it applies softmax in the output layer itself. In othello.keras.OthelloNNet.py:
self.pi = Dense(self.action_size, activation='softmax', name='pi')(s_fc2) (line 28).
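To illustrate why the missing softmax matters, here is a minimal sketch in plain Python (no TensorFlow needed). The `softmax` helper and the `logits` values are hypothetical and only stand in for the output of a linear policy head; they are not taken from the repository.

```python
import math

def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    m = max(logits)                      # subtract the max to avoid overflow
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]

# Hypothetical raw outputs of a linear layer for 4 actions (assumed values).
logits = [2.0, -1.0, 0.5, 0.0]
probs = softmax(logits)

# Raw logits can be negative and do not sum to 1,
# so they cannot be used directly as a policy distribution.
print("logits:", logits, "sum =", sum(logits))

# After softmax, every entry is in (0, 1) and the entries sum to 1.
print("probs: ", probs, "sum =", sum(probs))
```

If downstream code (for example, the tree search) treats pi as action probabilities, it would need the normalized values rather than the raw logits.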
Pull request: #215 (comment)