CMPUT 657 Class Notes

Extra material from class, usually done on the whiteboard.

Contents:

November 27 Notes
November 25 Notes
November 4 Notes
October 30 Notes
Project Ideas Notes (Oct 29)
October 23 Notes
October 9 Notes
October 7 Notes
October 2 Notes
September 18 Notes
September 16 Notes
September 4 Notes
September 2 Notes

November 27

The sum game played by economic rules in class, part 2:

More details in the spreadsheet that I shared

Tax rate 5
Balance -3.75
Sum game after 5 moves: G1 + G2LR + G3LR + G4 + G5R = {2|0} + 1 + 5 + G4 + {0 | -1 || -2 | -3}
G4 = -3|5 = 0 is Zugzwang, no one wants to play there (not even at tax = -1).
Martin to play next.

Day 2:

Martin passes. Class passes.
Rebid for lower tax rate: Martin 1. Class 1.5.
New tax rate 1.5, Class to play.
Move 6 (Class): G5R → G5RR = -2 | -3
Class pays tax 1.5. Balance -2.25
Martin passes. Class passes.
Rebid for lower tax rate: Martin 1. Class 1.
New tax rate 1, Martin wins tiebreak (coin toss), Martin to play.
Move 7 (Martin): G1 → G1L = 2
Martin pays tax 1. Balance -3.25
Class passes. Martin passes.
Rebid for lower tax rate: Martin 0.5. Class 0.5.
New tax rate 0.5, Class wins tiebreak (coin toss), Class to play.
Move 8 (Class): G5RR → G5RRR = -3
Class pays tax 0.5. Balance -2.75
All games are integers now. For lack of time we skip the step of playing out the integers (see below for how it would work).
Sum game now: G1L = 2, G2LR = 1, G3LR = 5, G4 = 0, G5RRR = -3
Total score = sum of integers + Balance.
2 + 1 + 5 + 0 + (-3) + (-2.75) = 2.25 >0, Martin wins

How to play out the integers: Continue the game after move 8 above.

Sum game now = 2 + 1 + 5 + 0 + (-3)
Balance -2.75, Martin to play
Martin passes. Class passes.
Rebid for lower tax rate: Martin -1. Class -1.
New tax rate -1, Martin wins tiebreak (coin toss), Martin to play.
Martin plays 2 → 1 (in G1), Tax -1, Balance -1.75.
Class plays -3 → -2 (in G5), Tax -1, Balance -2.75.
Martin plays 1 → 0 (in G1), Tax -1, Balance -1.75.
Class plays -2 → -1 (in G5), Tax -1, Balance -2.75.
Martin plays 1 → 0 (in G2), Tax -1, Balance -1.75.
Class plays -1 → 0 (in G5), Tax -1, Balance -2.75.
Sum game now: 0 + 0 + 5 + 0 + 0
Martin has 5 more moves: 5 → 4 → 3 → 2 → 1 → 0.
Class has no more moves.
Martin collects 5x tax, final balance = -2.75 + 1 + 1 + 1 + 1 + 1 = 2.25.

November 25

The sum game played by economic rules in class, so far:

Bid to play Black:
Class bids 7.5, Martin 10, average 8.75.
Martin bid more, so Martin = Black = Left = positive, and Class = White = Right = negative.
Martin pays 8.75 to class. Balance -8.75 (from Left's point of view)
Bid for tax rate, and to go first: Both teams bid 5. Tiebreak: class goes first. Current tax rate 5.
Move 1 (Class): G5 → G5R = 0 | -1 || -2 | -3
Class pays tax 5. Balance -3.75
Move 2 (Martin): G2 → G2L = 10|1
Martin pays tax 5. Balance -8.75
Move 3 (Class): G2L → G2LR = 1
Class pays tax 5. Balance -3.75
Move 4 (Martin): G3 → G3L = 20|5
Martin pays tax 5. Balance -8.75
Move 5 (Class): G3L → G3LR = 5
Class pays tax 5. Balance -3.75
Sum game after 5 moves: G1 + G2LR + G3LR + G4 + G5R = G1 + 1 + 5 + G4 + {0 | -1 || -2 | -3}
Martin to play next.

November 4

Constructing the thermographs of numbers 1/2, 1/4, 3/4 in subzero thermography.
Note: taxed left options in green, I did not have a blue pen handy...

October 30

Cooling games Examples, try them in CGSuite

G.Cool(t) applies a tax of t to game G, giving G_t

{6|-6}.Cool(3)
{6|-6}.Cool(1/2)
{6|-6}.Cool(-1/2)
{6|-6}.Cool(-1) // CGSuite does not like this...

Examples from slides, experiments with cooling them

4.Cool(1) // nothing happens to integers
(1/2).Cool(1/8) // nothing happens to fractions
// t(4|-4) = 4
{4|-4}.Cool(1)
{4|-4}.Cool(2)
{4|-4}.Cool(4)
{4|-4}.Cool(5)
//t(4|{−4|−10}) = 11/2
{4||-4|-10}.Cool(3)
{4||-4|-10}.Cool(5)
{4||-4|-10}.Cool(11/2)
{4||-4|-10}.Cool(6)
{4||-4|-10}.Cool(1000)
//t(4|{−4|−20}) = 8
{4||-4|-20}.Cool(6)
{4||-4|-20}.Cool(8)
//t(4|{−4|−100}) = 8
{4||-4|-100}.Cool(8)

More Thermograph Examples, try them in CGSuite

Most thermographs have a simple structure, just a few line segments plus a mast. However, you can create thermographs with arbitrary many line segments.

G := {2|-2} + {4|-4} + {6|-6} + {8|-8} + {10|-10} + {12|-12} + {14|-14}
G.Thermograph.Plot()
(G+G.Cool(1)).Thermograph.Plot()
H := {16|-16} + {18|-18} + {20|-20} + {22|-22} + {24|-24}
H.Thermograph.Plot()
(G+H).Thermograph.Plot()
// next one is a bit slow
(G+G.Cool(1) + H).Thermograph.Plot()
// next one took forever and I killed it with Menu-System-Kill calculation.
// It is the TG of the sum of all switches from +-1 to +- 24
(G+G.Cool(1) + H +H.Cool(1)).Thermograph.Plot()

Thermograph Construction Examples From Class

Thermograph Construction Examples from 2022

Thermograph of gote situation G = 4|0. Mean(G) = 2, temperature(G) = 2
In CGSuite: Explorer({4|0}), then click on Selection.CanonicalForm.Thermograph.Plot() in the drop down list in the bottom left, where it says "Select or enter...". CGSuite shows only the final result, not the construction steps.

Thermograph of one-sided sente G = 9||8|0. Mean(G) = 8, temperature(G) = 1.
Explorer({9||8|0})

Thermograph of double sente G = 14 | 2 ||| -2 || -14 | -16. Mean(G) = -1/4, temperature(G) = 33/4 = 8 1/4.
Explorer({14|2|||-2||-14|-16})
In CGSuite, you can click on the node (circle) in the top right, select "Expand sensible lines", then click on other nodes to see e.g. the thermographs of 14|2 and -2||-14|-16 that were used in the construction.

Thermograph of 1 || -2|-4.
Explorer({1||-2|-4})

Thermograph construction with two right options, G = 5 | G1, G2. LS for right: min(LS(G1_t), LS(G2_t)). The games G1 and G2 are such that at some temperatures, a move to G1 is better, and at other temperatures, a move to G2 is better. In the figure, the thermograph for G1 is in black, and G2 in red. At each t, the min of the two LS is shown in green. It follows red at low temperatures, then black, then red again. The min RS in this example is always from G2, and is also shown in green. But thi

Project Ideas Notes (Oct 29)

I wrote these notes in response to one student. I am sharing them here since they may be of more general interest.

Score-counting Games

There is a survey article Scoring games: the state of play by Urban Larsson, Richard J. Nowakowski and Carlos Pereira Dos Santos, in Games of No Chance 5. Personally, I know very little about the subject even though it started with Go...but I would be happy if you did such a project.

Move Ordering Heuristic

For full-board (single) games there is a lot of work, and almost all the more recent (15-20 years) papers use ML in some form. E.g. the policy function in AlphaGo etc., and many hand-made but computer-tuned heuristics before that. Also many statistics-based schemes.

There are also a few game-independent move ordering heuristics such as history heuristic for such games. A very basic kind of "learning".

Iterative deepening search and using the best move from the previous iteration is the standard move ordering in alphabeta. In MCGS, we do not have any numeric evaluation, only win-loss-unknown.

For neural networks for speeding up proofs, there is the work on Proof Cost Networks (PCN) by the group in Taiwan that we work with: AlphaZero-based Proof Cost Network to Aid Game Solving. Ti-Rong Wu, Chung-Chin Shih, Ting Han Wei, Meng-Yu Tsai, Wei-Yuan Hsu, I-Chen Wu.
poster
paper
I do not know if and how it can be adapted to sums, but it would be interesting to think about.

For combinatorial (sum) games in general I do not know too much about move ordering. It would be nice to have something general for MCGS, but where to start? The papers by our group (Taylor for Clobber and Henry Du for NoGo) talk about some move ordering strategies for specific games, such as play in the middle to split games early into smaller subgames.

There are at least two levels to this question for sums - 1. which subgame to play in, and 2. which move to play there.

My old paper with Zhichao Li Locally Informed Global Search... (on our reading list) shows that temperature is a super-strong heuristic for move ordering in search. The games in that paper have only one move in each subgame. We have some unpublished experiments with multiple moves per subgame, but the complexity of solving grows much more quickly.

My even older PhD thesis uses incentives for solving Go endgame puzzles, but mostly for the case where you do not need any global search. The are many cases where there is some (or even much) ordering between moves but it is not perfect, and we still need search. My PhD thesis used a very naive approach for that case. I worked on a better algorithm for Go about 3-4 years ago. It works, but it is embedded in a very old Go program, so hard to work with. A clean implementation of such algorithms, e.g. in MCGS, would be a good project.

Regarding ML and game rules, I do not see how to learn directly from rules, but I can see e.g. learning from sampling of (local? global?) game trees. There is a huge literature on "General Game Playing" where they try such things (again, not for the case of sum games afaik)

October 23

Pictures of Whiteboard

CGSuite for Examples from Class

Note: LeftStop and RightStop are what we called LeftScore and RightScore.

Sums-Incentives: 
Slide 3:
A := {1|-1}; B := {2|-2}; C := {3|-3}; D := {4|-4}
A.LeftIncentives // a set
A.LeftIncentives.Head // first element in set
D.LeftIncentives.Head >= A.LeftIncentives.Head // check dominated incentive
D.LeftIncentives.Head >= B.LeftIncentives.Head
D.LeftIncentives.Head >= C.LeftIncentives.Head
// these three together prove that D is the dominating incentive.
// Note that I only test the Head of a set since these sets only have
// one element. A complete solution would need to loop over all options.

Slide 7, three subgames example
G1 := {5||3|2}; G2 := {10|4||-2}; G3 := {7|6||1|0}
G := G1+G2+G3
G.LeftStop
G.RightStop
G.Options(Left) // note: it returns a set of one option; all others are dominated
// It is not so easy to tell which subgame they are from. But see the second example below.
G.Options(Right) // a set of two options

Slide 9, two subgames.
G1 := {5||3|2}; G2 := {10|4||-2}; G := G1+G2
G.LeftStop
G.RightStop
G1.LeftIncentives
G2.LeftIncentives
G.LeftIncentives // one dominating incentive. Only the one from G2
G1.RightIncentives
G2.RightIncentives
G.RightIncentives // G2 again has better incentive
G2.LeftIncentives.Head > G1.LeftIncentives.Head // better play in G2!

Examples for mean

4.Mean
{4|-4}.Mean
{6|-4}.Mean
{4||-4|-10}.Mean
{4||-4|-20}.Mean

October 9

Canonical Forms Get Large Quickly

CGSuite examples, try them out

{1|-1}
{2|-2}
{1|-1} + {2|-2}
{1|-1} + {2|-2} + {3|-3}
{1|-1} + {2|-2} + {3|-3} + {4|-4}
{1|-1} + {2|-2} + {3|-3} + {4|-4} + {5|-5}
{1|-1} + {2|-2} + {3|-3} + {4|-4} + {5|-5} + {6|-6}
{1|-1} + {2|-2} + {3|-3} + {4|-4} + {5|-5} + {6|-6} + {7|-7}
{1|-1} + {2|-2} + {3|-3} + {4|-4} + {5|-5} + {6|-6} + {7|-7} + {8|-8}
{1|-1} + {2|-2} + {3|-3} + {4|-4} + {5|-5} + {6|-6} + {7|-7} + {8|-8} + {9|-9}

Clobber examples:

game.grid.Clobber("OX").CanonicalForm
game.grid.Clobber("OXOXOX").CanonicalForm
game.grid.Clobber("OXOXOXOXOX").CanonicalForm
game.grid.Clobber("OXOXOXOXOXOXOX").CanonicalForm
game.grid.Clobber("OXOXOXOXOXOXOXOXOX").CanonicalForm
game.grid.Clobber("OXOXOXOXOXOXOXOXOXOXOX").CanonicalForm

Bug Fix

There was a mistake on slide 20 of "comparing games", that we hit at the end of last class: in the game G = {0, ∗,−1|−1, {1|−2}, {2|0}}, everything I wrote about comparing options is true. In particular, −1 and {1|−2} are incomparable. However, the final statement is wrong: Canonical form: G = {0, ∗|−1, {1|−2}}. Why is it wrong? It is because of the other simplification, reversible moves. In this case, Right should never move to {1|−2}, since Left can immediately answer and move to 1, which is greater than G itself (because the Left options from G are "only" 0 and *). So the canonical form of G is {0, * | -1}, without that reversible Right option.

To make the example work as intended, and keep {1|−2} in the canonical form, the left options need to be better, so I replaced them by 2, 2 + * and made a new version of the slides with this fix. The canonical form of G:= {2, 2+*, -1 | -1, {1|-2}, {2|0}} is indeed {2, 2* | -1, {1 | -2} }. This is the kind of thing that is very easy for humans to miss, but CGSuite catches it easily.

October 7

The "Outcome Diamond" and Comparing Games

The outcome diamond is the partial order between game outcomes shown above (Figure from Urban Larsson's paper). If you have to choose a game based on its outcome, then as Left you always choose outcome L over any game with outcome in N, P, R, and you choose any other outcome over R, while outcomes N and P are incomparable.

However, that does not mean that e.g. any game in L is better than any game in another class in terms of direct comparison. For example, consider the games G = 1, and H = 100 | 0. Clearly, G is in L and H is in N, since Right can win H by moving to 0. So if left had to choose a single game, Left should choose G since it is a sure win, while H is not. However, G and H themselves are incomparable. There are many sums where having the option to move to H is more valuable than moving to G. For example, with game K = 0 | -50, the sum G + K is in N, but H + K is in L.

Whiteboard

Thanks Abel!

October 2

MCGS analysis of Domineering game from class

White (students) is horizontal and goes first. This is with the "more-grid-games" branch of MCGS, which includes the rules of domineering.

Students played perfectly according to MCGS - all White to play positions are wins, and all Black to play positions are losses.

Moves (left square for White, top square for Black): 1. A2, 2. E5, 3. E4, 4. E1, 5. C2, 6. B5, 7. A4, 8. C5, 9.C4, 10. F2. Now horizontal has 4 more moves, and vertical only 3, so horizontal wins. Below are the calls to solve all game positions with MCGS.

./MCGS "[domineering] ......|......|......|......|......|...... {W}"
./MCGS "[domineering] ......|##....|......|......|......|...... {B}"
./MCGS "[domineering] ......|##....|......|......|....#.|....#. {W}"
./MCGS "[domineering] ......|##....|......|....##|....#.|....#. {B}"
./MCGS "[domineering] ....#.|##..#.|......|....##|....#.|....#. {W}"
./MCGS "[domineering] ....#.|#####.|......|....##|....#.|....#. {B}"
./MCGS "[domineering] ....#.|#####.|......|....##|.#..#.|.#..#. {W}"
./MCGS "[domineering] ....#.|#####.|......|##..##|.#..#.|.#..#. {B}"
./MCGS "[domineering] ....#.|#####.|......|##..##|.##.#.|.##.#. {W}"
./MCGS "[domineering] ....#.|#####.|......|######|.##.#.|.##.#. {B}"
./MCGS "[domineering] ....#.|######|.....#|######|.##.#.|.##.#. {W}"

Comments on Nim game from class

Let *n denote a nim heap with n stones. We played the game *3 + *4 + *5. A winning move is to move from *3 to *1, leaving the sum *1 + *4 + *5. This is a losing position (2nd player win)

A few more examples of losing positions: *n + *n (second player copies the first player's moves), *1 + *2 + *3.

There is an analysis of this game based on the binary representation of heap sizes, and bitwise XOR. We will talk about it in class soon.

September 18

Represent a 1x10 Q1game in 16 bits

Each cell has 3 states, can encode as numbers empty = 0, black = 1, white = 2
Whole board with n cells can be written as a base 3 number with n digits, e.g. n=10, board representation = 0100002000, corresponds to empty, black, empty, ...
0100002000 = 0*3^9 + 1*3^8 + ...+2*3^3
With 10 base 3 digits, can write all numbers from 0 to the largest: 2222222222 = 3^10 - 1
There are 3^n different base 3 numbers with n digits, each corresponds to a different board.
It takes k = ceil(log_2(3)n) bits to represent such numbers, where log_2(3) is the base 2 logarithm of 3, about 1.58496... k is the smallest number such that 2^k ≥ 3^n.

Bit-wise XOR

XOR rules: 0^0 = 0, 0^1 = 1, 1^0 = 1, 1^1 = 0

8-bit bitwise XOR
a = 01101001
b = 10001110
---------------------
c = 11100111
c = a xor b

Rule: x xor x = 0 for all x (you can check this for single bits)
Example: c xor b = 01101001 = a
Zobrist table for Q1game: code[10][3] = table of random (say 64 bit) numbers
Encoding Empty = E = 0, Black = B = 1, White = W = 2
Example: 1x5 board, initially empty EEEEE
board code = code[0][0] xor code[1][0] xor ...code[4][0]
play move to EEEBE: remove code[3][0], add code[3][1], both using xor

Hash collision

Hash collision: code(board1) = code(board2), but board1 != board2
Chance of hash collision: Two n-bit random codes, chance that they are the same = 2^(-n)
In a search we have m states, m can be large. Chance of no collision in the whole search: No pair of states s1, s2 has a collision. There are choose(m, 2) pairs of states. Probability of no collision: (1-2^-n)^choose(m, 2). We did some numeric examples. What error rate is acceptable?
For proof, we want 0 error. We can verify a probabilistic proof which depends on hashing. The verification should not rely on trusting hash codes.

September 16

See updates: Canvas discussions, Quiz 1 on Canvas, new materials on website
upcoming conference: Advances in Computer Games conference (ACG 2025), October 21-23. online, free registration
Illustration for Principal Variation (PV): Chess analysis with PV by strong chess program Sesse

September 4 Notes

Proof that XOXOXO is a second player win. Thanks Abel for the picture!
Solving 1x10 Clobber, XOXOXOXOXO, in three tree nodes: X can play to XOXOXO.XXO = XOXOXO + XXO. Since XOXOXO is a second player win, no player can profit from playing there. So we can ignore that part and focus on XXO. O to play has only one move, to XO.. From here, X to play can clobber the last O stone and win.
Picture - State space for 2x2 Clobber. Thanks Yuzhang!
We looked at two variants, a "naive" state space, and a smaller one where equivalent states (just differ by rotation) are merged into a single state, e.g. by defining a "normal form". Also, we discussed that we can simplify games with "zero subgames", as we did in simplifying XOXOXO + XXO to XXO.
Example of a large Go puzzle that can be solved by local search and CGT methods. Also, failure of full-board search to scale to large problems.

September 2 Notes

Clobber game from class, and analysis with MCGS and CGSuite. The student team played perfectly, and I had no chance!

Last update: Nov 28, 2025, Martin Müller