A brain bending battle of good vs. EVL

Generated at 22/07/2021, 04:35 from 1000 logged games.

Representative game (in the sense of being of mean length). Wherever you see the 'representative game' referred to in later sections, this is it!

EVL is a territory capture game played on an unusual board of heptagons (7-sided) and pentagons (5-sided), using a stackingand unstacking mechanism.

EVL forces are trying to invade you; deploy your forces and defend your territory!

The first player to capture 10 pentagons wins the game

General comments:

Play: Combinatorial

BGG Entry | EVL |
---|---|

BGG Rating | null |

#Voters | null |

SD | null |

BGG Weight | null |

#Voters | null |

Year | null |

Size (bytes) | 27165 |
---|---|

Reference Size | 10293 |

Ratio | 2.64 |

Ai Ai calculates the size of the implementation, and compares it to the Ai Ai implementation of the simplest possible game (which just fills the board). Note that this estimate may include some graphics and heuristics code as well as the game logic. See the wikipedia entry for more details.

Playouts per second | 9631.29 (103.83µs/playout) |
---|---|

Reference Size | 343867.13 (2.91µs/playout) |

Ratio (low is good) | 35.70 |

Tavener complexity: the heat generated by playing every possible instance of a game with a perfectly efficient programme. Since this is not possible to calculate, Ai Ai calculates the number of random playouts per second and compares it to the fastest non-trivial Ai Ai game (Connect 4). This ratio gives a practical indication of how complex the game is. Combine this with the computational state space, and you can get an idea of how strong the default (MCTS-based) AI will be.

Label | Its/s | SD | Nodes/s | SD | Game length | SD |
---|---|---|---|---|---|---|

Random playout | 15,291 | 148 | 869,345 | 8,100 | 57 | 11 |

search.UCT | 15,304 | 386 | 36 | 5 |

Random: 10 second warmup for the hotspot compiler. 100 trials of 1000ms each.

Other: 100 playouts, means calculated over the first 5 moves only to avoid distortion due to speedup at end of game.

Rotation (Half turn) lost each game as expected.

Reflection (X axis) lost each game as expected.

Reflection (Y axis) lost each game as expected.

Copy last move lost each game as expected.

Mirroring strategies attempt to copy the previous move. On first move, they will attempt to play in the centre. If neither of these are possible, they will pick a random move. Each entry represents a different form of copying; direct copy, reflection in either the X or Y axis, half-turn rotation.

1: Black win % | 55.20±3.10 | Includes draws = 50% |
---|---|---|

2: White win % | 44.80±3.06 | Includes draws = 50% |

Draw % | 0.00 | Percentage of games where all players draw. |

Decisive % | 100.00 | Percentage of games with a single winner. |

Samples | 1000 | Quantity of logged games played |

Note: that win/loss statistics may vary depending on thinking time (horizon effect, etc.), bad heuristics, bugs, and other factors, so should be taken with a pinch of salt. (Given perfect play, any game of pure skill will always end in the same result.)

Note: Ai Ai differentiates between states where all players draw or win or lose; this is mostly to support cooperative games.

Match | AI | Strong Wins | Draws | Strong Losses | #Games | Strong Score | p1 Win% | Draw% | p2 Win% | Game Length |
---|---|---|---|---|---|---|---|---|---|---|

0 | Random | |||||||||

2 | UCT (its=3) | 631 | 0 | 348 | 979 | 0.6140 <= 0.6445 <= 0.6739 | 53.42 | 0.00 | 46.58 | 54.26 |

28 | UCT (its=76) | 631 | 0 | 163 | 794 | 0.7652 <= 0.7947 <= 0.8214 | 49.87 | 0.00 | 50.13 | 51.52 |

29 | UCT (its=207) | 631 | 0 | 114 | 745 | 0.8194 <= 0.8470 <= 0.8710 | 48.59 | 0.00 | 51.41 | 42.99 |

30 | UCT (its=562) | 631 | 0 | 186 | 817 | 0.7423 <= 0.7723 <= 0.7998 | 50.43 | 0.00 | 49.57 | 37.19 |

31 | UCT (its=1529) | 631 | 0 | 256 | 887 | 0.6807 <= 0.7114 <= 0.7402 | 53.78 | 0.00 | 46.22 | 34.85 |

32 | UCT (its=1529) | 506 | 0 | 494 | 1000 | 0.4750 <= 0.5060 <= 0.5369 | 54.80 | 0.00 | 45.20 | 34.72 |

Search for levels ended: time limit reached.

Level of Play: **Strong** beats **Weak** 60% of the time (lower bound with 95% confidence).

Draw%, p1 win% and game length may give some indication of trends as AI strength increases.

This chart shows the win(green)/draw(black)/loss(red) percentages, as UCT play strength increases. **Note that for most games, the top playing strength show here will be distinctly below human standard.**

Game length | 36.16 | |
---|---|---|

Branching factor | 30.75 | |

Complexity | 10^52.90 | Based on game length and branching factor |

Samples | 1000 | Quantity of logged games played |

Computational complexity (where present) is an estimate of the game tree reachable through actual play. For each game in turn, Ai Ai marks the positions reached in a hashtable, then counts the number of new moves added to the table. Once all moves are applied, it treats this sequence as a geometric progression and calculates the sum as n-> infinity.

Distinct actions | 680 | Number of distinct moves (e.g. "e4") regardless of position in game tree |
---|---|---|

Good moves | 270 | A good move is selected by the AI more than the average |

Bad moves | 410 | A bad move is selected by the AI less than the average |

Terrible moves | 30 | A terrible move is never selected by the AI Too many terrible moves to list. |

Response distance | 3.18 | Mean distance between move and response; a low value relative to the board size may mean a game is tactical rather than strategic. |

Samples | 1000 | Quantity of logged games played |

A mean of 44.52% of board locations were used per game.

Colour and size show the frequency of visits.

Game length frequencies.

Mean | 36.16 |
---|---|

Mode | [35] |

Median | 35.0 |

This chart is based on a single representative* playout, and gives a feel for the change in material over the course of a game. (* Representative in the sense that it is close to the mean length.)

Table: branching factor per turn, based on a single representative* game. (* Representative in the sense that it is close to the mean game length.)

This chart is based on a single representative* game, and gives a feel for the types of moves available throughout that game. (* Representative in the sense that it is close to the mean game length.)

Red: removal, Black: move, Blue: Add, Grey: pass, Purple: swap sides, Brown: other.

This chart shows the best move value with respect to the active player; the orange line represents the value of doing nothing (null move).

The lead changed on 25% of the game turns. Ai Ai found 4 critical turns (turns with only one good option).

This chart shows the relative temperature of all moves each turn. Colour range: black (worst), red, orange(even), yellow, white(best).

Measure | All players | Player 1 | Player 2 |
---|---|---|---|

Mean % of effective moves | 24.67 | 26.23 | 23.11 |

Mean no. of effective moves | 6.56 | 7.44 | 5.67 |

Effective game space | 10^21.90 | 10^10.35 | 10^11.55 |

Mean % of good moves | 11.32 | 2.82 | 19.83 |

Mean no. of good moves | 3.00 | 0.89 | 5.11 |

Good move game space | 10^10.96 | 10^1.56 | 10^9.41 |

These figures were calculated over a single game.

An *effective move* is one with score 0.1 of the best move (including the best move). -1 (loss) <= score <= 1 (win)

A *good move* has a score > 0. Note that when there are no good moves, an multiplier of 1 is used for the game space calculation.

Measure | Value | Description |
---|---|---|

Hot turns | 97.22% | A hot turn is one where making a move is better than doing nothing. |

Momentum | 25.00% | % of turns where a player improved their score. |

Correction | 44.44% | % of turns where the score headed back towards equality. |

Depth | 4.09% | Difference in evaluation between a short and long search. |

Drama | 0.46% | How much the winner was behind before their final victory. |

Foulup Factor | 8.33% | Moves that looked better than the best move after a short search. |

Surprising turns | 0.00% | Turns that looked bad after a short search, but good after a long one. |

Last lead change | 88.89% | Distance through game when the lead changed for the last time. |

Decisiveness | 8.33% | Distance from the result being known to the end of the game. |

These figures were calculated over a single representative* game, and based on the measures of quality described in "Automatic Generation and Evaluation of Recombination Games" (Cameron Browne, 2007). (* Representative, in the sense that it is close to the mean game length.)

Moves | Animation |
---|---|

f7,a5,f1,c1 | |

c1,b5,c7 | |

e1,d1,d5 | |

e1,g7,b3 | |

b3,g7,e1 | |

c3,e5,b3 | |

d3,g7,d7 | |

a5,b5,g5 | |

b5,g7,d7 | |

c5,g1,b3 | |

f5,a5,c1 | |

g5,c3,b5 |

Colour shows the success ratio of this play over the first 10moves; black < red < yellow < white.

Size shows the frequency this move is played.

0 | 1 | 2 | 3 | 4 | 5 | 6 |
---|---|---|---|---|---|---|

1 | 28 | 784 | 11368 | 154630 | 1563310 | 15136182 |

Note: most games do not take board rotation and reflection into consideration.

Multi-part turns could be treated as the same or different depth depending on the implementation.

Counts to depth N include all moves reachable at lower depths.

Inaccuracies may also exist due to hash collisions, but Ai Ai uses 64-bit hashes so these will be a very small fraction of a percentage point.

No solutions found to depth 6.

Puzzle | Solution |
---|---|

Black to win in 5 moves | |

Black to win in 7 moves | |

Black to win in 5 moves | |

White to win in 6 moves | |

White to win in 5 moves | |

White to win in 5 moves | |

Black to win in 3 moves | |

Black to win in 3 moves | |

Black to win in 3 moves |

Weak puzzle selection criteria are in place; the first move may not be unique.