|
BGonline.org Forums
Rollout strength - A mathematical model
Posted By: eXtreme Gammon In Response To: Rollout strength - A mathematical model (Maik Stiebler)
Date: Saturday, 18 February 2012, at 4:53 p.m.
First of all, sorry for the math below, i know it may only interest a few of you.
However, if it is not clear, the goal of what i am doing here is to be able to tell what are best setting for Rollout. for instance the results I posted shows that there is basically no reason to use 1-ply or 2-ply RO: XGR++ will be stronger. Also full rollout under 2592 games are not as accurate as XGR++.
WARNING: MATH BELLOW
For XG, the level used to calculate the luck of each roll is the ply level used for checker play with the following exception:
1-ply uses 2-ply luck XGRoller uses 3-ply luck XGroller+ and ++ uses 4-ply luck. Stronger luck level allow the VR to converge faster.
Theoretically, a perfect level would make the rollout converge immediately (after one game roll)Here is the methodology used: I analyzed (using the first 1000 positions of the Depreli study) what a level consting of a full rollout using P ply and N game would play. I did that using 1-ply and 2-ply. I also modify the program to allow a rollout using 2-ply to perform VR using 3-ply. Also i've check using 3-ply Red (that uses 3-ply luck). I made test using 108,324 648,1296, 2592,5184 and 10368 games
Below is a graph of the results using 1/Sqrt(N) on the X axis and the PR in the Y axis
Making a linear regression shows the following
- the effect of the number of games is linear with 1/sqrt(N)
- increasing the ply in the VR makes the RO converge faster
- The effect of VR is only dependent of the level used in the VR as
- 1-ply and 2-ply are roughly parallel
- "2-ply using 3-ply VR" and "3-ply red" are roughly parallel
- infinite RO (X=0) do no depend of the level used in VR: "2-ply" and "2-ply using 3-ply VR" converge to the same value.
These linear regressions give me 2 things:
the slope is the VR effect for 2-ply and 3-ply the intercept at X=0 gives the level of an infinite RO for 1-ply, 2-ply and 3-ply red. now determining the formula for PR(N-ply,infinite game) and VR(N-ply) is not based on anything, i just pick a formula that made sense to me. For PR(N-ply,infinite game) I've 4 points (knowing that PR=0 means PRro=0). a power function fitted well:
X axis is the Pr of the level, Y axis is the PR of an infinite RO with that levelfor the VR factor, I've only 3 point (PR=0, 2-ply, 3-ply), but i figure i'd also use a power function.
|
BGonline.org Forums is maintained by Stick with WebBBS 5.12.