[ View Thread ] [ Post Response ] [ Return to Index ] [ Read Prev Msg ] [ Read Next Msg ]

BGonline.org Forums

Confidence Interval Overlap vs Joint Standard Deviation (long)

Posted By: neilkaz
Date: Saturday, 31 October 2009, at 5:13 p.m.

There's been some discussion and concern here since GNU rollouts use JSD and since XG rollouts like Snowie rollouts report as a 95% confidence interval.

Way back when Snowie was the only really good bot on the scene (Was this before or after the Norman Conquest?..I dunno...too many pain pills here to recall that long ago) I used feel really confident about a rollout if the confidence intervals didn't overlap. Knowing that each specific interval gave 95% confidence made me feel quite certain that if those intervals didn't overlap, or even if they just touched, that the RO result was giving me almost 100% stat sig.

Noting that I only took a couple college prob/stat courses 30 years ago, I'll do my best with some basic stat now. Those here who are better than me in math can please correct, or tell me that I'm far wrong.

It all starts with standard deviation. GNU reports as std dev. XG and Snowie report as 95% CI. A 95% CI is a range of data that is +/- 1.96 SD from the mean.

Lets say that a RO resulted in a position being a .96 take with a .005 SD. The 95% CI for this RO is .96 +/- .0098. I'll simplify and call the CI .96 +/- .01 since multiplying by 2 is almost the same as multiplying by 1.96. I'll use .005 as SD and .010 as 95% CI for the remainder of this discussion.

OK now lets say that we are rolling out one play vs another, and play A results in .92 and play B results in .94 emg. Both plays have a SD of .005.

The CI for both plays is therefore ~.01. So XG would show play A confidence as (.91 to .93) and Play B confidence as (.93 to .95). Does the fact that these CI touch make you feel uncomfortable that Play A may actually be superior to play B? Well, it shouldn't !

"Joint Standard Deviation The standard deviation of the difference between two rollouts: JSD = sqrt(SD1*SD1 + SD2*SD2). A measure of how statistically significant the result is." comes from Tom's site.

Lets calculate JSD here. Sqrt(.025+.025)= .00707...or about 1.4 times SD when SD's are the same.

Hmmm...Play A was .92 and Play B was .94 emg. They differ by .020 which is almost 3 times the JSD of .007 ! ie this is more than 99% confidence that play B would finish ahead of play A if we could extend the rollout until the end of time. 99+% confidence is certainly good enough for me.

What we see here is that if the two 95% CI's don't overlap or if they even touch we have at least 99% confidence.

What more do we need? XG is blazing fast for RO's and I am very comfortable with any results that don't have CI overlap and I think all should be comfortable with this method as well.

Who needs JSD? ..well not me... neilkaz ..

Messages In This Thread

 

Post Response

Your Name:
Your E-Mail Address:
Subject:
Message:

If necessary, enter your password below:

Password:

 

 

[ View Thread ] [ Post Response ] [ Return to Index ] [ Read Prev Msg ] [ Read Next Msg ]

BGonline.org Forums is maintained by Stick with WebBBS 5.12.