Processing math: 100%

Monday, May 10, 2021

A numerical game of Mine's Bigger

Martina and Olivia each secretly generate their own random real number, selected uniformly between 0 and 1. Starting with Martina, they take turns declaring (so the other can hear) who they think probably has the greater number until the first moment they agree. Throughout this process, their respective numbers do not change.

They are playing as a team, hoping to maximize the chances they correctly predict who has the greater number. For any given round with randomly generated numbers, what is the probability that the person they agree on really does have the bigger number?

Extra credit: Martina and Olivia change the rules so that they stop when Olivia first says that she agrees with Martina. That is, if Martina says on her turn that she agrees with Olivia, that is not a condition for stopping. Again, if they play to maximizing their chances, what is the probability that the person they agree on really does have the bigger number?

Martina and Olivia are excellent Bayesian statisticians and after each response they update the current bounds for the other's random value. In particular, after k responses from Olivia, there are estimates Oklo,Okhi and Mklo,Mkhi such that Martina believes that Olivia's number is distributed as U(Oklo,Okhi) and Olivia believes that Martina's number is distributed as U(Mklo,Mkhi). Given this assumption and that Martina knows the realization of her own random process m, she will respond for her kth response that she believes her own value is bigger whenever P{OmOU(Oklo,Okhi)}0.5

and that Olivia's number is larger otherwise. Similarly, since Olivia knows ther realization of her own random process o, Olivia's kth response will similarly be that she believes her own value is bigger whenever P{MoMU(Mklo,Mkhi)}0.5.

Based on the faithful behavior of the players to estimate which of them have the larger number based on all of the available inforation, the updating procedure for Oklo,Okhi,Mklo and Mkhi are interrelated, with Mklo={12(Ok1lo+Ok1hi),if Martina says her number is biggerMk1lo,if Martina says Olivia's number is biggerMkhi={Mk1hi,if Martina says her number is bigger12(Ok1lo+Ok1hi),if Martina says Olivia's number is biggerOklo={Ok1lo,if Olivia says Martina's number is bigger12(Mk1lo+Mk1hi),if Olivia says her number is biggerOkhi={12(Mk1lo+Mk1hi),if Olivia says Martina's number is biggerOk1hi,if Olivia says her number is bigger

For any (m,o)[0,1]×[0,1], the gameplay can be characterized by a finite string in the alphabet {M,O} corresponding the concatenating Martina's and Olivia's responses until there is either an instance of MM or OO. We can carve up the unit square [0,1]×[0,1] into response regions where each (m,o) in that region would lead to the same response history from Martina and Olivia in their game. So, for instance the MM region is [0.5,1]×[0,0.75], and the subregion within MM where the identified player does not have the largest number is a right isoceles triangle with side length 0.750.5=0.25. On the flip side the OO region is [0,0.5]×[0.25,1] and also has a misidentified region that is right isoceles triangle with side length 0.25. So if the gameplay string has 2 characters, the mischaracterization region has area (1/4)^2.$

Moving further, we see that the regions MOO given by [0.5,0.875]×[0.75,1] and OOM given by [0.125,0.5]×[0,0.25] are congruent and each has a misidentification region shaped like a right isoceles with side length 0.125. If the gameplay string has 3 characters, then the misidentification region has area (1/8)2. Moving further on, we see that if the gameplay string has k characters, then the misidentification region has area (1/2k)2=4k. Therefore, the entire misclassification region has area given by k=214k=1114114=4354=112

and thus the probability of correctly identifying the player with the larger number is 1115.

If Martina and Olivia modified the game to end only if Olivia agrees with Martina, then this limits the response strings to only those with even lengths. In this case, the misidentified region now has area k=1142k=k=1116k=111161=115,

and so the modification allows the pair gets better at identifying the player with probability 1516.

No comments:

Post a Comment