A certain lottery involves players selecting six numbers without replacement from the range [1,49]. The jackpot is shared among the players who match all six numbers ("balls") selected in the same way at random in a twice-weekly draw (in any order). If no player matches every drawn number, the jackpot "rolls over" and is added to the following draw's jackpot.
Although the lottery is fair in the sense that every combination of drawn numbers is equally likely, it has been observed that many players show a preference in their selection for certain numbers such as those which represent dates (i.e. more of their numbers are chosen from [1,31] than would be expected if they chose randomly). Hence, to avoid sharing the jackpot and hence to maximise one's expected winnings, it would be reasonable to avoid these numbers.
Test this hypothesis by establishing if there is any correlation between the number of balls with values less than 13 (representing a month) and the jackpot winnings per person. Ignore draws immediately following a rollover. The necessary data can be downloaded here.
To access solutions, please obtain an access code from Cambridge University Press at the Lecturer Resources page for my book (registration required) and then sign up to scipython.com providing this code.