Many years ago I took a basic college statistics course. Usually it had to do with a situation where you have a collection of different objects and you calculate the odds of getting a specific subset or range of subsets from that. For example, you might say "a jar has 8 white marbles and 13 black marbles. What are the odds that the first marble removed is black, not returned, and then you pull 2 white marbles?"
However, I am trying to do something basically the reverse of this. I have a jar of 100 marbles which can be white or black. But I don't know how many of each there are. From the jar I remove 10 marbles at random: 7 white marbles and 3 black marbles. From that information, I want to know what the odds are that there are more white marbles in the jar than black marbles.
I had an idea on the process, but I have doubts about it. I thought that I could look at the number of possible further outcomes. There are 90 marbles remaining, and there could therefore be 0 to 90 white marbles remaining (91 possible outcomes). From that, I can determine that 44 to 90 white marbles in the jar would produce an outcome where there are more white marbles than black marbles, while outcomes 0-42 produce an outcome where there are fewer white marbles than black marbles, and outcome 43 produces a result where there are the same number of each. 44 of the outcomes are failures in this sense, and therefore this means that of the possibilities, 47/91 of the possibilities are successes, and that would suggest a 51.6% chance that there are more white marbles than black marbles. Is this correct?
What I'm unsure about... given that I have already pulled more white marbles than black marbles, are the odds of each possible remaining outcome the same? Which I think is what I'm assuming in the above calculations. And it seems odd to me that a significant majority in 10% subset at random from the whole still would be only a tiny increase over 50/50 odds.
However, I am trying to do something basically the reverse of this. I have a jar of 100 marbles which can be white or black. But I don't know how many of each there are. From the jar I remove 10 marbles at random: 7 white marbles and 3 black marbles. From that information, I want to know what the odds are that there are more white marbles in the jar than black marbles.
I had an idea on the process, but I have doubts about it. I thought that I could look at the number of possible further outcomes. There are 90 marbles remaining, and there could therefore be 0 to 90 white marbles remaining (91 possible outcomes). From that, I can determine that 44 to 90 white marbles in the jar would produce an outcome where there are more white marbles than black marbles, while outcomes 0-42 produce an outcome where there are fewer white marbles than black marbles, and outcome 43 produces a result where there are the same number of each. 44 of the outcomes are failures in this sense, and therefore this means that of the possibilities, 47/91 of the possibilities are successes, and that would suggest a 51.6% chance that there are more white marbles than black marbles. Is this correct?
What I'm unsure about... given that I have already pulled more white marbles than black marbles, are the odds of each possible remaining outcome the same? Which I think is what I'm assuming in the above calculations. And it seems odd to me that a significant majority in 10% subset at random from the whole still would be only a tiny increase over 50/50 odds.