[Probability] Conditional Probability

Conditional Probability:

If A and B are events with P(B) > 0, then the conditional probability of A given B, denoted by P(A|B), is defined as P(A|B) = P(A ∩ B) / P(B)

Two Children:

(Two children). Martin Gardner posed the following puzzle in the 1950s, in his column in Scientific American.

Mr. Jones has two children. The older child is a girl.

What is the probability that both children are girls?   1/2

Mr. Smith has two children. At least one of them is a boy.

What is the probability that both children are boys?  1/3

The possible combinations for the two children are: {BB, BG, GB, GG} where B represents a boy and G represents a girl.

At first glance this problem seems like it should be a simple application of conditional probability, but for decades there have been controversies about whether or why the two parts of the problem should have different answers, and the extent to which the problem is ambiguous. Gardner gave the answers 1/2 and 1/3 to the two parts, respectively, which may seem paradoxical: why should it matter whether we learn the older child’s gender, as opposed to just learning one child’s gender?

It is important to clarify the assumptions of the problem. Several implicit assumptions are being made to obtain the answers that Gardner gave.

• It assumes that gender is binary, so that each child can be definitively categorized as a boy or a girl. In fact, many people don’t neatly fit into either of the categories “male” or “female”, and identify themselves as having a non-binary gender.

• It assumes that P(boy) = P(girl), both for the elder child and for the younger child. In fact, in most countries slightly more boys are born than girls. For example, in the United States it is commonly estimated that 105 boys are born for every 100 girls who are born.

• It assumes that the genders of the two children are independent, i.e., knowing the elder child’s gender gives no information about the younger child’s gender, and vice versa. This would be unrealistic if, e.g., the children were identical twins.

Think about: 

Suppose we have a winter-born boy, what's the prob. that both are boys?

Let's define the events:

  • Event A: Both children are boys. P(A) = 1/4
  • Event B: One child is a winter-born boy. P(B) = 1 - P(C) = 15 / 64
  • Event C : Both are not winter-born boy. P(C) = ((7/8)^2) = 49 / 64
  • P( A&B ) = 7 / 64 (notice that A & B are not independent )
  • P( A&B )/ P(B) = 7 / 15

We want to find P(A∣B), the probability that both children are boys given that one of them is a winter-born boy.

Don't trust your intuitions !

Think about : P ( Both are boys, and there's a winter - born child ) is equal to 

P ( Both are boys)* P( there is a winter born child )= 1/4 * 7/16 

Because the two are independent

Independence

P(A&B) = P(A)* P(B) iff A & B are independent

if P (A) > 0, P ( A | B ) = P ( A )

like : 

A: flip a coin and get "H"

B : you get an A in the course. 

(Independence of complementary events)

If A and B are independent, then A and Bc are independent, Ac and B are independent, and Ac and Bc are independent.

Proof. Let A and B be independent. We will first show that A and Bc are independent.

If P(A) = 0, then A is independent of every event, including Bc .

So assume P(A) != 0. Then P(B c |A) = 1 − P(B|A) = 1 − P(B) = P(B c ),

(Independence of three events).

Events A, B, and C are said to be independent if all of the following equations hold:

P(A ∩ B) = P(A)P(B),

P(A ∩ C) = P(A)P(C),

P(B ∩ C) = P(B)P(C),

P(A ∩ B ∩ C) = P(A)P(B)P(C).

On the other hand, P(A ∩ B ∩ C) = P(A)P(B)P(C) does not imply pairwise independence; this can be seen quickly by looking at the extreme case P(A) = 0, when the equation becomes 0 = 0, which tells us nothing about B and C. We can define independence of any number of events similarly. 

Q: whether... A: Yes/ No. (Binary info) If A and B are independence, knowing A would not give insights of B. 

What about A and B and C? A, B independent, to make sure C is independent of A and B, 

Q1: whether A...

Q2: whether B...

The combinations of the answers of Q1 & Q2 does not affect the event C, which are all needed to be checked. But it turned out that we only need the four equations above.

For more, you can prove with similar equations which can be proved by using complementary result and prove the kth event does not affect others.

(Pairwise independence doesn’t imply independence)

(Conditional independence).

Events A and B are said to be conditionally independent given E if P(A ∩ B|E) = P(A|E)P(B|E).

but

It is easy to make terrible blunders stemming from confusing independence and conditional independence. Two events can be conditionally independent given E, but not independent given Ec .

(Conditional independence doesn’t imply independence). 

(Probability of the intersection of two events).

For any events A and B with positive probabilities, P(A ∩ B) = P(B)P(A|B) = P(A)P(B|A)

Bayes' Rule:

Bayes' theorem, also known as Bayes' rule or Bayes' law, is a fundamental concept in probability theory. It provides a way to update probabilities based on new evidence or information. The theorem is named after the Reverend Thomas Bayes, an 18th-century mathematician and statistician.

The formula for Bayes' theorem is expressed as follows:

P(A|B) = P(B|A) *P(A) / P(B)

with 

P(A|B) = P(A ∩ B) / P(B)

Here's the breakdown of the terms:

- \( P(A|B) \): The probability of event A occurring given that event B has occurred. This is the posterior probability.
- \( P(B|A) \): The probability of event B occurring given that event A has occurred. This is the likelihood.
- \( P(A) \): The probability of event A occurring. This is the prior probability.
- \( P(B) \): The probability of event B occurring. This is the marginal likelihood or the total probability of event B.

In words, Bayes' theorem states that the probability of A given B is proportional to the probability of B given A, multiplied by the prior probability of A, and then normalized by the overall probability of B.

Bayes' theorem is particularly useful in Bayesian statistics and machine learning, where it is employed in various applications such as Bayesian inference, spam filtering, medical diagnosis, and more. It allows for the incorporation of new evidence to update prior beliefs and make more informed decisions.

(Odds).

The odds of an event A are

odds(A) = P(A)/P(A c ).

For example, if P(A) = 2/3, we say the odds in favor of A are 2 to 1. (This is sometimes written as 2 : 1, and is sometimes stated as 1 to 2 odds against A

Of course we can also convert from odds back to probability:

P(A) = odds(A)/(1 + odds(A))

你可能感兴趣的:(STA,Probability)