Essential of Probability

Assignment ~ Week 10

Boma satrio (52250061)

Logo

Introduction

Probability is a field of mathematics that studies the likelihood of an event occurring. Understanding this concept is crucial as it forms the basis for data analysis, statistics, and decision-making under uncertainty.

This report is based on six videos that systematically discuss the flow of probability concepts. The first video introduces the sample space and events as the foundation for probability calculation. The second video explains the difference between independent and dependent events. The third video discusses the rules for the union and intersection of events. The fourth video reaffirms the basic probability rules. The fifth video introduces the binomial experiment and its formula, while the sixth video visualizes the binomial distribution for easier comprehension.

Overall, these six videos complement each other and provide a concise, comprehensive overview of fundamental concepts up to the initial applications of probability.


6.1 Fundamental Concept

Explanatory Video

If the video does not appear, please click the YouTube link below:

https://youtu.be/ynjHKBCiGXY

This video explains the basic definitions in probability theory: sample space, event, and how to determine the probability of an event. It then introduces the basic rules of probability: every probability value is between 0 and 1; the total probability of all outcomes in the sample space equals 1.

✨ 1. Probability and Value

Probability (\(P(A)\)) is a measure of how likely an event is to occur.
Probability Value: The value of \(P(A)\) must always be in the range of 0 to 1.

\[0 \le P(A) \le 1\]

  • \(0\): The event is impossible.
  • \(1\): The event is certain to occur.

✨ 2. Sample Space (\(S\))

The sample space (\(S\)) is the set of all possible outcomes of an experiment.
- Event (\(A\)): An event is a single outcome or a collection of outcomes (a subset) of the sample space.
- Basic Formula: If every outcome is equally likely (fair), then:

\[ P(A) = \frac{\text{Number of outcomes in } A \text{ (favorable outcomes)}}{\text{Number of outcomes in sample space} \text{ (total possible outcomes)}}\]

✨ 3. Basic Probability Rules

All probability problems must satisfy these two essential conditions:
a. Probability Range: The probability of an event must always be between 0 and 1: \[0 \le P(A) \le 1\] b. Total Probability: The total probability of all outcomes in the sample space must always equal 1: \[P(S) = 1\] - Example: \(P(\text{Heads}) + P(\text{Tails}) = 0.5 + 0.5 = 1.0\).

✨ 4. Complement Rule

The Complement Rule is used to calculate the probability that an event does not occur.
- Complement (\(A^c\)): The complement of event \(A\) is the event that \(A\) does not occur.
- Rule: \[ P(A^c) = 1 - P(A)\] - This rule holds because \(A\) and \(A^c\) together cover the entire sample space (\(P(A) + P(A^c) = 1\)).

Logo click if Image does not appear

  • Usefulness: This rule is very useful when it is easier to calculate the probability of the opposite (complement) than to calculate the probability of the original event directly.

✨ 5. Example Application of the Complement Rule

Situation: Tossing two coins. Determine the probability of NOT getting two tails (\(A^c\)).
1. Calculate the probability of event \(A\) (getting two tails): \(P(TT) = 0.25\).
2. Use the Complement Rule: \[P(\text{Not } TT) = 1 - P(TT) = 1 - 0.25 = 0.75\] The probability of not getting two tails is \(0.75\) or \(75\%\).


6.2 Independent and Dependent

Explanatory Video

If the video does not appear, please click the YouTube link below:

https://youtu.be/LS-_ihDKr2M

This video explains the difference between independent events and dependent events in probability. It also explains how to calculate the probability of two events occurring simultaneously (\(A\) and \(B\)), called the Intersection, the formula for which depends on whether the events are Independent or Dependent.

☀️ 1. Basic Definitions

# Load library
library(knitr)

# Membuat data frame
tabel_konsep <- data.frame(
  Concept = c("Event", "Intersection (A∩B)"),
  Brief_Explanation = c("An outcome or a collection of outcomes that can occur in a random experiment.", "The probability that event A and event B occur together.")
)

# Menampilkan tabel
kable(tabel_konsep, caption = "Basic Probability Concepts")
Basic Probability Concepts
Concept Brief_Explanation
Event An outcome or a collection of outcomes that can occur in a random experiment.
Intersection (A∩B) The probability that event A and event B occur together.

☀️ 2. Independent Events

Two events \(A\) and \(B\) are called independent if the occurrence of \(A\) does not affect the probability of \(B\) occurring.

Main Formula:
The probability of the intersection is calculated by multiplying the probabilities of the individual events. \[P(A \cap B) = P(A) \times P(B)\] Characteristics & Examples: - Constant Probability:The probability of the next event remains the same, as nothing changes in the sample space. - Sampling Type: Usually occurs with sampling with replacement. - Example: Tossing a coin twice, or drawing a marble and then replacing it before the second draw.

☀️ 3. Dependent Events

Two events \(A\) and \(B\) are called dependent if the occurrence of \(A\) changes the probability of \(B\) occurring.

Key Concept: Conditional Probability
For dependent events, we use the concept of Conditional Probability, symbolized as \(P(B \mid A)\).
- \(P(B \mid A)\) is the probability that event \(B\) occurs, given that event \(A\) has already occurred.

Main Formula:
The probability of the intersection is calculated by multiplying the probability of the first event by the probability of the second event after the first one has occurred. \[P(A \cap B) = P(A) \times P(B \mid A)\] Characteristics & Examples:
- Changing Probability: The probability of \(B\) changes because the composition of the sample space has changed after \(A\) occurred.
- Sampling Type: Usually occurs with sampling without replacement.
- Example: Drawing a card from a deck without replacing it, or drawing a marble and not replacing it.4. Example ApplicationSituation: A jar contains 4 Red and 6 Blue marbles (Total 10). You want to draw 2 Red marbles consecutively.

☀️ 4. Example Application

Situation: A jar contains 4 Red and 6 Blue marbles (Total 10). You want to draw 2 Red marbles consecutively.

library(knitr)

# Membuat data frame
tabel_kejadian <- data.frame(
  Event_Type = c("Independent", "Dependent"),
  Condition = c("Sampling with replacement", "Sampling without replacement"),
  Calculation = c(
    "$$P(R1 \\cap R2) = P(R1) \\times P(R2) = \\frac{4}{10} \\times \\frac{4}{10} = \\frac{16}{100} = 0.16$$",
    "$$P(R1 \\cap R2) = P(R1) \\times P(R2 \\mid R1) = \\frac{4}{10} \\times \\frac{3}{9} = \\frac{12}{90} \\approx 0.133$$"
  )
)

# Menampilkan tabel
kable(tabel_kejadian, caption = "Probability Calculation for Independent and Dependent Events", escape = FALSE)
Probability Calculation for Independent and Dependent Events
Event_Type Condition Calculation
Independent Sampling with replacement \[P(R1 \cap R2) = P(R1) \times P(R2) = \frac{4}{10} \times \frac{4}{10} = \frac{16}{100} = 0.16\]
Dependent Sampling without replacement \[P(R1 \cap R2) = P(R1) \times P(R2 \mid R1) = \frac{4}{10} \times \frac{3}{9} = \frac{12}{90} \approx 0.133\]

6.3 Union of Events

Explanatory Video

If the video does not appear, please click the YouTube link below:

https://youtu.be/vqKAbhCqSTc

This video explains how to calculate the probability of “A or B” or “A and B” when dealing with two (or more) events. Key terms:

  • Union — means “A or B (or both)”. Written as: (A B)
  • Intersection — means “A and B occur together”. Written as: (A B)

Because there can be overlap (the area where two events occur simultaneously / intersection) between A and B, we must avoid double-counting the intersection when calculating the union probability.

🚀 1. Basic Definitions in Probability (Review)

  1. Sample Space (\(S\)):
  • The Sample Space is the set of all possible outcomes of a random experiment.
  • Example from the Video: Rolling two 6-sided dice; the total sample space is \(6 \times 6 = 36\) possible outcomes.
  1. Event:
  • An event is a subset of the sample space, which is a collection of outcomes that satisfy a specific condition.
  1. Intersection (\(A \cap B\)):
  • \(A \cap B\) is the set of outcomes included in both \(A\) and \(B\) — meaning both events occur simultaneously.
  • This is called the overlap area or duplicate outcomes.
  1. Union (\(A \cup B\)):
  • \(A \cup B\) is the probability that at least one of the events occurs (\(A\) or \(B\) occurs, or both).

Logo click if Image does not appear

🚀 2. Formula for Union Probability — General Addition Rule

We use this formula when the question contains the keyword “or”.
General Addition Rule: \[P(A \cup B) = P(A) + P(B) - P(A \cap B)\] - \(P(A \cup B)\): The probability that \(A\) or \(B\) occurs.
- \(P(A \cap B)\): The probability of the intersection (\(A\) and \(B\) occur simultaneously).

For Disjoint (Mutually Exclusive) Events:
If \(A\) and \(B\) are disjoint, their intersection probability is zero: \[P(A \cap B) = 0\] Thus, the formula simplifies to: \[P(A \cup B) = P(A) + P(B)\]

🚀 3. Why Must We Subtract the Intersection (\(P(A \cap B)\))?

  • Objective: The union formula requires the term \(- P(A \cap B)\) because we want to eliminate duplicate outcomes.
  • Visual Explanation: When you directly add \(P(A) + P(B)\), the outcomes in the intersection area (\(A \cap B\)) are counted twice.
  • To ensure every outcome in the sample space is counted exactly once, the intersection probability must be subtracted once.

🚀 4. Example Application

Using the example of rolling two dice:
- A: Event “Getting two even numbers” \(\rightarrow P(A) = \frac{9}{36}\)
- B: Event “Getting at least one 2” \(\rightarrow P(B) = \frac{11}{36}\)

Then:
- Intersection (\(A \cap B\)): Event “Getting two even numbers AND at least one 2”. \[ P(A \cap B) = \frac{5}{36} \quad \text{(taken from the sample space overlap)}\] - Union (\(A \cup B\)): Probability of getting \(A\) OR \(B\). \[\begin{aligned}P(A \cup B) &= P(A) + P(B) - P(A \cap B) \&= \frac{9}{36} + \frac{11}{36} - \frac{5}{36} \&= \frac{15}{36} \approx 0.4167\end{aligned}\] Meaning: the probability of getting “two even or at least one 2” is approximately 41.67%.

6.4 Exclusive and Exhaustive

Explanatory Video

If the video does not appear, please click the YouTube link below:

https://youtu.be/f7agTv9nA5k

This video explains the difference and definitions of two types of events in probability theory:

  • Mutually Exclusive Events — two events that cannot occur simultaneously.
  • Exhaustive Events — a collection of events that covers all possible outcomes of an experiment.

⭐ 1. Exclusive Events (Mutually Exclusive)

Definition
Two events are said to be mutually exclusive if they cannot occur at the same time. This means no outcome appears in both events simultaneously.

Mathematically: \[A \cap B = \emptyset\] There is no “overlap”.

Example:
On a single die roll:

  • A = rolling an even number = {2,4,6}
  • B = rolling an odd number = {1,3,5}
  • A and B have no intersection → mutually exclusive.

Formula ConsequenceIf A and B are mutually exclusive: \[P(A \cup B) = P(A) + P(B)\] No need to subtract the overlap because there is no intersection.

Logo click if Image does not appear

⭐ 2. Exhaustive Events (Collectively Exhaustive / Comprehensive)D

efinition
Events are called exhaustive if together they cover the entire sample space. \[P(A \cup B) = S\] All possible outcomes are included in A or B or both.

Example
Die roll:

  • A = even number = {2,4,6}
  • B = odd number = {1,3,5}
  • Union of A and B = S = {1,2,3,4,5,6}

→ A & B are exhaustive. Note: They are both exhaustive and mutually exclusive.

Another example that is not mutually exclusive but is still exhaustive:

  • A = number > 3 → {4,5,6}
  • B = number \(\le\) 3 → {1,2,3}

The union = the entire sample space. There is no requirement to be mutually exclusive.

⭐3. Key Differences

Differences Between Mutually Exclusive and Exhaustive Events
Concept Mutually_Exclusive Exhaustive
Can A and B occur together? No Yes (overlap is allowed or not allowed)
Intersection A ∩ B Always empty Can be empty / can be non-empty
Union A ∪ B Not necessarily = S Must equal the entire sample space
Example Even vs Odd Even & Odd, or >3 ≤ 3

⭐ 4. Relationship Between the Two

  • Two events can be mutually exclusive + exhaustive (example: even vs. odd).
  • They can be exhaustive but not exclusive (example: A = \(\ge 3\), B = \(\le 4\)).
  • They can be exclusive but not exhaustive (example: A = 1, B = 6 on a die → they don’t cover the whole 1–6).

⭐ 5. Example ApplicationE

xample 1 (Mutually Exclusive)
Given:

  • A = rolling a 2
  • B = rolling a 5

Then: \[P(A \cup B) = P(A) + P(B) = \frac{1}{6} + \frac{1}{6} = \frac{2}{6}\]

Example 2 (Exhaustive)
Given:

  • A = rolling a number \(\le 3\)
  • B = rolling a number \(\ge 3\)

The union covers the entire sample space → exhaustiveness.

6.5 Binomial Experiment

Explanatory Video

If the video does not appear, please click the YouTube link below:

https://youtu.be/nRuQAtajJYk

This video explains the basic concept of a Binomial Experiment and how to calculate probabilities using the Binomial Formula. The core concept of the binomial probability distribution is that it is a probability distribution for an experiment that is repeated multiple times and yields only two possible outcomes (as indicated by the prefix “bi-”), which are: Success or Failure.

🪐 1. Binomial Experiment

A Binomial Experiment is a type of statistical trial that satisfies four key characteristics:
- There is a fixed number of trials (\(n\)).
- Each trial has only two outcomes: success or failure.The probability of success (\(p\)) is constant for every trial.
- The trials are independent (they do not influence each other).

Case examples:Coin Toss:
- Calculating the probability of getting exactly one head (success) in three coin tosses. Since all 4 conditions are met (fixed number of trials \(n=3\), only two outcomes, constant probability \(p=0.5\), and independent trials), this is a binomial experiment.
- Marble Draw: Calculating the probability of getting exactly two green marbles (success) in five draws with replacement. Replacement ensures that the probability of success remains constant in every trial, thus satisfying the binomial condition.

🪐 2. Conditions for a Binomial Experiment

A trial is called binomial if it meets four conditions:

✔ Fixed number of trials
The number of trials \(n\) is predetermined.
✔ Two possible outcomes
Each trial yields only success or failure.
✔ Constant probability
The probability of success \(p\) is the same for every trial.
✔ Independent trials
One trial does not affect the others.

🪐 3. Important Notation in Binomial Distribution

  • \(n\) = number of trials
  • \(x\) = number of successes
  • \(p\) = probability of success
  • \(q = 1 - p\) = probability of failure
  • \(P(X = x)\) = probability of getting exactly \(x\) successes

🪐 4. Binomial Distribution Formula

The binomial probability formula: \[P(k) = {n \choose k} \cdot p^k \cdot (1-p)^{\,n-k}\] Where:
- \(P(k)\) : the probability of getting exactly \(k\) successes.
- \({n \choose k}\) : the combination, which is the number of ways to get \(k\) successes fromsukses dari \(n\) trials.
- \(n\) : total number of trials.
- \(k\) : desired number of successes.
- \(p\) : probability of success on a single trial.
- \((1-p)\) : probability of failure on a single trial.

🪐 5. Binomial Example Problem

Using the marble draw case to demonstrate the formula:
Case: If 5 marbles are drawn with replacement, what is the probability of getting exactly 2 green marbles?

\(n = 5\) (Trials)
\(k = 2\) (Successes)
\(p = 0.2\) (Probability of green, from \(\frac{2}{10}\))
\(1-p = 0.8\) (Probability of non-green)

Thus: \[P(k=2) = {5 \choose 2}(0.2)^2(1-0.2)^{5-2}\] \[P(k=2) = 10 \times (0.04) \times (0.8)^3\] \[P(k=2) = 10 \times 0.04 \times 0.512\] \[P(k=2) = 0.2048\] So, the probability is 0.2048.

##🪐 6. Why is it called “Binomial”?

The name Binomial comes from the fact that this distribution always has two possible outcomes in every trial, indicated by the prefix bi (meaning two).

6.6 Binomial Distribution

Explanatory Video

If the video does not appear, please click the YouTube link below:

https://youtu.be/Y2-vSWFmgyI

This video continues the discussion on the Binomial Distribution, focusing on how to visualize the data and understand its distributional properties, and visually explaining how the shape of the Binomial Distribution graph is influenced by changes in the number of trials ((n)) and the probability of success ((p)).

🌙 1. Formula and Basic Visualization

  • Objective: The Binomial Distribution is used to calculate the probability of getting a specific number of successes (\(k\)) from a fixed number of trials (\(n\)).
  • Example: Tossing a coin 2 times (\(n=2\)) with a probability of Head (\(p=0.5\)). The probabilities for getting 0, 1, or 2 Heads are calculated using the Binomial Formula.
  • Graph: These probabilities are visualized using a bar chart (histogram), where the X-axis is the number of successes (\(k\)) and the Y-axis is the probability (\(P(k)\)).

🌙 2. Important Parameters

Formulas for parameters to measure the Binomial Distribution:

# Load library for kable
library(knitr)

# Membuat data frame tabel
tabel_binomial <- data.frame(
  Parameter = c("Mean (μ)", "Variance (σ²)", "Standard Deviation (σ)"),
  Formula = c("μ = n · p",
            "σ² = n · p · (1 - p)",
            "σ = √(n · p · (1 - p))"),
  Description = c("The expected average number of successes.",
                 "A measure of the spread of the data from the mean.",
                 "The square root of the variance.")
)

# Menampilkan tabel
kable(tabel_binomial, caption = "Table of Binomial Distribution Parameters")
Table of Binomial Distribution Parameters
Parameter Formula Description
Mean (μ) μ = n · p The expected average number of successes.
Variance (σ²) σ² = n · p · (1 - p) A measure of the spread of the data from the mean.
Standard Deviation (σ) σ = √(n · p · (1 - p)) The square root of the variance.

🌙 3. Influence of Probability of Success (\(p\)) on Distribution Shape

The probability of success (\(p\)) controls the shape of the distribution curve. It determines whether the graph will be symmetrical or skewed:
✨ Shape of the Binomial Distribution ✨
Value_p Distribution_Shape Description
p = 0.5 (50%) Symmetrical Perfectly symmetrical distribution, its peak is in the middle, resembling the Normal Distribution.
p < 0.5 Skewed Right Low probability of success, so most outcomes pile up at small number of successes (near 0).
p > 0.5 Skewed Left High probability of success, so most outcomes pile up at large number of successes (near n).

Visualization of Probability (\(p\)) Influence:Logo click if Image does not appear

  1. Case \(p < 0.5\)
  • (Skewed Right)Visual: The graph on the left.
  • Observation: The histogram peak is near the value 0 (small number of successes). The tail of the curve extends to the right towards larger number of successes. This happens because the probability of failure (\(1-p\)) is greater than the probability of success (\(p\)), making the most likely outcome a large number of failures.
  1. Case \(p = 0.5\)
  • (Symmetrical)Visual: The graph in the center.
  • Observation: The histogram is perfectly symmetrical, resembling a bell shape. The peak is exactly in the middle (at \(\mu = n \cdot p\)). This happens because the probability of success and failure are equal (\(50\%\)), balancing the probability distribution.
  1. Case \(p > 0.5\)
  • (Skewed Left)Visual: The graph on the right.
  • Observation: The histogram peak shifts to the right, piling up near the value \(n\) (large number of successes). The tail of the curve extends to the left towards the value 0. This happens because the probability of success (\(p\)) is greater than the probability of failure, making the most likely outcome a high number of successes.

🌙 4. Normal Approximation to Binomial

As the value of the number of trials (\(n\)) increases, the shape of the Binomial Distribution becomes increasingly similar to the bell curve of the Normal Distribution.
- Approximation Condition: Normal Approximation to Binomial can be used (to simplify calculations) if both of the following conditions are met:
1. \[n \cdot p \ge 10\] 2. \[n \cdot (1-p) \ge 10\]

Visualization of Number of Trials (\(n\)) Influence: Logo Click if Image does not appear

  1. Small \(n\) (e.g., \(n=5, p=0.5\))
  • Representation: The first graph at the top.
  • Observation: The histogram (blue bars) looks discrete and the difference in height between bars is significant. The Normal curve (red line) looks very wide and is not suitable for modeling this Binomial histogram.
  • Validity: \(n \cdot p = 5 \cdot 0.5 = 2.5\). Since \(2.5 < 10\), the approximation is invalid.
  1. Medium \(n\) (e.g., \(n=20, p=0.5\))
  • Representation: The middle graph.
  • Observation: The histogram is starting to become symmetrical and shows a bell shape. The Normal curve is starting to encompass the histogram bars better.
  • Validity: \(n \cdot p = 20 \cdot 0.5 = 10\). Since \(10 \ge 10\), the approximation condition is beginning to be met. This approximation can start to be used, although accuracy may need improvement (usually requiring continuity correction).
  1. Large \(n\) (e.g., \(n=50, p=0.5\))
  • Representation: The bottom graph.
  • Observation: The histogram becomes very dense and smooth. The midpoint of each histogram bar almost falls perfectly on top of the Normal Distribution bell curve.
  • Validity: \(n \cdot p = 50 \cdot 0.5 = 25\). Since \(25 \ge 10\), the approximation is highly valid and yields very accurate results. This is the perfect visualization of the Normal Approximation to Binomial.

Conclusion

The material in these six videos is the backbone that strengthens our understanding of uncertainty. In Data Science, everything is based on probability. By mastering Probability and the Binomial Distribution, we have the mathematical logic foundation to build, test, and trust all prediction models and conclusions we make.

📋 1. Summary of 6 Key Topics

  1. Fundamental Concepts
    Introduction to the basics of probability (sample space, events, chance/likelihood).
  2. Independent & Dependent Events
    Understanding whether the outcome of one event affects the probability of another.
  3. Union of Events
    Rules for calculating the probability of one or more events occurring (\(P(A \text{ or } B)\)).
  4. Exclusive & Exhaustive Events
    Classification of relationships between events. For example, mutually exclusive events mean both cannot occur simultaneously.
  5. Binomial Experiment
    Defining the conditions for a trial to have its probability calculated using the Binomial formula (e.g., only two outcomes: success/failure, and trials are repeated independently).
  6. Binomial Distribution
    Application tool for calculating the probability of obtaining a specific number of successes in a fixed number of trials that satisfy the Binomial conditions.

📋 2. Interconnection (Logical Progression)

The interrelation among these six videos is a progression from the most basic rules to the use of a specific model tool:

  1. Logical Foundation (V1-V4)
    Videos 1 to 4 form the philosophical and mathematical basis of probability. We must understand what probability is, how events influence each other (V2 - Independent and Dependent), and how to calculate the probability of a union (V3 - Union) before we can go further. The concepts in V4 (Exclusive and Exhaustive) help us classify the relationship between these events.
  2. Moving Towards Application (V5)
    The concepts from V1-V4 are used to establish the conditions of a trial. Video 5 (Binomial Experiment) teaches us how to identify real-world situations that satisfy all those basic probability rules (e.g., the outcome of each coin toss is independent, consistent with V2). This is the stage of determining whether the Binomial model can be used.
  3. Using the Tool (V6)
    Once we are confident that the situation meets the conditions (V5), we then use the Binomial Distribution (V6) to calculate specific probabilities (e.g., what is the probability of getting 7 successes out of 10 trials). This calculation depends on the validity of the conditions learned in V1-V5.(Basic Probability Rules (V1-V4) validate the Trial Conditions (V5), which then allows us to use the Model (V6) for prediction.)

📋3. Impact of This Material on Statistics Learning in Data Science

A deep understanding of Probability Theory, especially the Binomial Distribution, is crucial because it is the foundation for Statistical Inference and Binary Data Modeling (success/failure outcomes) in Data Science.
✨ Positive Impact of the Binomial Distribution ✨
Positive_Impact Easy_Explanation
Understanding A/B Testing The Binomial Distribution is the primary tool in A/B Testing…
Foundation for Classification Models Many Data Science problems are binary classification…
Validation and P-Value This concept is closely related to hypothesis testing and p-value…
Basis for Sampling Independent and Dependent concepts are crucial for valid samples…

References

---
title: "Essential of Probability"
subtitle: "Assignment ~ Week 10"
author: "Boma Satrio - (52250061)"
date: "`r format(Sys.Date(), '%B %d, %Y')`"
output:
  rmdformats::readthedown:
    self_contained: true
    thumbnails: true
    lightbox: true
    gallery: true
    number_sections: false
    lib_dir: "libs"
    df_print: "paged"
    code_folding: "hide"
    code_download: yes
    css: "style.css"
---
<div class="main-container">

  <div class="childish-box">
  Boma satrio
  (52250061)

  <img id="Foto" src="https://raw.githubusercontent.com/bomass1116/hehe/main/boma.jpg"
       alt="Logo"
       style="width:200px; display: block; margin: 15px auto; border-radius: 12px;">
</div>

</div>

# Introduction

<div class="pastel-box">
  <div class="pastel-title"></div>
  <p>Probability is a field of mathematics that studies the likelihood of an event occurring. Understanding this concept is crucial as it forms the basis for data analysis, statistics, and decision-making under uncertainty.

This report is based on six videos that systematically discuss the flow of probability concepts. The first video introduces the **sample space** and **events** as the foundation for probability calculation. The second video explains the difference between **independent** and **dependent** events. The third video discusses the rules for the **union** and **intersection** of events. The fourth video reaffirms the basic probability rules. The fifth video introduces the **binomial experiment** and its formula, while the sixth video visualizes the **binomial distribution** for easier comprehension.

Overall, these six videos complement each other and provide a concise, comprehensive overview of fundamental concepts up to the initial applications of probability.</p>
  <p style="text-align:center; font-size:1.4em; font-weight:bold;"></p>
</div>

---

# 6.1 Fundamental Concept

<h2><i>Explanatory Video</i></h2>

<div class="main-container">

  <center>
    <iframe src="https://www.youtube.com/embed/ynjHKBCiGXY" 
            width="768" height="400" data-external="1" 
            style="border:0; border-radius:12px;">
    </iframe>
  </center>

  <h3>If the video does not appear, please click the YouTube link below:</h3>
  <p><a href="https://youtu.be/ynjHKBCiGXY" target="_blank">https://youtu.be/ynjHKBCiGXY</a></p>

</div>

<div class="pastel-box">
  <div class="pastel-title"></div>
  <p>
This video explains the basic definitions in probability theory: **sample space**, **event**, and how to determine the probability of an event. It then introduces the basic rules of probability: every probability value is between 0 and 1; the total probability of all outcomes in the sample space equals 1.
</p>
  <p style="text-align:center; font-size:1.4em; font-weight:bold;"></p>
</div>

## ✨ 1. Probability and Value

Probability ($P(A)$) is a measure of how likely an event is to occur. <br>
Probability Value: The value of $P(A)$ must always be in the range of 0 to 1.

$$0 \le P(A) \le 1$$

- $0$: The event is impossible.<br>
- $1$: The event is certain to occur.

## ✨ 2. Sample Space ($S$)

The **sample space** ($S$) is the set of all possible outcomes of an experiment.<br>
- **Event** ($A$): An event is a single outcome or a collection of outcomes (a subset) of the sample space.<br>
- Basic Formula: If every outcome is equally likely (fair), then:

$$ P(A) = \frac{\text{Number of outcomes in } A \text{ (favorable outcomes)}}{\text{Number of outcomes in sample space} \text{ (total possible outcomes)}}$$

- Example (Tossing 2 Coins): Sample space: HH, HT, TH, TT (Total 4 outcomes).
<img id="Foto" src="https://raw.githubusercontent.com/bomass1116/contoh/main/licensed.jpg" alt="Logo" style="width:200px; display: block; margin: auto;">
[Click if Image does not appear](https://1drv.ms/i/c/1ad1f53bf8aca8de/EVK9vf2ULJhGqQUySxzZ5JIBfcYuKM1pp-T_Hx6_9E51vg?e=rSmH1Q){ width=50% }

## ✨ 3. Basic Probability Rules

All probability problems must satisfy these two essential conditions:<br>
a. Probability Range: The probability of an event must always be between 0 and 1:
$$0 \le P(A) \le 1$$
b. Total Probability: The total probability of all outcomes in the sample space must always equal 1:
$$P(S) = 1$$
- Example: $P(\text{Heads}) + P(\text{Tails}) = 0.5 + 0.5 = 1.0$.

## ✨ 4. Complement Rule

The **Complement Rule** is used to calculate the probability that an event *does not* occur.<br>
- **Complement** ($A^c$): The complement of event $A$ is the event that $A$ does not occur.<br>
- Rule:
$$ P(A^c) = 1 - P(A)$$
- This rule holds because $A$ and $A^c$ together cover the entire sample space ($P(A) + P(A^c) = 1$).

<img id="Foto" src="https://raw.githubusercontent.com/bomass1116/2kali/main/licensed-.jpg" alt="Logo" style="width:200px; display: block; margin: auto;">
[click if Image does not appear](https://1drv.ms/i/c/1ad1f53bf8aca8de/Eb8WoUIgBO5Gn3S5FoANLyMBt3CxkzXQcAnXmWJewidsVQ?e=eBpEtv){ width=50% }

- Usefulness: This rule is very useful when it is easier to calculate the probability of the opposite (complement) than to calculate the probability of the original event directly.

## ✨ 5. Example Application of the Complement Rule

Situation: Tossing two coins. Determine the probability of **NOT** getting two tails ($A^c$).<br>
1. Calculate the probability of event $A$ (getting two tails): $P(TT) = 0.25$.<br>
2. Use the Complement Rule:
$$P(\text{Not } TT) = 1 - P(TT) = 1 - 0.25 = 0.75$$
The probability of not getting two tails is $0.75$ or $75\%$.

---

# 6.2 Independent and Dependent

<h2><i>Explanatory Video</i></h2>

<div class="main-container">

  <center>
    <iframe src="https://www.youtube.com/embed/LS-_ihDKr2M" 
            width="768" height="400" data-external="1" 
            style="border:0; border-radius:12px;">
    </iframe>
  </center>

  <h3>If the video does not appear, please click the YouTube link below:</h3>
  <p><a href="https://youtu.be/LS-_ihDKr2M" target="_blank">https://youtu.be/LS-_ihDKr2M</a></p>

</div>

<div class="pastel-box">
  <div class="pastel-title"></div>
  <p>
This video explains the difference between **independent events** and **dependent events** in probability. It also explains how to calculate the probability of two events occurring simultaneously ($A$ and $B$), called the **Intersection**, the formula for which depends on whether the events are Independent or Dependent.
</p>
  <p style="text-align:center; font-size:1.4em; font-weight:bold;"></p>
</div>


## ☀️ 1. Basic Definitions
```{r, echo=TRUE, message=FALSE}
# Load library
library(knitr)

# Membuat data frame
tabel_konsep <- data.frame(
  Concept = c("Event", "Intersection (A∩B)"),
  Brief_Explanation = c("An outcome or a collection of outcomes that can occur in a random experiment.", "The probability that event A and event B occur together.")
)

# Menampilkan tabel
kable(tabel_konsep, caption = "Basic Probability Concepts")
```

## ☀️ 2. Independent Events

Two events $A$ and $B$ are called independent if the occurrence of $A$ does not affect the probability of $B$ occurring.

Main Formula:<br>
The probability of the intersection is calculated by multiplying the probabilities of the individual events.
$$P(A \cap B) = P(A) \times P(B)$$
Characteristics & Examples:
- Constant Probability:The probability of the next event remains the same, as nothing changes in the sample space.
- Sampling Type: Usually occurs with sampling with replacement.
- Example: Tossing a coin twice, or drawing a marble and then replacing it before the second draw.

## ☀️ 3. Dependent Events

Two events $A$ and $B$ are called dependent if the occurrence of $A$ changes the probability of $B$ occurring.

Key Concept: Conditional Probability<br>
For dependent events, we use the concept of Conditional Probability, symbolized as $P(B \mid A)$.<br>
- $P(B \mid A)$ is the probability that event $B$ occurs, given that event $A$ has already occurred.

Main Formula:<br>
The probability of the intersection is calculated by multiplying the probability of the first event by the probability of the second event after the first one has occurred.
$$P(A \cap B) = P(A) \times P(B \mid A)$$
Characteristics & Examples:<br>
- Changing Probability: The probability of $B$ changes because the composition of the sample space has changed after $A$ occurred.<br>
- Sampling Type: Usually occurs with sampling without replacement.<br>
- Example: Drawing a card from a deck without replacing it, or drawing a marble and not replacing it.4. Example ApplicationSituation: A jar contains 4 Red and 6 Blue marbles (Total 10). You want to draw 2 Red marbles consecutively.

## ☀️ 4. Example Application

Situation: A jar contains 4 Red and 6 Blue marbles (Total 10). You want to draw 2 Red marbles consecutively.

```{r, echo=TRUE, message=FALSE}
library(knitr)

# Membuat data frame
tabel_kejadian <- data.frame(
  Event_Type = c("Independent", "Dependent"),
  Condition = c("Sampling with replacement", "Sampling without replacement"),
  Calculation = c(
    "$$P(R1 \\cap R2) = P(R1) \\times P(R2) = \\frac{4}{10} \\times \\frac{4}{10} = \\frac{16}{100} = 0.16$$",
    "$$P(R1 \\cap R2) = P(R1) \\times P(R2 \\mid R1) = \\frac{4}{10} \\times \\frac{3}{9} = \\frac{12}{90} \\approx 0.133$$"
  )
)

# Menampilkan tabel
kable(tabel_kejadian, caption = "Probability Calculation for Independent and Dependent Events", escape = FALSE)
```

# 6.3 Union of Events

<h2><i>Explanatory Video</i></h2>

<div class="main-container">

<center> 
 <iframe src="https://www.youtube.com/embed/vqKAbhCqSTc" width="768" height="400" data-external="1" style="border:0; border-radius:12px;"> 
 </iframe>
</center>

<h3>If the video does not appear, please click the YouTube link below:</h3> <p><a href="https://youtu.be/vqKAbhCqSTc" target="_blank">https://youtu.be/vqKAbhCqSTc</a></p>

</div>

<div class="pastel-box">
  <div class="pastel-title"></div>
  <p> 
This video explains how to calculate the probability of "A or B" or "A and B" when dealing with two (or more) events. Key terms:
</p>

<ul> 
<li><strong>Union</strong> — means "A or B (or both)". Written as: (A \cup B)</li> <li><strong>Intersection</strong> — means "A and B occur together". Written as: (A \cap B)</li> 
</ul>

  <p> 
Because there can be overlap (the area where two events occur simultaneously / intersection) between A and B, we must avoid double-counting the intersection when calculating the union probability.
</p>
  <p style="text-align:center; font-size:1.4em; font-weight:bold;"></p>
</div>

## 🚀 1. Basic Definitions in Probability (Review)

a. Sample Space ($S$):<br>
- The Sample Space is the set of all possible outcomes of a random experiment.<br>
- Example from the Video: Rolling two 6-sided dice; the total sample space is $6 \times 6 = 36$ possible outcomes.

b. Event:<br>
- An event is a subset of the sample space, which is a collection of outcomes that satisfy a specific condition.

c. Intersection ($A \cap B$):<br>
- $A \cap B$ is the set of outcomes included in both $A$ and $B$ — meaning both events occur simultaneously.
- This is called the overlap area or duplicate outcomes.

d. Union ($A \cup B$):<br>
- $A \cup B$ is the probability that at least one of the events occurs ($A$ or $B$ occurs, or both).

<img id="Foto" src="https://raw.githubusercontent.com/bomass1116/unnn/main/binomial.jpg" alt="Logo" style="width:200px; display: block; margin: auto;">
[click if Image does not appear](https://1drv.ms/i/c/1ad1f53bf8aca8de/EV5e8NdfK7VJtAArHp3HbAkBl6SEYr2_pJHj4rWrT10EHA?e=0o4i3Y){ width=50% }

🚀 2. Formula for Union Probability — General Addition Rule

We use this formula when the question contains the keyword "or".<br>
General Addition Rule:
$$P(A \cup B) = P(A) + P(B) - P(A \cap B)$$
- $P(A \cup B)$: The probability that $A$ or $B$ occurs.<br>
- $P(A \cap B)$: The probability of the intersection ($A$ and $B$ occur simultaneously).

For Disjoint (Mutually Exclusive) Events:<br>
If $A$ and $B$ are disjoint, their intersection probability is zero:
$$P(A \cap B) = 0$$
Thus, the formula simplifies to:
$$P(A \cup B) = P(A) + P(B)$$

## 🚀 3. Why Must We Subtract the Intersection ($P(A \cap B)$)?

- Objective: The union formula requires the term $- P(A \cap B)$ because we want to eliminate duplicate outcomes.<br>
- Visual Explanation: When you directly add $P(A) + P(B)$, the outcomes in the intersection area ($A \cap B$) are counted twice.<br>
- To ensure every outcome in the sample space is counted exactly once, the intersection probability must be subtracted once.

## 🚀 4. Example Application

Using the example of rolling two dice:<br>
- A: Event “Getting two even numbers” $\rightarrow P(A) = \frac{9}{36}$<br>
- B: Event “Getting at least one 2” $\rightarrow P(B) = \frac{11}{36}$

Then:<br>
- Intersection ($A \cap B$): Event “Getting two even numbers AND at least one 2”.
$$ P(A \cap B) = \frac{5}{36} \quad \text{(taken from the sample space overlap)}$$
- Union ($A \cup B$): Probability of getting $A$ OR $B$.
$$\begin{aligned}P(A \cup B) &= P(A) + P(B) - P(A \cap B) \&= \frac{9}{36} + \frac{11}{36} - \frac{5}{36} \&= \frac{15}{36} \approx 0.4167\end{aligned}$$
Meaning: the probability of getting "two even or at least one 2" is approximately 41.67%.

# 6.4 Exclusive and Exhaustive
<h2><i>Explanatory Video</i></h2>

<div class="main-container">

<center> 
<iframe src="https://www.youtube.com/embed/f7agTv9nA5k" width="768" height="400" data-external="1" style="border:0; border-radius:12px;"> </iframe>
</center>

<h3>If the video does not appear, please click the YouTube link below:</h3> <p><a href="https://youtu.be/f7agTv9nA5k" target="_blank">https://youtu.be/f7agTv9nA5k</a></p>

</div>

<div class="pastel-box">
  <div class="pastel-title"></div>
  <p> 
This video explains the difference and definitions of two types of events in probability theory: 
<ul>
<li><strong>Mutually Exclusive Events</strong> — two events that cannot occur simultaneously.</li> 
<li><strong>Exhaustive Events</strong> — a collection of events that covers all possible outcomes of an experiment.</li> 
</ul> 
</p>
  <p style="text-align:center; font-size:1.4em; font-weight:bold;"></p>
</div>

## ⭐ 1. Exclusive Events (Mutually Exclusive)

Definition<br>
Two events are said to be mutually exclusive if they cannot occur at the same time. This means no outcome appears in both events simultaneously.

Mathematically:
$$A \cap B = \emptyset$$
There is no "overlap".

Example:<br>
On a single die roll:

- A = rolling an even number = {2,4,6}
- B = rolling an odd number = {1,3,5}
- A and B have no intersection → mutually exclusive.

Formula ConsequenceIf A and B are mutually exclusive:
$$P(A \cup B) = P(A) + P(B)$$
No need to subtract the overlap because there is no intersection.

<img id="Foto" src="https://raw.githubusercontent.com/bomass1116/exlcv/main/W.jpg" alt="Logo" style="width:200px; display: block; margin: auto;">
[click if Image does not appear](https://1drv.ms/i/c/1ad1f53bf8aca8de/EUBswmuk1YFNtyfUS6TWUMoB_gaEQpJ7UrCNrXRXZKOeag?e=jusgdI){ width=50% }

## ⭐ 2. Exhaustive Events (Collectively Exhaustive / Comprehensive)D

efinition<br>
Events are called exhaustive if together they cover the entire sample space.
$$P(A \cup B) = S$$
All possible outcomes are included in A or B or both.

Example<br>
Die roll:

- A = even number = {2,4,6}
- B = odd number = {1,3,5}
- Union of A and B = S = {1,2,3,4,5,6}

→ A & B are exhaustive.
Note: They are both exhaustive and mutually exclusive.

Another example that is not mutually exclusive but is still exhaustive:

- A = number > 3 → {4,5,6}<br>
- B = number $\le$ 3 → {1,2,3}

The union = the entire sample space.
There is no requirement to be mutually exclusive.

## ⭐3. Key Differences
```{r, echo=FALSE}
library(knitr)

tabel_exclusive_exhaustive <- data.frame(
  Concept = c(
    "Can A and B occur together?",
    "Intersection A &cap; B",
    "Union A &cup; B",
    "Example"
  ),
  Mutually_Exclusive = c(
    "No",
    "Always empty",
    "Not necessarily = S",
    "Even vs Odd"
  ),
  Exhaustive = c(
    "Yes (overlap is allowed or not allowed)",
    "Can be empty / can be non-empty",
    "Must equal the entire sample space",
    "Even & Odd, or >3 &le; 3"
  ),
  check.names = FALSE
)

kable(
  tabel_exclusive_exhaustive,
  caption = "Differences Between Mutually Exclusive and Exhaustive Events",
  escape = FALSE
)
```

## ⭐ 4. Relationship Between the Two

- Two events can be mutually exclusive + exhaustive (example: even vs. odd).<br>
- They can be exhaustive but not exclusive (example: A = $\ge 3$, B = $\le 4$).<br>
- They can be exclusive but not exhaustive (example: A = 1, B = 6 on a die → they don't cover the whole 1–6).

## ⭐ 5. Example ApplicationE

xample 1 (Mutually Exclusive)<br>
Given:

- A = rolling a 2
- B = rolling a 5

Then:
$$P(A \cup B) = P(A) + P(B) = \frac{1}{6} + \frac{1}{6} = \frac{2}{6}$$

Example 2 (Exhaustive)<br>
Given:

- A = rolling a number $\le 3$
- B = rolling a number $\ge 3$

The union covers the entire sample space → exhaustiveness.

# 6.5 Binomial Experiment
<h2><i>Explanatory Video</i></h2>

<div class="main-container">

<center> 
<iframe src="https://www.youtube.com/embed/nRuQAtajJYk" width="768" height="400" data-external="1" style="border:0; border-radius:12px;"> </iframe> 
</center>

<h3>If the video does not appear, please click the YouTube link below:</h3> <p><a href="https://youtu.be/nRuQAtajJYk" target="_blank">https://youtu.be/nRuQAtajJYk</a></p>

</div>

<div class="pastel-box">
  <div class="pastel-title"></div>
  <p> 
This video explains the basic concept of a Binomial Experiment and how to calculate probabilities using the Binomial Formula. The core concept of the binomial probability distribution is that it is a probability distribution for an experiment that is repeated multiple times and yields only two possible outcomes (as indicated by the prefix "bi-"), which are: Success or Failure. 
</p>
  <p style="text-align:center; font-size:1.4em; font-weight:bold;"></p>
</div>

## 🪐 1. Binomial Experiment

A Binomial Experiment is a type of statistical trial that satisfies four key characteristics:<br>
- There is a fixed number of trials ($n$).<br>
- Each trial has only two outcomes: success or failure.The probability of success ($p$) is constant for every trial.<br>
- The trials are independent (they do not influence each other).

Case examples:Coin Toss: <br>
- Calculating the probability of getting exactly one head (success) in three coin tosses. Since all 4 conditions are met (fixed number of trials $n=3$, only two outcomes, constant probability $p=0.5$, and independent trials), this is a binomial experiment.<br>
- Marble Draw: Calculating the probability of getting exactly two green marbles (success) in five draws with replacement. Replacement ensures that the probability of success remains constant in every trial, thus satisfying the binomial condition.

## 🪐 2. Conditions for a Binomial Experiment

A trial is called binomial if it meets four conditions:

✔ Fixed number of trials<br>
The number of trials $n$ is predetermined.<br>
✔ Two possible outcomes<br>
Each trial yields only success or failure.<br>
✔ Constant probability<br>
The probability of success $p$ is the same for every trial.<br>
✔ Independent trials<br>
One trial does not affect the others.

## 🪐 3. Important Notation in Binomial Distribution

- \( n \) = number of trials
- \( x \) = number of successes
- \( p \) = probability of success
- \( q = 1 - p \) = probability of failure
- \( P(X = x) \) = probability of getting exactly \( x \) successes

## 🪐 4. Binomial Distribution Formula

The binomial probability formula:
$$P(k) = {n \choose k} \cdot p^k \cdot (1-p)^{\,n-k}$$
Where:<br>
- \( P(k) \) : the probability of getting exactly \( k \) successes.<br>
- \( {n \choose k} \) : the combination, which is the number of ways to get \( k \) successes fromsukses dari \( n \) trials.<br>
- \( n \) : total number of trials.<br>
- \( k \) : desired number of successes.<br>
- \( p \) : probability of success on a single trial.<br>
- \( (1-p) \) : probability of failure on a single trial.

## 🪐 5. Binomial Example Problem

Using the marble draw case to demonstrate the formula:<br>
Case: If 5 marbles are drawn with replacement, what is the probability of getting exactly 2 green marbles?

$n = 5$ (Trials)<br>
$k = 2$ (Successes)<br>
$p = 0.2$ (Probability of green, from $\frac{2}{10}$)<br>
$1-p = 0.8$ (Probability of non-green)

Thus:
$$P(k=2) = {5 \choose 2}(0.2)^2(1-0.2)^{5-2}$$
$$P(k=2) = 10 \times (0.04) \times (0.8)^3$$
$$P(k=2) = 10 \times 0.04 \times 0.512$$
$$P(k=2) = 0.2048$$
So, the probability is 0.2048.

##🪐 6. Why is it called "Binomial"?

The name Binomial comes from the fact that this distribution always has two possible outcomes in every trial, indicated by the prefix bi (meaning two).

# 6.6 Binomial Distribution
<h2><i>Explanatory Video</i></h2>

<div class="main-container">

<center> 
<iframe src="https://www.youtube.com/embed/Y2-vSWFmgyI" width="768" height="400" data-external="1" style="border:0; border-radius:12px;"> </iframe>
</center>

<h3>If the video does not appear, please click the YouTube link below:</h3> <p><a href="https://youtu.be/Y2-vSWFmgyI" target="_blank">https://youtu.be/Y2-vSWFmgyI</a></p>

</div>

<div class="pastel-box">
  <div class="pastel-title"></div>
  <p> 
This video continues the discussion on the Binomial Distribution, focusing on how to visualize the data and understand its distributional properties, and visually explaining how the shape of the Binomial Distribution graph is influenced by changes in the number of trials ((n)) and the probability of success ((p)).
</p>
  <p style="text-align:center; font-size:1.4em; font-weight:bold;"></p>
</div>

## 🌙 1. Formula and Basic Visualization

- Objective: The Binomial Distribution is used to calculate the probability of getting a specific number of successes ($k$) from a fixed number of trials ($n$).<br>
- Example: Tossing a coin 2 times ($n=2$) with a probability of Head ($p=0.5$). The probabilities for getting 0, 1, or 2 Heads are calculated using the Binomial Formula.<br>
- Graph: These probabilities are visualized using a bar chart (histogram), where the X-axis is the number of successes ($k$) and the Y-axis is the probability ($P(k)$).<br>

## 🌙 2. Important Parameters

Formulas for parameters to measure the Binomial Distribution:
```{r, echo=TRUE, message=FALSE}
# Load library for kable
library(knitr)

# Membuat data frame tabel
tabel_binomial <- data.frame(
  Parameter = c("Mean (μ)", "Variance (σ²)", "Standard Deviation (σ)"),
  Formula = c("μ = n · p",
            "σ² = n · p · (1 - p)",
            "σ = √(n · p · (1 - p))"),
  Description = c("The expected average number of successes.",
                 "A measure of the spread of the data from the mean.",
                 "The square root of the variance.")
)

# Menampilkan tabel
kable(tabel_binomial, caption = "Table of Binomial Distribution Parameters")
```

## 🌙 3. Influence of Probability of Success ($p$) on Distribution Shape

The probability of success ($p$) controls the shape of the distribution curve. It determines whether the graph will be symmetrical or skewed:
```{r, echo=FALSE, message=FALSE, warning=FALSE, results='asis'}
library(knitr)
library(htmltools)


# Data frame
tabel_skew <- data.frame(
  Value_p = c("p = 0.5 (50%)", "p < 0.5", "p > 0.5"),
  Distribution_Shape = c(
    "Symmetrical",
    "Skewed Right",
    "Skewed Left"
  ),
  Description = c(
    "Perfectly symmetrical distribution, its peak is in the middle, resembling the Normal Distribution.",
    "Low probability of success, so most outcomes pile up at small number of successes (near 0).",
    "High probability of success, so most outcomes pile up at large number of successes (near n)."
  )
)

# Tabel kable BIASA
tbl <- kable(
  tabel_skew,
  format = "html",
  caption = "✨ Shape of the Binomial Distribution ✨",
  table.attr = 'class="y2k-blue-table"'
)

# Cetak HTML apa adanya
cat(tbl)
```

Visualization of Probability ($p$) Influence:<img id="Foto" src="https://raw.githubusercontent.com/bomass1116/aam/main/hstgrm.jpg" alt="Logo" style="width:200px; display: block; margin: auto;">
[click if Image does not appear](https://1drv.ms/i/c/1ad1f53bf8aca8de/ESa25j4M5JBHqt0hnl0s5woB8yFS3zBq_Ar4pQtL1XR6_A?e=Ye0TVM){ width=50% }

1. Case $p < 0.5$
- (Skewed Right)Visual: The graph on the left.
- Observation: The histogram peak is near the value 0 (small number of successes). The tail of the curve extends to the right towards larger number of successes. This happens because the probability of failure ($1-p$) is greater than the probability of success ($p$), making the most likely outcome a large number of failures.

2. Case $p = 0.5$ 
- (Symmetrical)Visual: The graph in the center.
- Observation: The histogram is perfectly symmetrical, resembling a bell shape. The peak is exactly in the middle (at $\mu = n \cdot p$). This happens because the probability of success and failure are equal ($50\%$), balancing the probability distribution.

3. Case $p > 0.5$ 
- (Skewed Left)Visual: The graph on the right.
- Observation: The histogram peak shifts to the right, piling up near the value $n$ (large number of successes). The tail of the curve extends to the left towards the value 0. This happens because the probability of success ($p$) is greater than the probability of failure, making the most likely outcome a high number of successes.

## 🌙 4. Normal Approximation to Binomial

As the value of the number of trials ($n$) increases, the shape of the Binomial Distribution becomes increasingly similar to the bell curve of the Normal Distribution.<br>
- Approximation Condition: Normal Approximation to Binomial can be used (to simplify calculations) if both of the following conditions are met:<br>
1.
$$n \cdot p \ge 10$$
2.
$$n \cdot (1-p) \ge 10$$

Visualization of Number of Trials ($n$) Influence:
<img id="Foto" src="https://raw.githubusercontent.com/bomass1116/.../main/histogram.jpg" alt="Logo" style="width:200px; display: block; margin: auto;">
[Click if Image does not appear](https://1drv.ms/i/c/1ad1f53bf8aca8de/ESPmvQ1j9EFJrOEWp19fBVwBh_BmUh6230yUr3MTnl9qUw?e=4IjIxz){ width=50% }

1. Small $n$ (e.g., $n=5, p=0.5$)
- Representation: The first graph at the top.
- Observation: The histogram (blue bars) looks discrete and the difference in height between bars is significant. The Normal curve (red line) looks very wide and is not suitable for modeling this Binomial histogram.
- Validity: $n \cdot p = 5 \cdot 0.5 = 2.5$. Since $2.5 < 10$, the approximation is invalid.

2. Medium $n$ (e.g., $n=20, p=0.5$)
- Representation: The middle graph.
- Observation: The histogram is starting to become symmetrical and shows a bell shape. The Normal curve is starting to encompass the histogram bars better.
- Validity: $n \cdot p = 20 \cdot 0.5 = 10$. Since $10 \ge 10$, the approximation condition is beginning to be met. This approximation can start to be used, although accuracy may need improvement (usually requiring continuity correction).

3. Large $n$ (e.g., $n=50, p=0.5$)
- Representation: The bottom graph.
- Observation: The histogram becomes very dense and smooth. The midpoint of each histogram bar almost falls perfectly on top of the Normal Distribution bell curve.
- Validity: $n \cdot p = 50 \cdot 0.5 = 25$. Since $25 \ge 10$, the approximation is highly valid and yields very accurate results. This is the perfect visualization of the Normal Approximation to Binomial.

# Conclusion

<div class="pastel-box">
  <div class="pastel-title"></div>
  <p>The material in these six videos is the backbone that strengthens our understanding of uncertainty. In Data Science, everything is based on probability. By mastering Probability and the Binomial Distribution, we have the mathematical logic foundation to build, test, and trust all prediction models and conclusions we make.
</p>
  <p style="text-align:center; font-size:1.4em; font-weight:bold;"></p>
</div>

## 📋 1. Summary of 6 Key Topics

1. Fundamental Concepts<br>
Introduction to the basics of probability (sample space, events, chance/likelihood).
2. Independent & Dependent Events<br>
Understanding whether the outcome of one event affects the probability of another.
3. Union of Events<br>
Rules for calculating the probability of one or more events occurring ($P(A \text{ or } B)$).<br>
4. Exclusive & Exhaustive Events<br>
Classification of relationships between events. For example, mutually exclusive events mean both cannot occur simultaneously.<br>
5. Binomial Experiment<br>
Defining the conditions for a trial to have its probability calculated using the Binomial formula (e.g., only two outcomes: success/failure, and trials are repeated independently).<br>
6. Binomial Distribution<br>
Application tool for calculating the probability of obtaining a specific number of successes in a fixed number of trials that satisfy the Binomial conditions.

## 📋 2. Interconnection (Logical Progression)

The interrelation among these six videos is a progression from the most basic rules to the use of a specific model tool:

1. Logical Foundation (V1-V4)<br>
Videos 1 to 4 form the philosophical and mathematical basis of probability. We must understand what probability is, how events influence each other (V2 - Independent and Dependent), and how to calculate the probability of a union (V3 - Union) before we can go further. The concepts in V4 (Exclusive and Exhaustive) help us classify the relationship between these events.<br>
2. Moving Towards Application (V5)<br>
The concepts from V1-V4 are used to establish the conditions of a trial. Video 5 (Binomial Experiment) teaches us how to identify real-world situations that satisfy all those basic probability rules (e.g., the outcome of each coin toss is independent, consistent with V2). This is the stage of determining whether the Binomial model can be used.<br>
3. Using the Tool (V6)<br>
Once we are confident that the situation meets the conditions (V5), we then use the Binomial Distribution (V6) to calculate specific probabilities (e.g., what is the probability of getting 7 successes out of 10 trials). This calculation depends on the validity of the conditions learned in V1-V5.(Basic Probability Rules (V1-V4) validate the Trial Conditions (V5), which then allows us to use the Model (V6) for prediction.)

## 📋3. Impact of This Material on Statistics Learning in Data Science

A deep understanding of Probability Theory, especially the Binomial Distribution, is crucial because it is the foundation for Statistical Inference and Binary Data Modeling (success/failure outcomes) in Data Science.
```{r, echo=FALSE, warning=FALSE, message=FALSE, }
library(knitr)
library(dplyr)
library(magrittr)   # optional, but safe
library(kableExtra)

tabel_dampak <- data.frame(
  Positive_Impact = c(
    "Understanding A/B Testing",
    "Foundation for Classification Models",
    "Validation and P-Value",
    "Basis for Sampling"
  ),
  Easy_Explanation = c(
    "The Binomial Distribution is the primary tool in A/B Testing...",
    "Many Data Science problems are binary classification...",
    "This concept is closely related to hypothesis testing and p-value...",
    "Independent and Dependent concepts are crucial for valid samples..."
  )
)

tabel_dampak %>%
  kable("html", escape = FALSE,
        caption = "✨ Positive Impact of the Binomial Distribution ✨")
```

# References

<div class="pastel-box">
  <div class="pastel-title"></div>
  <p>
- https://bookdown.org/dsciencelabs/intro_statistics/06-Essentials_of_Probability.html
- https://youtu.be/ynjHKBCiGXY (Fundamental Concept)
- https://youtu.be/LS-_ihDKr2M (Independent and Dependent)
- https://youtu.be/vqKAbhCqSTc (Union of Events)
- https://youtu.be/f7agTv9nA5k (Exclusive and Exhaustive)
- https://youtu.be/nRuQAtajJYk (Binomial Experiment)
- https://youtu.be/Y2-vSWFmgyI (Binomial Distribution)
- https://gemini.google.com/app
- https://chatgpt.com/
- Walpole, Ronald E., Raymond H. Myers, and Sharon L. Myers.<br>
Probability and Statistics for Engineers and Scientists. [https://books.google.com/books/about/Probability_Statistics_for_Engineers_Sci.html?id=tAs50bpeX9oC] (Added as a core reference for probability and distributions.)
- Freund, John E.<br>
Mathematical Statistics. [https://www.probabilitycourse.com/] (Added as a core reference for theoretical statistics.)
- Hogg, Robert V., Joseph W. McKean, and Allen T. Craig.<br>
Introduction to Mathematical Statistics. [https://openlibrary.telkomuniversity.ac.id/home/catalog/id/151031/slug/introduction-to-mathematical-statistics.html] (Added as a core reference for theoretical statistics, alternative to Freund.)
- Spiegel, Murray R., and Larry J. Stephens.<br>
Schaum's Outline of Theory and Problems of Statistics. [https://anyflip.com/ljjmh/cgnr/basic] (Added as a supplementary reference for problems and theory.)
</p>
  <p style="text-align:center; font-size:1.4em; font-weight:bold;"></p>
</div>
