Binomial distribution in Statistics is the probability of success (p) or failure (1-p) in a experiment which repeated multiple times. Therefore, we can say binomial is a sequence of indentical Bernoulli events.Also, Binomial distribution graph describes the probability of achieving our desired outcome a specific number of times.
Binomial distribution is used to compute the probability of sucess in the experiment which repeated multiple times and the outcome is either a sucess or a failure
4 requirements needed to be a binomial distribution:
There are only two outcomes, caleed success and failure
The n trials are independent
The probability of each outcome remains constant from trial to trial.
A fixed number of trials
The probability of having four or less correct answers if a student attempts to answer every question at random:
Considering there is only one out five possible answers is correct. The probability to answer a question correctly is 1/5 = 0.2
The probability of having four or less correct answers if a student attemps to answer every question at random:
## [1] 0.6296483
The probability is 62.96483%.
Poisson Distribution in statistic is a distribution that shows how many times an event is possible to happen within a specific period time. It start with letter Po and a single value parameter, lambda. It is also deals with the frequency with which an event occurs within a specific interval. Simply put, it requires knowing how often it occurs for a specific period of time or distance.
Poisson distribution is used when there are independent events which happen at a instant rate in interval of time. It is used because it can be a tool that helps people to predict the probability of certain events from happening when we know how often the event has occured and it gives us the probability of a given number of events happening in a fixed interval of time
The difference between Binomial distribution and Poisson distribution are
The binomial distribution is only for the probability of sucess on a certain frame while the poisson distribution is used for events which could occur a very large number of times.
The Binomial distribution only has 2 outcomes, success or failure and the poisson distribution has no limit of possible outcome
The probability of sucess in binomial distribution is constant and in poisson distribution has tiny ability to succed.
What we know: 20 cars crossing/ minute on average what we need to find: find the probability of having >= 13 cars in a particular minute and vizualise
Average number of cars crossing per minute is \(μ=20\)
we will use ppois function to find the porbability of having or more 13 cars, so we will use lower.trail= F and set q at 16
## Warning: package 'ggplot2' was built under R version 3.6.3
## Warning: package 'dplyr' was built under R version 3.6.3
##
## Attaching package: 'dplyr'
## The following objects are masked from 'package:stats':
##
## filter, lag
## The following objects are masked from 'package:base':
##
## intersect, setdiff, setequal, union
## [1] 0.06612764
We have 6.612764% probability of having 13 or more cars crossing the bridger in any particular minute in the upper tail of the probability density function
## [1] 0.9338724
We have 93.38724% probability of having 13 or more cars crossing the bridger in any particular minute.
The continous uniform distribution is a continous probability distribution and we can find when it is concerned with events that are equally likely to occur.Also, when the cdf that the function is continuous, is does not go in steps like the discrete case. Further, the probability density function takes values in the range a to b. An example of the continous uniform distribution is an idealized random number generator.
Uniform distribution is divided into categories which are discrete uniform distributions and continuous uniform distribution. Discrete uniform distributions have a finite number of outcomes. A continuous uniform distribution is a statistical distribution with an infinite number of equally likely measurable values.
It means the outcomes of uniform distribution have equal probability, the mean and the variance are uniterpretable, and possess no predictice power.
We solve uniform distribution problems using R by applying the function runif. The runif() function generates random deviates of the uniform distribution and is written. We also use dunif() function gives the density
The exponential disribution example is actually the time between events in a poisson process. it is also describes the arrival time of a randomly recurring independent event sequenceThe exponential distribution example length, the amount of time, in months, and the amount of time until a tsunami occurs.
The exponential distribution is a continous probability ditribution used for modelling the time we need to wait before a given event occurs.
The exponential distribution rate is = 0 and β = 1, x ≥ μ; β > 0
The exponential distribution is one of the widely used continuous distribution
What we know: mean = 3 minutes What we want to know: probability being checkout and completed in less than two minutes
## [1] 0.4865829
The probability is 48.65829%
In statistics, the normal distribution is the function of probability that portrays how the values of a variable are distributed and it graph shows that the majority of the data center around the mean. The standard normal distribution (also known as Z distribution) is a normal distribution with a mean of 0 and a standard deviation of 1.Also, it is zero-centered and the extent to which a particular measurement deviates from the mean is given by the standard deviation.
It is called normal distribution because its distribution are at least approximately normally distributed and has mean= median= mode, symmetry about the center, and 50% of values less than the mean and 50% greater than the mean. Also, it approximates the binomial probabilites well.
we can calculate normal distribution in R using pnorm function. pnorm function has the same default mean and sd arguments as dnorm. The function pnorm returns the integral from minus infinite to q of the pdf of the nomal distribution where q is a Z-score. it is the function that replaces the table of probabilites and Z-scores.
The characteristics of a nomral distributions are
symmetric
unimodal
asymptotic
the mean, media, and mode are equal.
## [1] 0.3085375
## [1] 0.372079
## [1] 0.8413447
Chi square distribution is used for hypothesis testing, computing confidence interval, determining the goodness of fit
The chi square distribution formula is capital Greek letter chi squared followed by a parameter“l dpicting the degress of freedom. So, we can read it with variable as”Variable Y follows a chi square distribution with degress of freddom.
Chi square distribution is the distribution of the sum of squared standard normal deviates.
Chi square distribution is continuous
Note: There are two ways to solve this problem, using the T Distribution Calculator. Both approaches are presented below. Solution A is the traditional approach. It requires you to compute the t statistic, based on data presented in the problem description. Then, you use the T Distribution Calculator to find the probability. Solution B is easier. You simply enter the problem data into the T Distribution Calculator. The calculator computes a t statistic “behind the scenes”, and displays the probability. Both approaches come up with exactly the same answer.
The student \(t\) distribution used for estimating population parameters when the sample size is small or when the population variance is unknown
It is called Student \(t\) distribution because Gosset’s employer preferred staff to use pen names so he used the name “Student” to hide his identity. Also, it become well known when Ronald Fisher who called the distribution as “Student’s distribution” and represented the test value with the letter t.Also, \(t\) distribution has own charateristric which is the mean of the distribution is equal to 0 and its standard deviations differ ccording to the sample size n.
We can find the student \(t\) distribution by looking at the distribution it self. It is symmetric, bell-shaped, and has heavier tails which mean that it is more likely to proce values that fall far from its mean
The student \(t\) distribution is a continuous distribition.
| Population | Population standard deviation | Sample standard deviation |
|---|---|---|
| Women | 30 | 35 |
| Men | 50 | 45 |
F distribution is the density function of a probability that is used in analysis of cariance and is a function of the ratio of two independent random variables which each of them has a chi square distribution and is divided by its number of degress of freedom.
F distribution tells us the value to find out if the means between two populations are significantly different
F distribution is used for deciding to accept or reject the null hypothesis.
The F distribution is derived by defining the quotient \(Z= Y/X\). Then the PDF of Z can be determined from the result of the joint distribution of a quotient. Now, the gamma function has been defined. Then, we use the substitution \(t=x((z+1)/2)\). Therefore, we can found the quotient \(F=(Y/m)/(X/n)\). So, after that we can decide it’s PDF.