Statistical Theoretical Distribution

Last Updated on: 13th December 2023, 01:43 pm

Statistical theoretical distributions are deduced mathematically from certain assumptions; they are not obtained from actual observations or experiments.

Types of theoretical distributions commonly used in statistical analysis

Binomial Distribution, due to James Bernoulli
Poisson Distribution, due to S. D. Poisson
Normal Distribution, due to De Moivre

Importance of Theoretical Distribution:

Estimate of nature and trend of frequency distribution: On the basis of theoretical frequency distribution, the nature and trend of frequency distribution can be estimated under certain assumptions and conditions.
Basis of logical decisions: Risk and uncertainty of an event can be analysed on the basis of theoretical distribution for taking logical decisions.
Forecasting: Theoretical distributions provide a basis for prediction, projection and forecasting.
Test of Sampling: They serve as benchmarks against which actual frequency distributions and their deviations are compared.

Binomial Distribution

Binomial Distribution is useful where each trial has only two possible outcomes (e.g. success or failure, good or defective, hit or miss, yes or no, etc.).

Binomial Probability Function
Binomial probability distribution gives the probability of obtaining exactly x successes and (n – x) failures in n trials.

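Successes (x)	Probability f(x)
0	$\displaystyle {}^{n}C_{0}\,p^{0}q^{n-0}=q^{n}$
1	$\displaystyle {}^{n}C_{1}\,p^{1}q^{n-1}$
2	$\displaystyle {}^{n}C_{2}\,p^{2}q^{n-2}$
…	…
x	$\displaystyle {}^{n}C_{x}\,p^{x}q^{n-x}$
…	…
n	$\displaystyle {}^{n}C_{n}\,p^{n}q^{n-n}=p^{n}$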

The probability of x successes in a given number of trials is given by
$\displaystyle f(x)={}^{n}C_{x}\,p^{x}q^{n-x}$ … (1), for x = 0, 1, 2, …, n [the random variable x is an integer],
where p = constant probability of success in a single trial, q = 1 − p (as p + q = 1, q is the probability of failure), n = number of trials, and x = number of successes in n trials (x $\displaystyle \le$ n).

The above terms of f(x) are the successive terms of the binomial expansion of (q + p)ⁿ, i.e. $\displaystyle q^{n}+{}^{n}C_{1}\,p^{1}q^{n-1}+{}^{n}C_{2}\,p^{2}q^{n-2}+\ldots +p^{n}$ . So it is known as the Binomial distribution.

$\displaystyle \sum\limits_{x=0}^{n}f(x)=(q+p)^{n}=\{q+(1-q)\}^{n}=1^{n}=1$ (as p + q = 1, i.e. p = 1 − q)
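The probability function (1) and the fact that its terms add up to 1 can be checked numerically. The following is a minimal Python sketch; the function name `binomial_pmf` and the values n = 4, p = 0.5 are illustrative assumptions, not part of the original text.

```python
from math import comb

def binomial_pmf(x, n, p):
    """Probability of exactly x successes in n independent trials."""
    q = 1.0 - p                      # probability of failure
    return comb(n, x) * p**x * q**(n - x)

# Illustrative values (assumed): 4 trials with p = 1/2
n, p = 4, 0.5
probs = [binomial_pmf(x, n, p) for x in range(n + 1)]
total = sum(probs)                   # should equal (q + p)^n = 1

print(probs)
print(total)
```

Each entry of `probs` is one term of the expansion of (q + p)ⁿ, taken in ascending order of the number of successes.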

Binomial Distribution Properties

Binomial Distribution is a discrete probability distribution, where the random variable x (i.e. the no. of successes) assumes the values 0, 1, 2, …, n, where n is finite and x $\displaystyle \le$ n.
Mean = np, variance = npq, s.d. ( $\displaystyle \sigma$ ) = $\displaystyle \sqrt{\text{variance}}$ = $\displaystyle \sqrt{npq}$ . Skewness = (q − p) / $\displaystyle \sqrt{npq}$ ,
Kurtosis (in excess of the normal value of 3) = (1 − 6pq) / (npq), where p + q = 1.
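Skewness is positive for p < ½, negative for p > ½ and zero for p = ½. The value of x which has maximum probability (the mode) is the integer part of (n + 1)p; if (n + 1)p is itself an integer, both (n + 1)p and (n + 1)p − 1 have the maximum probability.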
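As a rough check of these properties, the mean, variance and skewness can be computed by direct summation over f(x) and compared with the closed-form expressions. This is only a sketch; the function names and the values n = 10, p = 0.3 are assumptions chosen for illustration.

```python
from math import comb, sqrt

def binomial_moments_by_summation(n, p):
    """Mean, variance and skewness computed directly from the probabilities f(x)."""
    q = 1 - p
    f = [comb(n, x) * p**x * q**(n - x) for x in range(n + 1)]
    mean = sum(x * fx for x, fx in enumerate(f))
    variance = sum((x - mean) ** 2 * fx for x, fx in enumerate(f))
    third = sum((x - mean) ** 3 * fx for x, fx in enumerate(f))
    skewness = third / variance ** 1.5   # moment coefficient of skewness
    return mean, variance, skewness

def binomial_moments_by_formula(n, p):
    """Mean = np, variance = npq, skewness = (q - p) / sqrt(npq)."""
    q = 1 - p
    return n * p, n * p * q, (q - p) / sqrt(n * p * q)

n, p = 10, 0.3  # illustrative values (assumed)
by_sum = binomial_moments_by_summation(n, p)
by_formula = binomial_moments_by_formula(n, p)
print(by_sum)
print(by_formula)
```

Since p < ½ in this illustration, both computations give a positive skewness.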

Assumptions of Binomial Distribution

1. Each trial has two mutually exclusive possible outcomes, i.e. success or failure.
2. Each trial is independent of other trials.
3. The probability of a success remains constant from trial to trial. For example, the probability of getting a head in a toss of a coin is ½, and this must remain the same in successive tosses.
4. The number of trials is fixed.

Sequence of p and q
The general form of the binomial distribution is the expansion of (p + q)ⁿ, in which the terms are written in descending order of the number of successes. If the terms are to be written in ascending order of the number of successes, (q + p)ⁿ is expanded instead.

Binomial expansion of events

No. of events	Binomial Expansion
1.	(p + q)¹ = p + q
2.	(p + q)² = p² + 2pq + q²
3.	(p + q)³ = p³ + 3p²q + 3pq²+ q³
4.	(p + q)⁴ = p⁴ + 4p³q + 6p²q²+ 4pq³ + q⁴
5.	(p + q)⁵ = p⁵ + 5p⁴q + 10p³q²+ 10p²q³ + 5pq⁴+ q⁵
6.	(p + q)⁶ = p⁶ + 6p⁵q + 15p⁴q²+ 20p³q³ + 15p²q⁴+ 6pq⁵ + q⁶
7.	(p + q)⁷ = p⁷ + 7p⁶q + 21p⁵q²+ 35p⁴q³ + 35p³q⁴+ 21p²q⁵ + 7pq⁶ + q⁷
8.	(p + q)⁸ = p⁸ + 8p⁷q + 28p⁶q²+ 56p⁵q³ + 70p⁴q⁴+ 56p³q⁵ + 28p²q⁶ + 8pq⁷+ q⁸
9.	(p + q)⁹ = p⁹ + 9p⁸q + 36p⁷q² + 84p⁶q³ + 126p⁵q⁴ + 126p⁴q⁵ + 84p³q⁶ + 36p²q⁷ + 9pq⁸ + q⁹

Rules for coefficients of binomial expansion
1. The first term is pⁿ.
2. The second term is $\displaystyle {}^{n}C_{1}\,p^{n-1}q$ .
3. In each succeeding term the power of p is reduced by 1 and the power of q is increased by 1.
4. The coefficient of any term is found by multiplying the coefficient of the preceding term by the power of p in that term and dividing the product so obtained by one more than the power of q in that preceding term.
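Rule 4 above can be sketched in a few lines of Python. The function name `binomial_coefficients` is a hypothetical helper introduced only for illustration; it builds the coefficients of (p + q)ⁿ term by term from the preceding coefficient.

```python
def binomial_coefficients(n):
    """Coefficients of (p + q)^n, generated with the preceding-term rule."""
    coeffs = [1]                 # the first term is p^n, with coefficient 1
    power_p, power_q = n, 0      # powers of p and q in the current term
    while power_p > 0:
        # next coefficient = previous coefficient * power of p / (power of q + 1)
        coeffs.append(coeffs[-1] * power_p // (power_q + 1))
        power_p -= 1             # power of p is reduced by 1
        power_q += 1             # power of q is increased by 1
    return coeffs

print(binomial_coefficients(5))
print(binomial_coefficients(8))
```

The integer division is exact at every step, because each intermediate result is itself a binomial coefficient.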

Binomial Distribution – Problems

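Ex. 4 coins are tossed simultaneously. What is the probability of getting (i) 2 heads, (ii) at least 2 heads and (iii) at least one head?

The random experiment consists of tossing 4 coins and observing the number of heads. Let the occurrence of a head be treated as a success.
p = probability of getting a head = ½ , q = 1 − ½ = ½ .
The value of p is constant for each coin and the trials are all independent.
f(x) = $\displaystyle {}^{n}C_{x}\,p^{x}q^{n-x}$ = $\displaystyle {}^{4}C_{x}\left(\tfrac{1}{2}\right)^{x}\left(\tfrac{1}{2}\right)^{4-x}$ . Here n = 4.
(i) Probability of getting 2 heads: putting x = 2, we get f(2) = $\displaystyle {}^{4}C_{2}\left(\tfrac{1}{2}\right)^{2}\left(\tfrac{1}{2}\right)^{4-2}$ = 6 (¼) (¼) = ⅜
(ii) At least 2 heads means 2 or more heads, i.e. 2 or 3 or 4 heads.
So, probability of at least two heads = f(2) + f(3) + f(4) = $\displaystyle {}^{4}C_{2}\left(\tfrac{1}{2}\right)^{2}\left(\tfrac{1}{2}\right)^{2}+{}^{4}C_{3}\left(\tfrac{1}{2}\right)^{3}\left(\tfrac{1}{2}\right)^{1}+{}^{4}C_{4}\left(\tfrac{1}{2}\right)^{4}\left(\tfrac{1}{2}\right)^{0}$
= 6 (¼) (¼) + 4 (⅛) (½) + 1 (¹/₁₆) (1) = ⅜ + ¼ + ¹/₁₆ = ¹¹/₁₆
(iii) At least one head means every outcome except no head at all. So, probability of at least one head = 1 − f(0) = 1 − $\displaystyle {}^{4}C_{0}\left(\tfrac{1}{2}\right)^{0}\left(\tfrac{1}{2}\right)^{4}$ = 1 − ¹/₁₆ = ¹⁵/₁₆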

Binomial Distribution – Problems

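Ex. 1 : Six coins are tossed. Find the probability of getting more than 4 heads.
Let getting a head be treated as a success, so the probability of success p = ½.
So, p = ½ (constant probability), q = 1 − p = 1 − ½ = ½ , n = 6.

The probability function is
f(x) = $\displaystyle {}^{n}C_{x}\,p^{x}q^{n-x}$ = $\displaystyle {}^{6}C_{x}\left(\tfrac{1}{2}\right)^{x}\left(\tfrac{1}{2}\right)^{6-x}$
More than 4 heads means 5 or 6 heads.
So probability = f(5) + f(6) = $\displaystyle {}^{6}C_{5}\left(\tfrac{1}{2}\right)^{5}\left(\tfrac{1}{2}\right)^{6-5}+{}^{6}C_{6}\left(\tfrac{1}{2}\right)^{6}\left(\tfrac{1}{2}\right)^{6-6}$
= [6 × (1/2⁵) × (1/2)] + [1 × (1/2⁶)] = 7/2⁶ = $\displaystyle \frac{7}{{64}}$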

Binomial Distribution – Problems

Ex. 1 : Given the probability of defective screws is $\displaystyle \frac{1}{6}$ .
Find the following for the binomial distribution of defective screws in a total of 180:
(i) the mean (ii) the s.d. (iii) moment coefficient of skewness.

Here n = 180, p = $\displaystyle \frac{1}{6}$ , q = 1 − $\displaystyle \frac{1}{6}$ = $\displaystyle \frac{5}{6}$ .
(i) Mean of binomial distribution = np = 180 × $\displaystyle \frac{1}{6}$ = 30
(ii) s.d. ( $\displaystyle \sigma$ ) = $\displaystyle \sqrt{npq}$ = $\displaystyle \sqrt{180\times \frac{1}{6}\times \frac{5}{6}}$ = $\displaystyle \sqrt{25}$ = 5
(iii) Moment coefficient of skewness = (q − p) / $\displaystyle \sqrt{npq}$ = ( $\displaystyle \frac{5}{6}$ – $\displaystyle \frac{1}{6}$ ) / 5 = ( $\displaystyle \frac{4}{6}$ ) / 5 = $\displaystyle \frac{2}{{15}}$ (or 0.1333)

Ex. 2 : The incidence of occupational disease in an industry is such that the workmen have a 20% chance of suffering from it. What is the probability that out of six workmen, 4 or more will contract the disease?
Probability that a worker will suffer from the disease (p) = $\displaystyle \frac{{20}}{{100}}$ = $\displaystyle \frac{1}{5}$ ; q = 1 − $\displaystyle \frac{1}{5}$ = $\displaystyle \frac{4}{5}$ , n = 6
f(x) = $\displaystyle {}^{6}C_{x}\,p^{x}q^{6-x}$ for x = 4, 5, 6 (as x $\displaystyle \ge$ 4)
f(4) + f(5) + f(6) = $\displaystyle {}^{6}C_{4}\left(\tfrac{1}{5}\right)^{4}\left(\tfrac{4}{5}\right)^{2}+{}^{6}C_{5}\left(\tfrac{1}{5}\right)^{5}\left(\tfrac{4}{5}\right)+{}^{6}C_{6}\left(\tfrac{1}{5}\right)^{6}\left(\tfrac{4}{5}\right)^{0}$
= (1/5⁶) × ( $\displaystyle {}^{6}C_{4}\times 4^{2}+{}^{6}C_{5}\times 4+1$ ) = (1/15625) × (240 + 24 + 1) = 265 / 15625 (or 0.017)

Binomial Distribution – Problems

Ex. 1 : The arithmetic mean of a binomial distribution is 6 and the S.D. is 4. Is this calculation correct?
Here $\displaystyle \overline{X}$ = np = 6 and s.d. = $\displaystyle \sqrt{npq}$ = 4,
so npq = 4² = 16 and q = (npq) / (np) = $\displaystyle \frac{{16}}{6}$ = 2.67 (i.e. q > 1).
As p + q = 1, q cannot exceed 1. So the calculation is not correct.
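The same consistency check can be written as a small Python sketch. The helper `binomial_parameters` is a hypothetical name introduced here; it assumes a positive mean and recovers q = npq / np, rejecting any pair of values for which q does not lie strictly between 0 and 1.

```python
def binomial_parameters(mean, sd):
    """Recover (n, p, q) from mean = np and sd = sqrt(npq); None if impossible."""
    q = sd ** 2 / mean           # npq / np
    if not 0 < q < 1:
        return None              # no binomial distribution has these moments
    p = 1 - q
    n = mean / p                 # for a genuine binomial this is a whole number
    return n, p, q

print(binomial_parameters(6, 4))    # the case examined above
print(binomial_parameters(30, 5))   # mean and s.d. of the defective-screws example
```

The first call returns `None`, in line with the conclusion above, while the second recovers values close to n = 180 and p = 1/6.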

Ex. 2 : The incidence of occupational disease in an industry is such that the workmen have a 10% chance of suffering from it. What is the probability that out of 5 workmen, 3 or more will contract the disease?
n = 5 and p = probability of a workman suffering from the disease = 10% = 0.1. So q = 1 − 0.1 = 0.9.
f(x) = $\displaystyle {}^{5}C_{x}\,(0.1)^{x}(0.9)^{5-x}$ , for x = 0, 1, 2, …, 5.
The probability that 3 or more workmen will contract the disease is P (x $\displaystyle \ge$ 3)
= f(3) + f(4) + f(5)
= $\displaystyle {}^{5}C_{3}\,(0.1)^{3}(0.9)^{5-3}+{}^{5}C_{4}\,(0.1)^{4}(0.9)^{5-4}+{}^{5}C_{5}\,(0.1)^{5}$
= (10 × 0.001 × 0.81) + (5 × 0.0001 × 0.9) + (1 × 0.00001) = 0.0081 + 0.00045 + 0.00001 = 0.0086
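"At least k successes" probabilities such as the two occupational-disease examples can be cross-checked with exact fractions. This is a minimal sketch; the function name `prob_at_least` is an assumption made for illustration.

```python
from fractions import Fraction
from math import comb

def prob_at_least(k, n, p):
    """Exact binomial probability of k or more successes in n trials."""
    p = Fraction(p)
    q = 1 - p
    return sum(comb(n, x) * p**x * q**(n - x) for x in range(k, n + 1))

# 20% chance, 4 or more out of 6 workmen
disease_20 = prob_at_least(4, 6, Fraction(1, 5))
# 10% chance, 3 or more out of 5 workmen
disease_10 = prob_at_least(3, 5, Fraction(1, 10))

print(disease_20, float(disease_20))
print(disease_10, float(disease_10))
```

Working with `Fraction` avoids rounding until the final conversion to a decimal.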

Poisson Distribution

The binomial distribution cannot be applied where n cannot be estimated. In such cases, the Poisson Distribution is applicable.
The Poisson distribution is defined by the probability function
f(x) = (e^{– m} m^x) / (x!), for x (no. of successes) = 0, 1, 2, 3, …

x	0	1	2	3	…	Total
f(x)	e^{– m}	e^{– m} m	(e^{– m} m²) / 2!	(e^{– m} m³) / 3!	…	1

(as the total probability must be unity)

If the value of the parameter m is known, the distribution is completely known. The value of m generally lies between 0.1 and 10.

Properties of Poisson Distribution:

Discrete distribution: Like the binomial distribution, it is also a discrete probability distribution, i.e. the number of occurrences is described by a discrete random variable.
Main parameter: The main parameter is mean (m) which is equal to np i.e. m = np.
Form: It is a positively skewed distribution.

Assumption of Poisson Distribution:

The occurrences of events are independent, i.e. the occurrence of an event in an interval of time or space does not affect the probability of a second occurrence of the event in the same (or any other) interval.
The probability of a single occurrence of the event in a given interval is proportional to the length of the interval.
The probability of occurrence of more than one event in a very small interval is negligible.

Examples of Poisson distribution:
The number of telephone calls received at a particular switch board per minute during a certain hour of the day.
The number of deaths per day in a district or town in one year by a disease.
The number of cars passing a certain point per minute.
The number of persons born deaf and dumb per year in a city.
The number of typographical errors per page.
The number of printing errors per page.
The number of defective blades in a pack.

Poisson Distribution Computation

Poisson Distribution is a discrete distribution. The probability of exactly 0, 1, 2, …, n successes can be found in the following steps:
Step 1 : Find the arithmetic mean of the observed data, denoted as m, i.e. $\displaystyle \overline{X}$ = m
Step 2 : Compute the value of e^{– m} (e = 2.7183, the base of natural logarithms):
e^{– m} = 1 / (e^m) = 1 / (2.7183)^m = 1 / [antilog (m × log 2.7183)] = 1 / [antilog (.4343 × m)]
Step 3 : Compute the probability of 0, 1, 2, …, n successes using the Poisson distribution P(x) = e^{– m} . [(m^x) / x!], or P(r) = e^{– m} . [(m^r) / r!], where x or r = no. of successes (0, 1, 2, …, n), e = 2.7183 and m = $\displaystyle \overline{X}$ = arithmetic mean
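Steps 2 and 3 can be sketched in Python, where `math.exp` replaces the logarithm-table computation of e^{– m}. The function name `poisson_pmf` and the mean m = 2 are illustrative assumptions.

```python
from math import exp, factorial

def poisson_pmf(x, m):
    """P(x) = e^(-m) * m^x / x! for a Poisson distribution with mean m."""
    return exp(-m) * m**x / factorial(x)

m = 2.0  # illustrative mean (assumed), as would be obtained in Step 1
for x in range(5):
    print(x, round(poisson_pmf(x, m), 4))

# The probabilities over all x add up to 1 (the remaining tail is negligible here)
total = sum(poisson_pmf(x, m) for x in range(60))
print(total)
```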

Table of Values of e^{– m}
(m lying between 0 and .99; the row gives the first decimal place of m and the column heading the second)
m	0	1	2	3	4	5	6	7	8	9
0.0	1.0000	.9900	.9802	.9704	.9608	.9512	.9418	.9324	.9231	.9139
0.1	.9048	.8958	.8869	.8781	.8694	.8607	.8521	.8437	.8353	.8270
0.2	.8187	.8106	.8025	.7945	.7866	.7788	.7711	.7634	.7558	.7483
0.3	.7408	.7334	.7261	.7189	.7118	.7047	.6977	.6907	.6839	.6771
0.4	.6703	.6636	.6570	.6505	.6440	.6376	.6313	.6250	.6188	.6125
0.5	.6065	.6005	.5945	.5886	.5827	.5770	.5712	.5655	.5599	.5543
0.6	.5488	.5434	.5379	.5326	.5273	.5220	.5169	.5117	.5066	.5016
0.7	.4966	.4916	.4868	.4819	.4771	.4724	.4677	.4630	.4584	.4538
0.8	.4493	.4449	.4404	.4360	.4317	.4274	.4232	.4190	.4148	.4107
0.9	.4066	.4025	.3985	.3946	.3906	.3867	.3829	.3791	.3753	.3716

Values of e^{– m} for values of m lying between 1 and 10

m	1	2	3	4	5
e^{– m}	0.36788	0.13534	0.04979	0.01832	0.00674
m	6	7	8	9	10
e^{– m}	0.00248	0.00091	0.000335	0.000123	0.000045
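Where a printed table is not at hand, the same values can be regenerated directly. The sketch below uses only `math.exp`; the function name `exp_neg_table` is an assumption introduced for illustration.

```python
from math import exp

def exp_neg_table():
    """Rows of e^(-m) for m = 0.00 ... 0.99, rounded to 4 decimal places.

    Row index = first decimal place of m, column index = second decimal place.
    """
    rows = []
    for tenths in range(10):
        row = [round(exp(-(10 * tenths + hundredths) / 100), 4)
               for hundredths in range(10)]
        rows.append(row)
    return rows

table = exp_neg_table()
for tenths, row in enumerate(table):
    print(f"0.{tenths}", *[f"{value:.4f}" for value in row])

# Whole-number values of m from 1 to 10
whole = {m: exp(-m) for m in range(1, 11)}
for m, value in whole.items():
    print(m, f"{value:.6f}")
```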

Poisson Distribution – Problems

Ex. A random variable x follows a Poisson distribution with parameter 2. Find the probabilities that x assumes the values (i) 0, 1, 3, (ii) less than 3, (iii) at least 2
(given e^{– 2} = .1353).

Here, m = 2.
(i) f(x) = e^{– m} . [(m^x) / x!] = e^{– 2} . [(2^x) / x!], for x = 0, 1, 3
f(0) = e^{– 2} . [(2⁰) / 0!] = [(e^{– 2} × 1) / 1] = e^{– 2} = 0.13534
f(1) = e^{– 2} . [(2¹) / 1!] = [(e^{– 2} × 2) / 1] = e^{– 2} × 2 = 0.27068
f(3) = e^{– 2} . [(2³) / 3!] = [(e^{– 2} × 8) / 6] = e^{– 2} × ( $\displaystyle \frac{4}{3}$ ) = 0.1804

(ii) ‘Less than 3’ means x = 0 or 1 or 2.
P(x < 3) = f(0) + f(1) + f(2) = .1353 + .2706 + [(e^{– 2} . 2²) / 2!] = (.1353) + (.2706) + (.1353 × 2)
= .1353 + .2706 + .2706 = .6765.

(iii) ‘At least 2’ means x = 2 or 3 or 4 or more.
P(x $\displaystyle \ge$ 2) = f(2) + f(3) + f(4) + … = 1 – {f(0) + f(1)} = 1 – (.1353 + .2706) = 1 – .4059 = .5941.

Poisson Distribution – Problems

Ex. 1 : If a random variable x follows a Poisson distribution such that P(x = 1) = P(x = 2), find P(x = 0).
f(x) = e^{– m} . [(m^x) / x!]. So, P(x = 1) = f(1) = e^{– m} . [(m¹) / 1!] = m e^{– m}
P(x = 2) = f(2) = e^{– m} . [(m²) / 2!] = [(m² e^{– m}) / 2]
Now, as given in the problem, f(1) = f(2), so m e^{– m} = [(m² e^{– m}) / 2]. Or, 1 = m/2, or m = 2
So, P(x = 0) = f(0) = e^{– 2} . [(2⁰) / 0!] = e^{– 2}

Ex. 2 : One tenth per cent of the blades produced by the blade manufacturing factory turn out to be defective. The blades are supplied in packets of 20. Use Poisson distribution to calculate the approximate number of packets containing (i) no defective (ii) one defective and (iii) two defective blades respectively in a consignment of 4,00,000 packets.

Let the occurrence of a defective blade be a success. Here, p = ( $\displaystyle \frac{1}{{10}}$ ) % = $\displaystyle \frac{1}{{1000}}$ , n = 20, m = np = 20 × $\displaystyle \frac{1}{{1000}}$ = .02
Probability of x defective blades in a packet: f(x) = e^{– m} . [(m^x) / x!] = e^{– .02} . [(.02)^x / x!], for x = 0, 1, 2, …
Expected number of packets with x defective blades = 4,00,000 × f(x) = 4,00,000 × [{e^{– .02} . (.02)^x} / x!]
(i) No defective: 4,00,000 × [{e^{– .02} . (.02)⁰} / 0!] = 4,00,000 × e^{– .02} = 4,00,000 × .9802 = 3,92,080
(ii) One defective: 4,00,000 × [{e^{– .02} . (.02)¹} / 1!] = 4,00,000 × e^{– .02} × (.02)¹
= 4,00,000 × .9802 × .02 = 7842 (approx.)
(iii) Two defective: 4,00,000 × [{e^{– .02} . (.02)²} / 2!] = (4,00,000 × e^{– .02} × (.02)²) / 2!
= 4,00,000 × .9802 × .0002 = 78 (approx.)

Poisson Distribution – Problems

Ex. Printing mistakes per page committed by a press follows a Poisson distribution. Find the expected frequencies for the following distribution of printing mistakes:

Mistakes/ page	0	1	2	3	4	5
No of Pages	40	30	20	15	10	5	Total = 120

(Value of e^{– 1.5} = 0.22313)

Here, Mean (m) = [{(0 × 40) + (1 × 30) + (2 × 20) + (3 × 15) + (4 × 10) + (5 × 5)} / 120] = 180 / 120 = 1.5
P(0) = e^{– 1.5} = 0.22313, P(1) = e^{– 1.5} × 1.5 = 0.22313 × 1.5 = 0.33470
P(2) = e^{– 1.5} . [(1.5)² / 2!] = 0.25102, P(3) = e^{– 1.5} . [(1.5)³ / 3!] = 0.12551,
P(4) = e^{– 1.5} . [(1.5)⁴ / 4!] = 0.04707, P(5) = e^{– 1.5} . [(1.5)⁵ / 5!] = 0.01412

Expected Frequency = N × P(x) = $\displaystyle \frac{{N{{e}^{{-m}}}{{m}^{x}}}}{{x!}}$ , where N = 120 is the total number of pages
Putting the values, we get the expected frequencies (rounded to whole pages) as follows

# of Printing Mistakes	# of Pages	Expected Frequency
0	40	27
1	30	40
2	20	30
3	15	15
4	10	6
5	5	2
Total	120	120
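The fitting procedure used in this example (estimate the mean from the observed data, then multiply each Poisson probability by the total frequency) can be sketched as follows. The variable names are assumptions introduced for illustration.

```python
from math import exp, factorial

# Observed data from the example: mistakes per page -> number of pages
observed = {0: 40, 1: 30, 2: 20, 3: 15, 4: 10, 5: 5}

N = sum(observed.values())                          # total number of pages
m = sum(x * f for x, f in observed.items()) / N     # arithmetic mean of mistakes

# Expected frequency for each x: N * e^(-m) * m^x / x!
expected = {x: N * exp(-m) * m**x / factorial(x) for x in observed}

for x in observed:
    print(x, observed[x], round(expected[x], 2))
```

Printing the unrounded values makes it easier to see how each expected frequency is rounded to a whole number of pages.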

Poisson Distribution – Problems

The number of accidents in a year attributed to bus drivers in a city follows a Poisson distribution with mean 3. Out of 3,000 bus drivers, find the number of drivers with (i) no accident in a year and (ii) at least 3 accidents in a year.
[e^{– 3} = 0.0498]
Here m = 3, P(x = r) = e^{– m} . [(m^r) / r!]
(i) Probability of no accident = P(0) = e^{– 3} = .0498.
So, number of drivers with no accident = 3000 × .0498 = 149
(ii) Probability of at least 3 accidents = P(x $\displaystyle \ge$ 3) = 1 – P(x $\displaystyle \le$ 2) = 1 – [P(0) + P(1) + P(2)]
= 1 – [(e^{– m}) + (e^{– m}.m) + {(e^{– m}.m²) / 2!}] = 1 – e^{– m} [1 + m + (m² / 2!)]
= 1 – e^{– 3} [1 + 3 + (3² / 2!)] = 1 – e^{– 3} [1 + 3 + ( $\displaystyle \frac{9}{2}$ )] = 1 – (e^{– 3} × 8.5) = 1 – (.0498 × 8.5)
= 1 – 0.4233 = 0.5767
So, number of bus drivers with at least 3 accidents in a year = 3000 × 0.5767 = 1730

Normal Distribution

Normal Distribution is continuous probability distribution in which the relative frequencies of a continuous variable are distributed according to normal probability law. It is a symmetrical distribution in which the frequencies are distributed evenly about the mean of distribution.

Normal distribution is defined by the probability density function:

$\displaystyle P(x)=\frac{1}{{\sigma \sqrt{{2\pi }}}}\,{{e}^{{-\frac{1}{2}{{{\left[ {\frac{{x-\overline{X}}}{\sigma }} \right]}}^{2}}}}}$

for $\displaystyle -\infty <x<+\infty$ , where $\displaystyle \overline{X}$ = Mean, $\displaystyle \sigma$ = Standard deviation, e (base of natural logarithm) = 2.7183, $\displaystyle \pi$ = 3.1415.

Normal distribution in its standard normal variate (S.N.V.) form is given by:

$\displaystyle P(Z)=\frac{1}{{\sqrt{{2\pi }}}}\,{{e}^{{-\frac{1}{2}{{Z}^{2}}}}}$

for $\displaystyle -\infty <Z<+\infty$ , where Z = S.N.V. = $\displaystyle {\frac{{x-\overline{X}}}{\sigma }}$ .

The mean of Z is zero and the standard deviation of Z is 1.
In a normal distribution, the quartiles Q₁ and Q₃ are equidistant from the median (M), so that Q₃ – M = M – Q₁.

Normal Distribution Properties

Bell Shaped : The normal curve is perfectly symmetrical and bell shaped about mean. This implies that if we fold the curve along its vertical axis at the center, the two halves would coincide.
Continuous Distribution: Normal distribution is a distribution of continuous variables. For this reason, it is called continuous probability distribution.
Parameters of Distribution: Two main parameters of normal distribution are: Mean $\displaystyle \overline{X}$ and Standard Deviation (S.D.). The entire distribution can be known from these two parameters.
Relationship between M.D. and S.D. : In a normal distribution, the mean deviation (M.D.) is approximately $\displaystyle {\frac{4}{5}}$ times the standard deviation, i.e., M.D = $\displaystyle \left( {\frac{4}{5}} \right)$ x S.D (approx.)

Normal Curve in Statistics

Normal Curve is the most prominent probability distribution model used in statistics. Normal curve is bell-shaped perfectly symmetric curve, centered on the mean, equal to its median and mode.

The equation of the normal curve depends on the Mean ( $\displaystyle \overline{X}$ or $\displaystyle \mu$ ) and the Standard Deviation ( $\displaystyle \sigma$ ).
For different values of $\displaystyle \mu$ and $\displaystyle \sigma$ , different normal curves are obtained. Since $\displaystyle \mu$ and $\displaystyle \sigma$ can assume an infinite number of values, it is impracticable to tabulate the area under the curve for different values of $\displaystyle \mu$ and $\displaystyle \sigma$ .
For the sake of convenience, standard normal curve or unit normal curve is constructed with $\displaystyle \mu$ = 0 and standard deviation = 1. Subsequently, the given value of the normal variate is transformed into standard units by the formula of Z- transformation

$\displaystyle Z=\frac{{\left( {X-\overline{X}} \right)}}{\sigma }$ , where, Z= z-transformation, $\displaystyle \overline{X}$ (or $\displaystyle \mu$ ) = Arithmetic mean of population, X= Value of Observation, $\displaystyle \sigma$ = S.D. of distribution

Ex. Find the area under the Standard Normal Curve between z = 0 & z = 1.8 and between z = 0 & z = 1.85.
In the Standard Normal Table, the value in row 1.8 and column 0 (i.e. for z = 1.80) is 0.4641. Similarly, the value for z = 1.85 (row 1.8, column 5) is 0.4678.
Hence, the required areas are respectively 0.4641 and 0.4678.

In the first case, 0.4641 represents the probability that z lies between 0 and 1.8, i.e. p(0 $\displaystyle \le$ z $\displaystyle \le$ 1.8) = 0.4641, and in the second case, p(0 $\displaystyle \le$ z $\displaystyle \le$ 1.85) = 0.4678.
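The tabulated areas between 0 and z can also be computed from the error function, using the identity area(0, z) = ½ erf(z / √2). This is a minimal sketch; the function name `area_zero_to_z` is an assumption made for illustration.

```python
from math import erf, sqrt

def area_zero_to_z(z):
    """Area under the standard normal curve between 0 and z."""
    return 0.5 * erf(z / sqrt(2))

for z in (1.8, 1.85):
    print(z, round(area_zero_to_z(z), 4))
```

Because the curve is symmetric, the same function returns a negative value of equal size for a negative z; its absolute value is the area.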

Statistical Normal Curve – Problems

Ex. 1 : Find the area under the Standard Normal Curve between z = – 1.67 and z = 0.
By symmetry, the area between z = – 1.67 and z = 0 is equal to the area between z = 0 and z = 1.67.
From the table of area under the Standard Normal Curve, the corresponding value is 0.4525, which is the reqd. area. So P (– 1.67 $\displaystyle \le$ z $\displaystyle \le$ 0) = 0.4525.

Ex. 2 : Find the area under the Standard Normal Curve between z = 0.82 and z = 1.96. This area cannot be read directly from the table, so we have to break it up as follows :
Reqd. area = (area between z = 0 and z = 1.96) – (area between z = 0 and z = 0.82) = (0.4750 – 0.2939) = 0.1811. So p (0.82 $\displaystyle \le$ z $\displaystyle \le$ 1.96) = 0.1811.

Ex. 3 : Find the area under the Standard Normal Curve between z = – 0.75 and z = 2.04.
Required area = (area between z = – 0.75 and z = 0) + (area between z = 0 and z = 2.04) = (area between z = 0 and z = .75) + (area between z = 0 and z = 2.04) = .2734 + .4793 = 0.7527. So p (– 0.75 $\displaystyle \le$ z $\displaystyle \le$ 2.04) = 0.7527.
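The three areas above can be cross-checked with Python's standard library, where `statistics.NormalDist().cdf` gives the cumulative area to the left of z. The helper name `area_between` is an assumption introduced for illustration.

```python
from statistics import NormalDist

STANDARD_NORMAL = NormalDist(mu=0.0, sigma=1.0)

def area_between(z_low, z_high):
    """Area under the standard normal curve between z_low and z_high."""
    return STANDARD_NORMAL.cdf(z_high) - STANDARD_NORMAL.cdf(z_low)

for z_low, z_high in [(-1.67, 0.0), (0.82, 1.96), (-0.75, 2.04)]:
    print(z_low, z_high, round(area_between(z_low, z_high), 4))
```

Taking the difference of two cumulative areas handles all three cases uniformly, with no need to split the interval at z = 0.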

Statistical Normal Curve – Problems

Ex. A normal curve has $\displaystyle \overline{X}$ = 20 and $\displaystyle \sigma$ = 4. Find the probability that x assumes a value between 16.8 and 27.6.
$\displaystyle Z=\frac{{\left( {X-\overline{X}} \right)}}{\sigma }$ , where z = standard normal variate corresponding to x, $\displaystyle \overline{X}$ = 20 and $\displaystyle \sigma$ = 4
x1 = 16.8, z1 = [(16.8 – 20) / 4] = $\displaystyle \frac{{-3.2}}{4}$ = – .8
x2 = 27.6, z2 = [(27.6 – 20) / 4] = $\displaystyle \frac{{7.6}}{4}$ = 1.9

Now P (16.8 $\displaystyle \le$ x $\displaystyle \le$ 27.6) = P (– .8 $\displaystyle \le$ z $\displaystyle \le$ 1.9) = (area between z = – .8 and z = 0) + (area between z = 0 and z = 1.9) = (area between z = 0 and z = .8) + (area between z = 0 and z = 1.9) = .2881 + .4713 = 0.7594.

Statistical Normal Curve – Problems

Ex. How many male workers in a factory have a salary between (i) Rs.800 and 1360, and (ii) more than Rs.1440 if the mean salary is Rs.1000 and s.d. is Rs.200 and number of workers is 20,000, if the salary of the workers is assumed to be normally distributed.

At first we are to find standard normal variate corresponding to given variates

(i) x1=800, z1=(800-1000) / 200 = -200/200 = -1 as $\displaystyle \overline{X}$ = 1000, $\displaystyle \sigma$ = 200
x2= 1360, z2= (1360-1000) / 200 = $\displaystyle \frac{{360}}{{200}}$ = 1.8
Now p (800 $\displaystyle \le$ x $\displaystyle \le$ 1360) = P (-1 $\displaystyle \le$ z $\displaystyle \le$ 1.8) = (area between z = – 1 and z = 0) + (area between z = 0 and z = 1.8) = (area between z = 0 and z = 1) + (area between z = 0 and z = 1.8) = .3413 + .4641 = .8054
i.e., 80.54% of the total workers have a salary between Rs.800 and Rs.1360
Number of workers getting salary between Rs.800 and Rs.1360 = .8054 x 20,000 = 16108
(ii) For x=1440, z= (1440-1000)/200 = $\displaystyle \frac{{440}}{{200}}$ = 2.2
Now p (x > 1440) = P (z > 2.2) = (area under the standard normal curve to the right of z = 2.2)
= (area to the right of z = 0) – (area between z = 0 and z = 2.2) = .5000 – .4861 = 0.0139
So, 1.39% of the total workers have a salary more than Rs.1440, and Number of such workers = .0139 x 20,000 = 278
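The salary example can be reproduced without a table by letting `statistics.NormalDist` carry out the z-transformation internally. This is a sketch under the example's own figures; the variable names are assumptions.

```python
from statistics import NormalDist

salary = NormalDist(mu=1000, sigma=200)   # mean Rs.1000, s.d. Rs.200
workers = 20000

# (i) share of workers earning between Rs.800 and Rs.1360
share_between = salary.cdf(1360) - salary.cdf(800)
count_between = round(share_between * workers)

# (ii) share of workers earning more than Rs.1440
share_above = 1 - salary.cdf(1440)
count_above = round(share_above * workers)

print(round(share_between, 4), count_between)
print(round(share_above, 4), count_above)
```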

Statistical Normal Curve – Problems

Ex. The income distribution of Engineers of a company was found to follow normal distribution. The average income of an Engineer was Rs.14,000. The standard deviation of the income of Engineers was Rs. 2,500. If there were 484 Engineers drawing salary above Rs. 15750, how many Engineers were there in the company?

[The area under standard normal curve between 0 and 0.7 is 0.2580]

$\displaystyle Z=\left[ {\frac{{\left( {X-\overline{X}} \right)}}{\sigma }} \right]$ = (15750 – 14000) / 2500 = 1750 / 2500 = 0.7
Using the normal distribution, P(Z $\displaystyle \ge$ 0.7) = .5 – .2580 = .242
So, the probability that an Engineer draws a salary of Rs. 15750 or more is 0.242
The number of Engineers in the company = $\displaystyle \frac{{484}}{{.242}}$ = 2000
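The reasoning in this example (a known head-count in the tail divided by the tail probability) can be sketched as below. The variable names are assumptions; the unrounded tail probability gives a total slightly different from the table-based figure, so the result is rounded.

```python
from statistics import NormalDist

income = NormalDist(mu=14000, sigma=2500)   # mean Rs.14,000, s.d. Rs.2,500

z = (15750 - 14000) / 2500                  # standard normal variate
share_above = 1 - income.cdf(15750)         # P(income > Rs.15,750)
total_engineers = 484 / share_above         # 484 engineers lie in this tail

print(z)
print(round(share_above, 3))
print(round(total_engineers))
```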

Statistical Normal Curve – Problems

Ex. In a sample of 2,000 items, the mean weight and standard deviation are 40 and 20 kilograms respectively. Assuming the distribution to be normal, find the number of items weighing between 20 and 80 kilograms.
$\displaystyle Z=\left[ {\frac{{\left( {X-\overline{X}} \right)}}{\sigma }} \right]$ . Here $\displaystyle \overline{X}$ = 40, $\displaystyle \sigma$ = 20. So, for x=20, z=(20-40) / 20 = -1

The area under standard normal curve between the mean and z = – 1 is 0.3413
So, for x=80, z=(80-40) / 20 = 2
The area under standard normal curve between the mean and Z = 2 is 0.4772
So, The probability of items weighing between 20 and 80
P(20 $\displaystyle \le$ x $\displaystyle \le$ 80) = P (- 1 $\displaystyle \le$ z $\displaystyle \le$ 2) = P(0 $\displaystyle \le$ z $\displaystyle \le$ 1) + P(0 $\displaystyle \le$ z $\displaystyle \le$ 2) = 0.3413 + 0.4772 = 0.8185
Number of items weighing between 20 and 80 kilograms is 2000 x 0.8185 = 1637

Chi Square Distribution

Chi-square distribution (Χ² distribution) is a continuous probability distribution used in Statistical hypothesis tests.
If X₁, …, X_k are independent, standard normal random variables, then the sum of their squares

$\displaystyle Q=\sum\limits_{{i=1}}^{k}{{X_{i}^{2}}}$

is distributed according to the chi-square distribution with k degrees of freedom. This is usually denoted as $\displaystyle Q\sim {{\chi }^{2}}(k)$ or $\displaystyle Q\sim \chi _{k}^{2}$ .
The chi-square distribution has one parameter: k, a positive integer that specifies the number of degrees of freedom (i.e. the number of $\displaystyle {{X}_{i}}$ 's).
Additivity : From the definition of the chi-square distribution, it follows that the sum of independent chi-square variables is also chi-square distributed. Specifically, if X₁, X₂, …, X_n are independent chi-square variables with k₁, k₂, …, k_n degrees of freedom respectively, then Y = X₁ + X₂ + … + X_n is chi-square distributed with k₁ + k₂ + … + k_n degrees of freedom.
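The definition and the additivity property can be illustrated with a small simulation. The sketch relies on one standard fact not stated in the text above, namely that a chi-square variable with k degrees of freedom has mean k; the function name, seed and sample size are arbitrary assumptions. Matching means are only a rough sanity check, not a proof of the distributional claim.

```python
import random

def chi_square_sample(k, rng):
    """One draw of Q = sum of squares of k independent standard normal variables."""
    return sum(rng.gauss(0.0, 1.0) ** 2 for _ in range(k))

rng = random.Random(0)    # fixed seed so the run is repeatable
trials = 20000

# Average of chi-square(3) draws: should be close to 3
mean_q3 = sum(chi_square_sample(3, rng) for _ in range(trials)) / trials

# Average of chi-square(2) + chi-square(3): should be close to 2 + 3 = 5
mean_sum = sum(chi_square_sample(2, rng) + chi_square_sample(3, rng)
               for _ in range(trials)) / trials

print(round(mean_q3, 2), round(mean_sum, 2))
```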

T Distribution in Statistics

The T-Distribution is a theoretical probability distribution. T-distribution depicts the set of observations mostly falling close to the mean, the rest of the observations making up the tails on either side.

T distribution is symmetrical, bell-shaped, and similar to the standard normal curve. It differs from the standard normal curve in the way that it has an additional parameter, called Degrees of Freedom, which changes its shape

Degrees of Freedom : Degrees of freedom, usually symbolized by df, (which can be any real number greater than zero (0.0)), is a parameter of t distribution. Setting the value of df defines a particular member of the family of t distributions. A member of the family of t distributions with a smaller df has more area in the tails of the distribution than one with a larger df.

Effect of df on the t distribution

The smaller the df, the flatter the shape of the distribution, resulting in greater area in the tails of the distribution.
Relationship to the Normal Curve
The T distribution looks similar to the normal curve. As the df increase, the t distribution approaches the standard normal distribution ( $\displaystyle \mu$ =0.0, $\displaystyle \sigma$ =1.0).
The standard normal curve is a special case of the t distribution when df= $\displaystyle \infty$ . The t distribution approaches the standard normal distribution relatively quickly

F-Distribution in Statistics

Named after Ronald Fisher, the F distribution is used to compare the spread or scatter of two observed random samples, as a test of whether the samples have the same variability.
The F distribution is obtained by taking the ratio of two independent chi-square variables, each divided by its number of degrees of freedom.

The statistic F = (u/u₁) / (v/v₁) has an F distribution with (u₁, v₁) degrees of freedom, where u and v are independently distributed chi-squared variables with u₁ and v₁ degrees of freedom, respectively.

From the definition of the t distribution, the square of a t statistic may be written as:
t² = (z²/1) / (v/v₁), where z², being the square of a standard normal variable, has a chi-squared distribution with 1 degree of freedom.
Thus the square of a t variable with v₁ degrees of freedom is an F variable with (1, v₁) degrees of freedom, that is: t² = F(1, v₁)
