Survey
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
Exposure Distribution
1
Terminology
• Population/distribution
– A population is a definable collection of individual elements (or
units).
• For example, if a workplace has only 10 workers and all of them
are exposed to a chemical, then one-day exposure measurements
for the chemical are conducted to the workers. These 10 workers’
8-hour TWAs can be considered as a one-day population of the
workers’ 8-hour TWA exposures for the chemical.
• Assume this population consists of the set of values below:
(1, 2, 2, 3, 3, 3, 3, 4, 4, 5)
• This data set can also be termed a distribution which describes
the situation of event occurrence.
2
Terminology
• Probability distribution
0. 0.1 0.2 0.3 0.4
– A probability distribution is also a set of numbers, but for each
number (or small subset of numbers) we can assign a
probability (=relative frequency) of occurrence.
0.4
0.2
0.2
0.1
0.1
1
2
3
4
5
The probability distribution for the example is: { 1(0.1), 2(0.2), 3(0.4), 4(0.2), 5(0.1)}
3
Terminology
• Probability distribution (continued)
– A probability distribution is a convenient way to describe a
large population without listing all elements.
– Under certain conditions, a probability distribution can be
described by a single equation or mathematical function
termed probability density function (pdf).
– In industrial hygiene, three pdf’s commonly seen are normal,
lognormal and exponential.
4
Terminology
• Parameters
– A parameter is a constant characteristic of a distribution.
Every distribution has numerous parameters, including a
mean, a median, a standard deviation, and percentiles.
– Mean ( ):
• The mean is the arithmetic average of the distribution.
• It is also denoted in other text as E (for expectation).
– Median:
• The median is the value below and above which lies 50% of
the elements in the distribution.
• It can be thought of as the middle value in an ordered
count of the elements in the distribution.
– Standard deviation ():
• It is a measure of dispersion or variability around the mean
of the distribution.
5
Terminology
• Parameters (continued)
– Coefficient of Variation (CV):
• Defined as the standard deviation divided by the mean.
• It is a measure of relative variability. It is used to compare the
variability in two or more distributions.
– Percentile:
• A percentile (=quantile) is a value at or below which lies a
specified percent or proportion of the distribution.
• The value of the pth percentile is denoted by Xp.
6
Terminology
– Equations:
N
• Mean ( ):
x
i
i 1
N
• Standard deviation ( ):
N
• Coefficient of Variation (CV):
2
(
x
)
i
i 1
N
CV
7
Terminology
– Sample:
• A sample is a collection of elements from a population where the
elements are chosen according to a specific scheme. For each
element in the sample, we measure the value of some variables of
interest, and use the sample data to estimate population
parameters.
X
S
n
• Sample mean ( X ):
X
x
i 1
i
% CV %CV
n
• Sample standard deviation (S):
n
S
(x X )
2
i
i 1
n 1
• Sample coefficient of variation ( CV):
CV
S
X
8
Normal Distribution
Z
a
r
e
a
=
0
.
6
8
4
x
If x values are normally distributed,
then the Z values are normally distributed,
have mean= 0 and standard deviation=1.
Z ~ N (0,1)
a
r
e
a
=
0
.
9
5
4
9
Normal Distribution
• Normal distribution curve
– Symmetric
– The mean equals to median.
– When moving equal distances along the x-axis to the right or
left away from the mean, equal proportions of the distribution
are covered.
– The proportion of the distribution lying between the lowest
value and some higher value is termed cumulative probability,
which happens to be the same thing as a percentile.
10
Normal Distribution
• Example question:
– For a normal distribution with =100, =20, what proportion
of the distribution is less than the value 74.36?
Z
x
74.36 100
1.282
20
Look up Z = -1.282 in the Z table
Area
Z
0.0968 -1.30
0.1056 -1.25
Interpolate. Area = 0.1000
This means that 10% of the distribution 74.36.
11
Lognormal Distribution
• Lognormal distribution curve
A
r
e
a
=
0
.
6
8
3
A
r
e
a
=
0
.
9
5
4
gg
g
2
g
g
g
g
2
g
g
12
Lognormal Distribution
• Lognormal distribution
– The lognormal distribution is a nonsymmetrical curve skewed
to the right when the actual values are plotted on an arithmetic
x-axis.
– When the logarithms of the values (=logtransformed values)
are plotted on the arithmetic x-axis, the skewed curve becomes
the familiar normal distribution.
– The natural logarithm, that is, the logarithm to the base
e=2.71828… is used to do the value transformation.
13
Lognormal Distribution
• Lognormal distribution
– It is common to describe the lognormal distribution by its
geometric mean (GM or g) and geometric standard deviation
(GSD or ).
g
– The GM is the value (in typical units) below and above which
lies 50% of the elements in the population. Hence, the GM is
the population median.
– The GSD is a unitless number and always greater than 1.0.
– The GSD reflects variability in the population around the GM.
14
Lognormal Distribution
• Calculation of GM (or g) and GSD ( g):
– GM:
The GM is the antilog of the arithmetic mean of the
logtransformed values ( l ).
N
ln( xi )
l
l i 1
N
GM e
– GSD:
N
GSD e l
[ln( x ) ]
2
l
i 1
i
l
N
15
Lognormal Distribution
• Sample estimates of the GM and GSD are calculated in a similar
fashion to those of X and S in the normal distribution.
n
GM e
Xl
Xl
ln( x )
i
i 1
n
n
GSD e
Sl
sl
2
[ln(
x
)
X
]
i
l
i 1
n1
• The GM and GSD can be used to estimate percentile of the
lognormal distribution.
X p % GM GSD
Z p%
16
Lognormal Distribution
For example, if there is a lognormal distribution of exposure levels with
GM=150 ppm and GSD=2.5, the value of X95% is:
X 95% 150 2.51.645 677.18
• More equations:
Z p%
ln( X p % ) ln(GM )
ln(GSD)
GM
X 84%
GSD
X 16% GM
17