Download Hypothesis Testing of Process Capability Indices

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts
no text concepts found
Transcript
CHAPTER
5
Hypothesis Testing of Process
Capability Indices
5.1
Statement of the Problem
109
5.2
Definition and Role of Generalized P-values
110
5.3
Relation between GCIs and Generalized P-Values
113
5.1 Statement of the Problem
Statistics is all about inferring every thing possible about an entire population based on
a few random samples taken from that population. This inference is usually about a few
parameters of a particular probability distribution which, we suspect, would adequately
model the population of interest. The accuracy of the estimates of the parameters is
heavily dependent on the sample size, sampling technique, method of estimation etc.
One way of qualifying a single estimate of a statistic is through confidence intervals.
Instead of a single estimate of a parameter, the accuracy of which is not sure about,
confidence intervals provide a range of values within which we can be reasonably sure
that the true parameter value lies. But the uncertainty regarding the accuracy of the
estimate, still persists to some extent. In hypothesis testing we determine whether a
hypothesized value of the parameter is admissible or not, based on random samples
taken from the process and the parameter estimate derived from it. A hypothesis test,
this way settles doubt regarding the true value of the unknown parameter, to the more
109
C HAPTER 5. H YPOTHESIS T ESTING
OF
P ROCESS C APABILITY I NDICES
satisfaction of the investigator. At least in few cases, we have to give a clear answer in
the form of either accepting or rejecting a hypothesized value of the parameter.
In quality control situations like process capability analysis, we may be required to
give a definite decision in the form of either accepting or rejecting the capability of the
process. That is, we have to test hypothesis of the form
H0 : the process is not capable
against
H1 : the process is capable.
If C is the capability index used to assess the capability of the process such that the
threshold value of C , that makes the process capable is C0 , then we have the one-sided
statistical testing problem
H0 : C ≤ C0
(not capable)
H1 : C > C0
(capable)
against
(5.1.1)
The value of C0 is never found smaller than one, but in many cases equal to 1.33.
Kane (1986) had investigated the testing of (5.1) in connection with Cp index, and
succeeded in preparing a table giving critical values of C0 , which can be used to accept or reject the hypothesis corresponding to various values of n. He had drawn two
Operating Characteristic (OC) curves for two situations: (i) n=30, C0 =1.33; (ii) n=70,
C0 =1.46, by a two-point design fixing two points on the OC curve. These tables can be
used to implement the method of testing of hypothesis to carry out process capability
analysis through the index Cp . These points were corresponding to one low and one
high values of Cp . Hoffman (1993) had used another approach for testing the values of
Cp for many n and significance levels. Kirmani et al. (1991) considered the testing of Cp
when the data are collected as m subgroups of size n each. Kocherlakota (1992) also
made a study and arrived at critical values for Cp , which decides whether the process
is capable or not using the index Cp .
5.2 Definition and Role of Generalized P-values
When the underlying family of distributions contain two or more unknown parameters,
conventional tests are available only for special functions of the parameters. This is
because in many situations it is not possible or not easy to find test statistics having
distributions free of nuisance parameters. The need for extending the definition of con110
C HAPTER 5. H YPOTHESIS T ESTING
OF
P ROCESS C APABILITY I NDICES
ventional test statistics and p-value so that the resulting methods do not depend on nuisance parameters, had been felt by many. For example, the Behrens-Fisher problem
has been considered from many angles by Dempster (1967), Kempthorne and Folks
(1971), Fraser (1979) and Barnard (1984).
The origin of extended p-values or genealized p-values is the resultant of thinking
about an extension of the conventional p-value, which answers the case of nuisance
parameters also. In fact, Weerahandi (1987) used a generalized p-value for comparing
parameters of two regressions with unequal variances and established that it is the
exact probability of a well defined extreme region. Motivated by that application, the idea
of generalized p-value had been introduced formally by Tsui and Weerahandi (1989) to
deal with statistical testing problems in which nuisance parameters are present and it
is impossible or difficult to obtain nontrivial tests with a fixed level of significance. The
idea of generalized p-values may be given as follows:
Let S = (S1 , S2 , . . . , Sk ) ∈ Rk denote a statistic based on X = (X1 , X2 , . . . , Xn ).
In theory we can consider an independent copy X1∗ , X2∗ , . . . , Xn∗ of X1 , X2 , . . . , Xn and
denote the statistic based on Xi∗ ’s by S∗ . Thus S∗ is an independent copy of the observable random vector S whose distribution is indexed by ξ ∈ Rp . We will use s and
s∗ to denote realized values of S and S∗ respectively.
Suppose θ = π(ξ) be the parametric function of interest about which we want to
test a hypothesis of the form
H0 : θ ≤ θ0 against H1 : θ > θ0
(5.2.1)
Similar to the case of conventional p-value, to define the idea of generalized p-value, we
need the definition of what is known as a Generalized Test Variable (GTV) Tθ (S, S∗ , ξ)
which is a function of (S, S∗ , ξ) satisfying the following conditions.
(GTV1)
:
The conditional distribution of Tθ0 (S, S∗ , ξ), conditional on
S = s, is free of ξ
(GTV2)
(GTV3)
:
For every allowable s ∈ Rk ,
:
Tθ0 (s, s, ξ) = θ0
Tθ (S, S∗ , ξ) is stochastically increasing or decreasing in θ,
for every allowable s ∈ Rk
As in the case of test statistics, the search for generalized test variables can also be
confined to functions of sufficient statistics if exist, for ensuring no loss of information
about θ. Then for every t = Tθ (s, s, ξ) corresponding to s ∈ Rk , the generalized p-value
111
C HAPTER 5. H YPOTHESIS T ESTING
OF
P ROCESS C APABILITY I NDICES
for testing (5.2.1) is given by
(
p=
P [Tθ (S, S∗ , ξ)] ≥ t|θ = θ0 ],
P [Tθ (S, S∗ , ξ)] ≤ t|θ = θ0 ],
if Tθ (S, S∗ , ξ) is increasing in θ
if Tθ (S, S∗ , ξ) is decreasing in θ
It may be noted from the definition of a GPQ defined in section 1.6 that a GPQ is also
satisfying (GTV1) and (GTV2) of GTVs. So a GTV T̃θ (S, S∗ , ξ) for θ can always be
constructed by using the corresponding GPQ Rθ (S, S∗ , ξ) for θ by the relation,
T̃θ (S, S∗ , ξ) = Rθ (S, S∗ , ξ) − θ.
Then T̃θ (S, S∗ , ξ) will be a decreasing function in θ, and hence the generalized p-value
for testing (5.2.1) is given by
p = P [T̃θ (S, S∗ , ξ) ≤ 0|θ0 ]
(5.2.2)
as the observed value becomes zero. The generalized p-value is the exact probability
of an extreme region. Smaller the p-value, the greater is the evidence against the null
hypothesis.
The concept of generalized p-value has been used in a large number of problems
to test hypothesis regarding “nonstandard” quantities involving nuisance parameters.
Since the class of test statistics is a particular case of test variables, this is indeed a
generalization of conventional fixed-level test. Generalized fixed-level tests may be of
interest to decision theorists who insist on conventional fixed-level tests as well as to
others.
As in the case of GCIs, there were apprehensions regarding the holding of repeated
sampling properties by the generalized hypothesis tests using the idea of generalized
p-values. A comparison between the two-testing schemes is possible by comparing
the power and size of the conventional fixed level and generalized fixed level tests. A
number of simulation studies have been conducted to understand the power and size
of the generalized fixed level tests in various contexts. According to their findings, not
only did the generalized tests provide excellent approximate tests with nominal sizes,
but they are also as good as, or often substantially out perform, the other approximate
tests. Moreover, with typical values of nuisance parameters, they found the size of fixed
level tests based on generalized p-values to be less than the nominal size, suggesting
the holding of the repeated sampling property in a more desirable way. A few works in
this direction are: Griffths and Judge (1992), Thursby (1992), Weerahandi and Johnson
112
C HAPTER 5. H YPOTHESIS T ESTING
OF
P ROCESS C APABILITY I NDICES
(1992), Ananda and Weerahandi (1994), Zhou and Mathew (1994), Ananda (1999),
Krishamoorthy and Mathew (2002, 2003, 2004), Mcnally et al. (2003), Lin and Lee
(2005), Roy and Mathew (2005) and Kurian and Sebastian (2008).
5.3 Relation between GCIs and Generalized P-Values
As already noted, the construction of the generalized p-value required for performing
generalized hypothesis tests, is based on a suitable generalized test variable. This is
similar to the construction of a GCI using the definition of GPQ. It is already noted in
the previous section that a generalized test variable can be reached from a generalized
pivotal quantity. The route from confidence interval estimation to testing of hypothesis
may be explained in connection with that of the capability index Cpk .
It has been noticed in section 2.3.1 that RCpk =
d−|Rµ −M |
√
3 Rσ 2
is a GPQ for deriving the
GLCLs for Cpk , where Rµ and Rσ2 are defined by
r
Rµ = X̄ −
r
= X̄ −
and
R
σ2
√
n − 1 ((X̄ ∗ − µ)/(σ/ n))
p
S
n
(n − 1)S ∗2 /σ 2
n − 1 Z∗
S
n U∗
S 2σ2
(n − 1)S 2
=
=
S ∗2
U ∗2
where
Z∗ =
X̄ ∗ − µ
√ ∼ N (0, 1)
σ/ n
U ∗2 =
(n − 1)S ∗2
∼ χ2n−1
σ2
and
Now, let us consider the following equation
TCpk = RCpk − Cpk .
Then, it can be verified that TCpk satisfies the conditions (GTV1), (GTV2), (GTV3) and
TCpk is stochastically decreasing in Cpk . Therefore, the p-value for testing
H0 : Cpk ≤ C0 against Cpk > C0
(5.3.1)
may be given as P [TCpk ≤ 0|Cpk = Co ] as the observed value of TCpk is zero. This
probability may also be written as
P [RCpk − Cpk ≤ 0|Cpk = C0 ] = P [RCpk ≤ C0 ]
113
C HAPTER 5. H YPOTHESIS T ESTING
OF
P ROCESS C APABILITY I NDICES
We have calculated the 100αth percentile points of RCpk as the 100(1 − α)% lower
confidence limits for Cpk , together with its coverage probability in chapter 2. Actually,
what we have obtained is the expected value of the lower confidence limits as E(Gpk ).
Observing the connection between the coverage probability and the generalized pvalue, we can construct generalized test procedure based on the idea of generalized
p-value by making use of generalized lower confidence limits, and vice versa. Thus, if
Tα is the 100(1 − α)% generalized lower confidence limit computed in the case of Cpk ,
then the generalized test procedure for testing H0 : Cpk ≤ C0 verses H1 : Cpk > C0 ,
is to reject H0 if Tα > C0 , and otherwise accept. We have also computed the power of
the corresponding generalized test procedure based on the generalized p-value, for the
various situations considered in the simulation studies, described in the previous chapters in connection with the construction of lower confidence limits for various indices. It
has been noticed that the power of the generalized test procedure was quite satisfactory. The results of these simulation experiments are reported in Kurian and Sebastian
(2008). However, we are not reporting it here as we have presented the coverage probabilities in extensive volume in the previous chapters. The ultimate conclusion is that
the idea of generalized p-values is equally efficient in Hypothesis testing as its counterpart the idea of generalized confidence intervals in Interval estimation, when nuisance
parameters are involved in the corresponding problems.
Bibliography
Ananda, M.M.A. (1999). Estimation and testing of a parallel system with exponential
failure and repair times. Journal of Statistical Planning and Inference, 77 237-246.
Ananda, M.M.A. and Weerahandi, S. (1994). Implications of Generalized p-values
in conventional fixed level testing and confidence intervals. Technical Report,
Department of Mathematics, University of Nevada, Las Vegas.
Barnard, G.A. (1984). Comparing the means of two independent samples. Applied
Statistics, 33 266-271.
Dempster, A.P. (1967). Upper and lower probability inference based on a sample from
a finite univariate population. Biometrica, 54 515-528.
Fraser, D.A.S. (1979). Inference and Linear Models, New York, McGraw Hill.
Griffths, W., and Judge, G. (1992). Testing and Estimating location vectors when the
error co-variance matrix is unknown. Journal of Econometrics, 54 121-138.
114
C HAPTER 5. H YPOTHESIS T ESTING
OF
P ROCESS C APABILITY I NDICES
Hoffman, L.L. (1993). A general approach for testing the process capability index.
Quality and Reliability Engineering International, 9 445-449.
Kane, V.E. (1986). Process capability indices. Journal of Quality Technology, 18(1)
41-52.
Kempthorne, O. and Folks, L. (1971). Probability, Statistics, and Data Analysis, Ames.
Iowa Iowa State University Press.
Kirmani, S.N.U.A., Kocherlakota, K. and Kocherlakota, S. (1991). Estimation of σ
and the process capability index based on Sub-Samples. Communications in
Statistics Theory and Methods, 30 275-291.
Kocherlakota, S. (1992). Process capability index recent developments. Sankhya The
Indian Journal of Statistics, 54(B)3 352-369.
Krishnamoorthy, K. and Mathew, T. (2002). Assessing Occupational Exposure Via the
One-Way Random Effects Model With Balanced Data. Journal of Agricultural,
Biological and Environmental Statistics , 7 440-457.
Krishnamoorthy, K. and Mathew, T. (2003). Inferences on the means of lognormal
distributions using generalized p-values and generalized confidence intervals. J.
Statist. Plann. Inference, 115 103-120.
Krishnamoorthy, K. and Mathew, T. (2004). One-sided tolerance limits in balanced and
unbalanced one-way random models based on generalized confidence intervals.
Technometrics, 46(1) 44-52.
Kurian K.M. and Sebstian G. (2008). On the power of generalized tests for process
capability indices. STARS (to appear).
Lin, S.H. and Lee, J.C. (2005). Generalized inferences on the common mean of several normal populations. Journal of Statistical Planning and Inference, 134 568582.
Mcnally, R.J., Iyer, H. and Mathew, T. (2003). Tests for individual and population bioequivalence based on generalized p-values. Stat. Med., 22 31-53.
Roy, A. and Mathew, T. (2005). A generalized confidence limit for the reliability function
of a two-parameter exponential distribution. J. Statist. Plann. Inference , 128 509517.
115
C HAPTER 5. H YPOTHESIS T ESTING
OF
P ROCESS C APABILITY I NDICES
Thusby, J.G. (1992). A comparison of several exact and approximate tests for structural shift under heteroscedasticity. Journal of Econometrics, 53 363-386.
Tsui, K.W. and Weerahandi, S. (1989). Generalized P-Values in Significance Testing
of Hypotheses in the Presence of Nuisance Parameters. Journal of the American
Statistical Association, 84 602-607.
Weerahandi, S. (1987). Testing regression equality with unequal variances. Econometrica, 55 1211-1215.
Weerahandi, S. and Johnson, R.A. (1992). Testing reliability in a stress-strength model
when X and Y are normally distributed. Technometrics, 34 83-91.
Zhou, L. and Mathew, T. (1994). Some Tests for Variance Components Using Generalized P-Values. Technometrics, 36 394-402.
116
Related documents