Survey
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
CHAPTER 5 Hypothesis Testing of Process Capability Indices 5.1 Statement of the Problem 109 5.2 Definition and Role of Generalized P-values 110 5.3 Relation between GCIs and Generalized P-Values 113 5.1 Statement of the Problem Statistics is all about inferring every thing possible about an entire population based on a few random samples taken from that population. This inference is usually about a few parameters of a particular probability distribution which, we suspect, would adequately model the population of interest. The accuracy of the estimates of the parameters is heavily dependent on the sample size, sampling technique, method of estimation etc. One way of qualifying a single estimate of a statistic is through confidence intervals. Instead of a single estimate of a parameter, the accuracy of which is not sure about, confidence intervals provide a range of values within which we can be reasonably sure that the true parameter value lies. But the uncertainty regarding the accuracy of the estimate, still persists to some extent. In hypothesis testing we determine whether a hypothesized value of the parameter is admissible or not, based on random samples taken from the process and the parameter estimate derived from it. A hypothesis test, this way settles doubt regarding the true value of the unknown parameter, to the more 109 C HAPTER 5. H YPOTHESIS T ESTING OF P ROCESS C APABILITY I NDICES satisfaction of the investigator. At least in few cases, we have to give a clear answer in the form of either accepting or rejecting a hypothesized value of the parameter. In quality control situations like process capability analysis, we may be required to give a definite decision in the form of either accepting or rejecting the capability of the process. That is, we have to test hypothesis of the form H0 : the process is not capable against H1 : the process is capable. If C is the capability index used to assess the capability of the process such that the threshold value of C , that makes the process capable is C0 , then we have the one-sided statistical testing problem H0 : C ≤ C0 (not capable) H1 : C > C0 (capable) against (5.1.1) The value of C0 is never found smaller than one, but in many cases equal to 1.33. Kane (1986) had investigated the testing of (5.1) in connection with Cp index, and succeeded in preparing a table giving critical values of C0 , which can be used to accept or reject the hypothesis corresponding to various values of n. He had drawn two Operating Characteristic (OC) curves for two situations: (i) n=30, C0 =1.33; (ii) n=70, C0 =1.46, by a two-point design fixing two points on the OC curve. These tables can be used to implement the method of testing of hypothesis to carry out process capability analysis through the index Cp . These points were corresponding to one low and one high values of Cp . Hoffman (1993) had used another approach for testing the values of Cp for many n and significance levels. Kirmani et al. (1991) considered the testing of Cp when the data are collected as m subgroups of size n each. Kocherlakota (1992) also made a study and arrived at critical values for Cp , which decides whether the process is capable or not using the index Cp . 5.2 Definition and Role of Generalized P-values When the underlying family of distributions contain two or more unknown parameters, conventional tests are available only for special functions of the parameters. This is because in many situations it is not possible or not easy to find test statistics having distributions free of nuisance parameters. The need for extending the definition of con110 C HAPTER 5. H YPOTHESIS T ESTING OF P ROCESS C APABILITY I NDICES ventional test statistics and p-value so that the resulting methods do not depend on nuisance parameters, had been felt by many. For example, the Behrens-Fisher problem has been considered from many angles by Dempster (1967), Kempthorne and Folks (1971), Fraser (1979) and Barnard (1984). The origin of extended p-values or genealized p-values is the resultant of thinking about an extension of the conventional p-value, which answers the case of nuisance parameters also. In fact, Weerahandi (1987) used a generalized p-value for comparing parameters of two regressions with unequal variances and established that it is the exact probability of a well defined extreme region. Motivated by that application, the idea of generalized p-value had been introduced formally by Tsui and Weerahandi (1989) to deal with statistical testing problems in which nuisance parameters are present and it is impossible or difficult to obtain nontrivial tests with a fixed level of significance. The idea of generalized p-values may be given as follows: Let S = (S1 , S2 , . . . , Sk ) ∈ Rk denote a statistic based on X = (X1 , X2 , . . . , Xn ). In theory we can consider an independent copy X1∗ , X2∗ , . . . , Xn∗ of X1 , X2 , . . . , Xn and denote the statistic based on Xi∗ ’s by S∗ . Thus S∗ is an independent copy of the observable random vector S whose distribution is indexed by ξ ∈ Rp . We will use s and s∗ to denote realized values of S and S∗ respectively. Suppose θ = π(ξ) be the parametric function of interest about which we want to test a hypothesis of the form H0 : θ ≤ θ0 against H1 : θ > θ0 (5.2.1) Similar to the case of conventional p-value, to define the idea of generalized p-value, we need the definition of what is known as a Generalized Test Variable (GTV) Tθ (S, S∗ , ξ) which is a function of (S, S∗ , ξ) satisfying the following conditions. (GTV1) : The conditional distribution of Tθ0 (S, S∗ , ξ), conditional on S = s, is free of ξ (GTV2) (GTV3) : For every allowable s ∈ Rk , : Tθ0 (s, s, ξ) = θ0 Tθ (S, S∗ , ξ) is stochastically increasing or decreasing in θ, for every allowable s ∈ Rk As in the case of test statistics, the search for generalized test variables can also be confined to functions of sufficient statistics if exist, for ensuring no loss of information about θ. Then for every t = Tθ (s, s, ξ) corresponding to s ∈ Rk , the generalized p-value 111 C HAPTER 5. H YPOTHESIS T ESTING OF P ROCESS C APABILITY I NDICES for testing (5.2.1) is given by ( p= P [Tθ (S, S∗ , ξ)] ≥ t|θ = θ0 ], P [Tθ (S, S∗ , ξ)] ≤ t|θ = θ0 ], if Tθ (S, S∗ , ξ) is increasing in θ if Tθ (S, S∗ , ξ) is decreasing in θ It may be noted from the definition of a GPQ defined in section 1.6 that a GPQ is also satisfying (GTV1) and (GTV2) of GTVs. So a GTV T̃θ (S, S∗ , ξ) for θ can always be constructed by using the corresponding GPQ Rθ (S, S∗ , ξ) for θ by the relation, T̃θ (S, S∗ , ξ) = Rθ (S, S∗ , ξ) − θ. Then T̃θ (S, S∗ , ξ) will be a decreasing function in θ, and hence the generalized p-value for testing (5.2.1) is given by p = P [T̃θ (S, S∗ , ξ) ≤ 0|θ0 ] (5.2.2) as the observed value becomes zero. The generalized p-value is the exact probability of an extreme region. Smaller the p-value, the greater is the evidence against the null hypothesis. The concept of generalized p-value has been used in a large number of problems to test hypothesis regarding “nonstandard” quantities involving nuisance parameters. Since the class of test statistics is a particular case of test variables, this is indeed a generalization of conventional fixed-level test. Generalized fixed-level tests may be of interest to decision theorists who insist on conventional fixed-level tests as well as to others. As in the case of GCIs, there were apprehensions regarding the holding of repeated sampling properties by the generalized hypothesis tests using the idea of generalized p-values. A comparison between the two-testing schemes is possible by comparing the power and size of the conventional fixed level and generalized fixed level tests. A number of simulation studies have been conducted to understand the power and size of the generalized fixed level tests in various contexts. According to their findings, not only did the generalized tests provide excellent approximate tests with nominal sizes, but they are also as good as, or often substantially out perform, the other approximate tests. Moreover, with typical values of nuisance parameters, they found the size of fixed level tests based on generalized p-values to be less than the nominal size, suggesting the holding of the repeated sampling property in a more desirable way. A few works in this direction are: Griffths and Judge (1992), Thursby (1992), Weerahandi and Johnson 112 C HAPTER 5. H YPOTHESIS T ESTING OF P ROCESS C APABILITY I NDICES (1992), Ananda and Weerahandi (1994), Zhou and Mathew (1994), Ananda (1999), Krishamoorthy and Mathew (2002, 2003, 2004), Mcnally et al. (2003), Lin and Lee (2005), Roy and Mathew (2005) and Kurian and Sebastian (2008). 5.3 Relation between GCIs and Generalized P-Values As already noted, the construction of the generalized p-value required for performing generalized hypothesis tests, is based on a suitable generalized test variable. This is similar to the construction of a GCI using the definition of GPQ. It is already noted in the previous section that a generalized test variable can be reached from a generalized pivotal quantity. The route from confidence interval estimation to testing of hypothesis may be explained in connection with that of the capability index Cpk . It has been noticed in section 2.3.1 that RCpk = d−|Rµ −M | √ 3 Rσ 2 is a GPQ for deriving the GLCLs for Cpk , where Rµ and Rσ2 are defined by r Rµ = X̄ − r = X̄ − and R σ2 √ n − 1 ((X̄ ∗ − µ)/(σ/ n)) p S n (n − 1)S ∗2 /σ 2 n − 1 Z∗ S n U∗ S 2σ2 (n − 1)S 2 = = S ∗2 U ∗2 where Z∗ = X̄ ∗ − µ √ ∼ N (0, 1) σ/ n U ∗2 = (n − 1)S ∗2 ∼ χ2n−1 σ2 and Now, let us consider the following equation TCpk = RCpk − Cpk . Then, it can be verified that TCpk satisfies the conditions (GTV1), (GTV2), (GTV3) and TCpk is stochastically decreasing in Cpk . Therefore, the p-value for testing H0 : Cpk ≤ C0 against Cpk > C0 (5.3.1) may be given as P [TCpk ≤ 0|Cpk = Co ] as the observed value of TCpk is zero. This probability may also be written as P [RCpk − Cpk ≤ 0|Cpk = C0 ] = P [RCpk ≤ C0 ] 113 C HAPTER 5. H YPOTHESIS T ESTING OF P ROCESS C APABILITY I NDICES We have calculated the 100αth percentile points of RCpk as the 100(1 − α)% lower confidence limits for Cpk , together with its coverage probability in chapter 2. Actually, what we have obtained is the expected value of the lower confidence limits as E(Gpk ). Observing the connection between the coverage probability and the generalized pvalue, we can construct generalized test procedure based on the idea of generalized p-value by making use of generalized lower confidence limits, and vice versa. Thus, if Tα is the 100(1 − α)% generalized lower confidence limit computed in the case of Cpk , then the generalized test procedure for testing H0 : Cpk ≤ C0 verses H1 : Cpk > C0 , is to reject H0 if Tα > C0 , and otherwise accept. We have also computed the power of the corresponding generalized test procedure based on the generalized p-value, for the various situations considered in the simulation studies, described in the previous chapters in connection with the construction of lower confidence limits for various indices. It has been noticed that the power of the generalized test procedure was quite satisfactory. The results of these simulation experiments are reported in Kurian and Sebastian (2008). However, we are not reporting it here as we have presented the coverage probabilities in extensive volume in the previous chapters. The ultimate conclusion is that the idea of generalized p-values is equally efficient in Hypothesis testing as its counterpart the idea of generalized confidence intervals in Interval estimation, when nuisance parameters are involved in the corresponding problems. Bibliography Ananda, M.M.A. (1999). Estimation and testing of a parallel system with exponential failure and repair times. Journal of Statistical Planning and Inference, 77 237-246. Ananda, M.M.A. and Weerahandi, S. (1994). Implications of Generalized p-values in conventional fixed level testing and confidence intervals. Technical Report, Department of Mathematics, University of Nevada, Las Vegas. Barnard, G.A. (1984). Comparing the means of two independent samples. Applied Statistics, 33 266-271. Dempster, A.P. (1967). Upper and lower probability inference based on a sample from a finite univariate population. Biometrica, 54 515-528. Fraser, D.A.S. (1979). Inference and Linear Models, New York, McGraw Hill. Griffths, W., and Judge, G. (1992). Testing and Estimating location vectors when the error co-variance matrix is unknown. Journal of Econometrics, 54 121-138. 114 C HAPTER 5. H YPOTHESIS T ESTING OF P ROCESS C APABILITY I NDICES Hoffman, L.L. (1993). A general approach for testing the process capability index. Quality and Reliability Engineering International, 9 445-449. Kane, V.E. (1986). Process capability indices. Journal of Quality Technology, 18(1) 41-52. Kempthorne, O. and Folks, L. (1971). Probability, Statistics, and Data Analysis, Ames. Iowa Iowa State University Press. Kirmani, S.N.U.A., Kocherlakota, K. and Kocherlakota, S. (1991). Estimation of σ and the process capability index based on Sub-Samples. Communications in Statistics Theory and Methods, 30 275-291. Kocherlakota, S. (1992). Process capability index recent developments. Sankhya The Indian Journal of Statistics, 54(B)3 352-369. Krishnamoorthy, K. and Mathew, T. (2002). Assessing Occupational Exposure Via the One-Way Random Effects Model With Balanced Data. Journal of Agricultural, Biological and Environmental Statistics , 7 440-457. Krishnamoorthy, K. and Mathew, T. (2003). Inferences on the means of lognormal distributions using generalized p-values and generalized confidence intervals. J. Statist. Plann. Inference, 115 103-120. Krishnamoorthy, K. and Mathew, T. (2004). One-sided tolerance limits in balanced and unbalanced one-way random models based on generalized confidence intervals. Technometrics, 46(1) 44-52. Kurian K.M. and Sebstian G. (2008). On the power of generalized tests for process capability indices. STARS (to appear). Lin, S.H. and Lee, J.C. (2005). Generalized inferences on the common mean of several normal populations. Journal of Statistical Planning and Inference, 134 568582. Mcnally, R.J., Iyer, H. and Mathew, T. (2003). Tests for individual and population bioequivalence based on generalized p-values. Stat. Med., 22 31-53. Roy, A. and Mathew, T. (2005). A generalized confidence limit for the reliability function of a two-parameter exponential distribution. J. Statist. Plann. Inference , 128 509517. 115 C HAPTER 5. H YPOTHESIS T ESTING OF P ROCESS C APABILITY I NDICES Thusby, J.G. (1992). A comparison of several exact and approximate tests for structural shift under heteroscedasticity. Journal of Econometrics, 53 363-386. Tsui, K.W. and Weerahandi, S. (1989). Generalized P-Values in Significance Testing of Hypotheses in the Presence of Nuisance Parameters. Journal of the American Statistical Association, 84 602-607. Weerahandi, S. (1987). Testing regression equality with unequal variances. Econometrica, 55 1211-1215. Weerahandi, S. and Johnson, R.A. (1992). Testing reliability in a stress-strength model when X and Y are normally distributed. Technometrics, 34 83-91. Zhou, L. and Mathew, T. (1994). Some Tests for Variance Components Using Generalized P-Values. Technometrics, 36 394-402. 116