TY - JOUR
T1 - VAMP1RE
T2 - A single criterion for rating and ranking confidence-interval procedures
AU - Yeh, Yingchieh
AU - Schmeiser, Bruce W.
N1 - Publisher Copyright:
© 2015 Copyright © IIE Transactions.
PY - 2015/11/2
Y1 - 2015/11/2
N2 - We propose VAMP1RE, a single criterion for rating and ranking confidence-interval procedures (CIPs) that use a fixed sample size. The quality of a CIP is traditionally thought to be many dimensional, typically composed of the probability of covering the unknown performance measure and the mean (and sometimes the standard deviation) of interval width, each of these over some set of nominal coverage probabilities. These many criteria reflect symptoms, rather than causes, of CIP quality. The VAMP1RE criterion focuses on two causes: departure from validity-violation of assumptions-and inability to mimic-the dissimilarity, for every data set, of a CIPs interval to that of an ideal CIP. The ideal CIP is both valid (that is, adheres to all assumptions) and is an agreed-upon standard; possibly the ideal CIP is allowed knowledge not available to the real-world CIPs of interest. A high inability to mimic the ideal CIP implies that a CIP uses data inefficiently. For a given CIP, the VAMP1RE criterion is the expected squared difference between Schrubens coverage values (analogous to p values) arising from the given CIP and from the ideal CIP. The implication is that an interval arising from a particular data set is good not because it is large or small but, rather, it is good to the extent that it is similar to the interval provided by the ideal CIP. We discuss the relationship to Schrubens coverage function, provide a graphical interpretation, decompose the VAMP1RE criterion into the two cause components, and provide examples to illustrate that the VAMP1RE criterion provides numerical values that are useful for rating and ranking CIPs.
AB - We propose VAMP1RE, a single criterion for rating and ranking confidence-interval procedures (CIPs) that use a fixed sample size. The quality of a CIP is traditionally thought to be many dimensional, typically composed of the probability of covering the unknown performance measure and the mean (and sometimes the standard deviation) of interval width, each of these over some set of nominal coverage probabilities. These many criteria reflect symptoms, rather than causes, of CIP quality. The VAMP1RE criterion focuses on two causes: departure from validity-violation of assumptions-and inability to mimic-the dissimilarity, for every data set, of a CIPs interval to that of an ideal CIP. The ideal CIP is both valid (that is, adheres to all assumptions) and is an agreed-upon standard; possibly the ideal CIP is allowed knowledge not available to the real-world CIPs of interest. A high inability to mimic the ideal CIP implies that a CIP uses data inefficiently. For a given CIP, the VAMP1RE criterion is the expected squared difference between Schrubens coverage values (analogous to p values) arising from the given CIP and from the ideal CIP. The implication is that an interval arising from a particular data set is good not because it is large or small but, rather, it is good to the extent that it is similar to the interval provided by the ideal CIP. We discuss the relationship to Schrubens coverage function, provide a graphical interpretation, decompose the VAMP1RE criterion into the two cause components, and provide examples to illustrate that the VAMP1RE criterion provides numerical values that are useful for rating and ranking CIPs.
KW - estimation
KW - output analysis
KW - p values
KW - simulation
KW - statistical analysis
UR - http://www.scopus.com/inward/record.url?scp=84940461424&partnerID=8YFLogxK
U2 - 10.1080/0740817X.2015.1047068
DO - 10.1080/0740817X.2015.1047068
M3 - 期刊論文
AN - SCOPUS:84940461424
SN - 0740-817X
VL - 47
SP - 1203
EP - 1216
JO - IIE Transactions (Institute of Industrial Engineers)
JF - IIE Transactions (Institute of Industrial Engineers)
IS - 11
ER -