ASTM D7915-2018 red 3750 Standard Practice for Application of Generalized Extreme Studentized Deviate (GESD) Technique to Simultaneously Identify Multiple Outliers in a Data Set.pdf
《ASTM D7915-2018 red 3750 Standard Practice for Application of Generalized Extreme Studentized Deviate (GESD) Technique to Simultaneously Identify Multiple Outliers in a Data Set.pdf》由会员分享,可在线阅读,更多相关《ASTM D7915-2018 red 3750 Standard Practice for Application of Generalized Extreme Studentized Deviate (GESD) Technique to Simultaneously Identify Multiple Outliers in a Data Set.pdf(7页珍藏版)》请在麦多课文档分享上搜索。
1、Designation: D7915 14D7915 18 An American National StandardStandard Practice forApplication of Generalized Extreme Studentized Deviate(GESD) Technique to Simultaneously Identify MultipleOutliers in a Data Set1This standard is issued under the fixed designation D7915; the number immediately following
2、 the designation indicates the year oforiginal adoption or, in the case of revision, the year of last revision. A number in parentheses indicates the year of last reapproval. Asuperscript epsilon () indicates an editorial change since the last revision or reapproval.1. Scope Scope*1.1 This practice
3、provides a step by step procedure for the application of the Generalized Extreme Studentized Deviate (GESD)Many-Outlier Procedure to simultaneously identify multiple outliers in a data set. (See Bibliography.)1.2 This practice is applicable to a data set comprising observations that is represented o
4、n a continuous numerical scale.1.3 This practice is applicable to a data set comprising a minimum of six observations.1.4 This practice is applicable to a data set where the normal (Gaussian) model is reasonably adequate for the distributionalrepresentation of the observations in the data set.1.5 Th
5、e probability of false identification of outliers associated with the decision criteria set by this practice is 0.01.1.6 It is recommended that the execution of this practice be conducted under the guidance of personnel familiar with thestatistical principles and assumptions associated with the GESD
6、 technique.1.7 This standard does not purport to address all of the safety concerns, if any, associated with its use. It is the responsibilityof the user of this standard to establish appropriate safety safety, health, and healthenvironmental practices and determine theapplicability of regulatory li
7、mitations prior to use.1.8 This international standard was developed in accordance with internationally recognized principles on standardizationestablished in the Decision on Principles for the Development of International Standards, Guides and Recommendations issuedby the World Trade Organization T
8、echnical Barriers to Trade (TBT) Committee.2. Terminology2.1 Definitions of Terms Specific to This Standard:2.1.1 outlier, nan observation (or a subset of observations) which appears to be inconsistent with the remainder of the dataset.3. Significance and Use3.1 The GESD procedure can be used to sim
9、ultaneously identify up to a pre-determined number of outliers (r) in a data set,without having to pre-examine the data set and make a priori decisions as to the location and number of potential outliers.3.2 The GESD procedure is robust to masking. Masking describes the phenomenon where the existenc
10、e of multiple outliers canprevent an outlier identification procedure from declaring any of the observations in a data set to be outliers.3.3 The GESD procedure is automation-friendly, and hence can easily be programmed as automated computer algorithms.4. Procedure4.1 Specify the maximum number of o
11、utliers (r) in a data set to be identified. This is the number of cycles required to beexecuted (see 4.2) for the identification of up to r outliers.4.1.1 The recommended maximum number of outliers (r) by this practice is two (2) for data sets with six to twelve observations.1 This practice is under
12、 the jurisdiction of ASTM Committee D02 on Petroleum Products, Liquid Fuels, and Lubricants and is the direct responsibility of SubcommitteeD02.94 on Coordinating Subcommittee on Quality Assurance and Statistics.Current edition approved May 1, 2014July 1, 2018. Published June 2014August 2018. Origin
13、ally approved in 1988. Last previous edition approved in 2014 as D7915 14.DOI: 10.1520/D7915-14.10.1520/D7915-18.This document is not an ASTM standard and is intended only to provide the user of an ASTM standard an indication of what changes have been made to the previous version. Becauseit may not
14、be technically possible to adequately depict all changes accurately, ASTM recommends that users consult prior editions as appropriate. In all cases only the current versionof the standard as published by ASTM is to be considered the official document.*A Summary of Changes section appears at the end
15、of this standardCopyright ASTM International, 100 Barr Harbor Drive, PO Box C700, West Conshohocken, PA 19428-2959. United States14.1.2 For data sets with more than twelve observations, the recommended maximum number of outliers (r) is the lesser of ten(10) or 20 %.4.1.3 The recommended values for r
16、 in 4.1.1 and 4.1.2 are not intended to be mandatory. Users can specify other values basedon their specific needs.4.2 Set the current cycle number c to 1 (c = 1).4.2.1 Assign the original data set to be assessed (in 4.1) as the data set for the current cycle 1 and label it as DTS1.4.3 Compute test s
17、tatistic T for each observation in the initial starting data set assigned to the current cycle (DTS0c) as follows:T 5|x 2x|s (1)where:x = an observation in the data set,x = average calculated using all observations in the data set, ands = sample standard deviation calculated using all observations i
18、n the data set.4.4 RemoveIdentify the observation in the data set associated with the largest absolute magnitude of the test statistic T and forma reduced in the data set (DTSi), where i = number of observations removed from the initial data set.of the current cycle.4.4 Re-calculate T for all observ
19、ations in the reduced data set from 4.3.4.5 Repeat steps If current cycle c is less than r, execute 4.34.5.1 to 4.44.5.4 until; otherwise rgo numberto 4.6of observationshave been removed from the initial data set. That is, until calculation of all .Ts for all observations in the reduced data set DTS
20、rhas been completed.4.5.1 Remove the observation identified in 4.4 from the data set of the current cycle.4.5.2 Increment the current cycle number by 1:c = ccurrent + 1.4.5.3 Assign the reduced data set in 4.5.1 (that is, data set with the observation identified in 4.4 removed) as the data set forth
21、e new cycle number and label it as DTSc.4.5.4 Repeat steps 4.3 to 4.5.4.6 Compare Beginning with c = r, compare the maximum T computed in each data set (DTSthe dataset 0 to DTSrc), to a criticalvalue critical associated with the data set DTSfori, cycle c, where critical is chosen based on a false id
22、entification probability of 0.01.See Table A1.1 in Annex A1 for critical values applicable to different data set sizes.sizes and cycle numbers (c).4.7 Identify the data set DTSm for which the maximum T exceeds critical, and m (number of observations removed from theinitial data set DTS0) is the larg
23、est value (0 DTS01 T0 DTS1 T1 DTS2 T2 DTS3 T3 DTS4 T4 DTS5 T5 DTS6 T635.0 0.30 35.0 0.44 35.0 0.64 35.0 0.97 35.0 0.94 35.0 1.05 35.0 1.1635.0 0.30 35.0 0.44 35.0 0.64 35.0 0.97 35.0 0.94 35.0 1.0536.6 0.05 36.6 0.04 36.6 0.17 36.6 0.37 36.6 0.32 36.6 0.40 36.6 0.4936.6 0.05 36.6 0.04 36.6 0.17 36.6
24、 0.37 36.6 0.32 36.6 0.4034.7 0.37 34.7 0.52 34.7 0.73 34.7 1.08 34.7 1.06 34.7 1.17 34.7 1.2934.7 0.37 34.7 0.52 34.7 0.73 34.7 1.08 34.7 1.06 34.7 1.1736.2 0.04 36.2 0.14 36.2 0.29 36.2 0.52 36.2 0.48 36.2 0.56 36.2 0.6636.2 0.04 36.2 0.14 36.2 0.29 36.2 0.52 36.2 0.48 36.2 0.5637.0 0.14 37.0 0.06
- 1.请仔细阅读文档,确保文档完整性,对于不预览、不比对内容而直接下载带来的问题本站不予受理。
- 2.下载的文档,不会出现我们的网址水印。
- 3、该文档所得收入(下载+内容+预览)归上传者、原创作者;如果您是本文档原作者,请点此认领!既往收益都归您。
下载文档到电脑,查找使用更方便
5000 积分 0人已下载
下载 | 加入VIP,交流精品资源 |
- 配套讲稿:
如PPT文件的首页显示word图标,表示该PPT已包含配套word讲稿。双击word图标可打开word文档。
- 特殊限制:
部分文档作品中含有的国旗、国徽等图片,仅作为作品整体效果示例展示,禁止商用。设计者仅对作品中独创性部分享有著作权。
- 关 键 词:
- ASTMD79152018RED3750STANDARDPRACTICEFORAPPLICATIONOFGENERALIZEDEXTREMESTUDENTIZEDDEVIATEGESDTECHNIQUETOSIMULTANEOUSLYIDENTIFYMULTIPLEOUTLIERSINADATASETPDF

链接地址:http://www.mydoc123.com/p-526514.html