欢迎来到麦多课文档分享! | 帮助中心 海量文档,免费浏览,给你所需,享你所想!
麦多课文档分享
全部分类
  • 标准规范>
  • 教学课件>
  • 考试资料>
  • 办公文档>
  • 学术论文>
  • 行业资料>
  • 易语言源码>
  • ImageVerifierCode 换一换
    首页 麦多课文档分享 > 资源分类 > PPT文档下载
    分享到微信 分享到微博 分享到QQ空间

    Basic Statistics - Concepts and Examples.ppt

    • 资源ID:378852       资源大小:980.50KB        全文页数:23页
    • 资源格式: PPT        下载积分:2000积分
    快捷下载 游客一键下载
    账号登录下载
    微信登录下载
    二维码
    微信扫一扫登录
    下载资源需要2000积分(如需开发票,请勿充值!)
    邮箱/手机:
    温馨提示:
    如需开发票,请勿充值!快捷下载时,用户名和密码都是您填写的邮箱或者手机号,方便查询和重复下载(系统自动生成)。
    如需开发票,请勿充值!如填写123,账号就是123,密码也是123。
    支付方式: 支付宝扫码支付    微信扫码支付   
    验证码:   换一换

    加入VIP,交流精品资源
     
    账号:
    密码:
    验证码:   换一换
      忘记密码?
        
    友情提示
    2、PDF文件下载后,可能会被浏览器默认打开,此种情况可以点击浏览器菜单,保存网页到桌面,就可以正常下载了。
    3、本站不支持迅雷下载,请使用电脑自带的IE浏览器,或者360浏览器、谷歌浏览器下载即可。
    4、本站资源下载后的文档和图纸-无水印,预览文档经过压缩,下载后原文更清晰。
    5、试题试卷类文档,如果标题没有明确说明有答案则都视为没有答案,请知晓。

    Basic Statistics - Concepts and Examples.ppt

    1、Basic Statistics - Concepts and Examples,Data sources: Data Reduction and Error Analysis for the Physical Sciences, Bevington, 1969The Statistics HomePage: http:/ Concepts,Variables: Variables are things that we measure, control, or manipulate in research. They differ in many respects, most notably

    2、in the role they are given in our research and in the type of measures that can be applied to them. Observational vs. experimental research. Most empirical research belongs clearly to one of those two general categories. In observational research we do not (or at least try not to) influence any vari

    3、ables but only measure them and look for relations (correlations) between some set of variables. In experimental research, we manipulate some variables and then measure the effects of this manipulation on other variables.Dependent vs. independent variables. Independent variables are those that are m

    4、anipulated whereas dependent variables are only measured or registered.,Variable Types and Information Content,Nominal variables allow for only qualitative classification. That is, they can be measured only in terms of whether the individual items belong to some distinctively different categories, b

    5、ut we cannot quantify or even rank order those categories. Typical examples of nominal variables are gender, race, color, city, etc. Ordinal variables allow us to rank order the items we measure in terms of which has less and which has more of the quality represented by the variable, but still they

    6、do not allow us to say “how much more.” A typical example of an ordinal variable is the socioeconomic status of families. Interval variables allow us not only to rank order the items that are measured, but also to quantify and compare the sizes of differences between them. For example, temperature,

    7、as measured in degrees Fahrenheit or Celsius, constitutes an interval scale. Ratio variables are very similar to interval variables; in addition to all the properties of interval variables, they feature an identifiable absolute zero point, thus they allow for statements such as x is two times more t

    8、han y. Typical examples of ratio scales are measures of time or space.,Measurement scales. Variables differ in “how well“ they can be measured. Measurement error involved in every measurement, which determines the “amount of information” obtained. Another factor is the variables “type of measurement

    9、 scale.“,Most statistical data analysis procedures do not distinguish between theinterval and ratio properties of the measurement scales.,Systematic and Random Errors,Error: Defined as the difference between a calculated or observed value and the “true” valueBlunders: Usually apparent either as obvi

    10、ously incorrect data points or results that are not reasonably close to the expected value. Easy to detect.Systematic Errors: Errors that occur reproducibly from faulty calibration of equipment or observer bias. Statistical analysis in generally not useful, but rather corrections must be made based

    11、on experimental conditions.Random Errors: Errors that result from the fluctuations in observations. Requires that experiments be repeated a sufficient number of time to establish the precision of measurement.,Accuracy vs. Precision,Accuracy: A measure of how close an experimental result is to the tr

    12、ue value.Precision: A measure of how exactly the result is determined. It is also a measure of how reproducible the result is.Absolute precision: indicates the uncertainty in the same units as the observation Relative precision: indicates the uncertainty in terms of a fraction of the value of the re

    13、sult,Uncertainties,In most cases, cannot know what the “true” value is unless there is an independent determination (i.e. different measurement technique).Only can consider estimates of the error. Discrepancy is the difference between two or more observations. This gives rise to uncertainty.Probable

    14、 Error: Indicates the magnitude of the error we estimate to have made in the measurements. Means that if we make a measurement that we “probably” wont be wrong by that amount.,Parent vs. Sample Populations,Parent population: Hypothetical probability distribution if we were to make an infinite number

    15、 of measurements of some variable or set of variables.Sample population: Actual set of experimental observations or measurements of some variable or set of variables. In General: (Parent Parameter) = lim (Sample Parameter)When the number of observations, N, goes to infinity.,N -,some univariate stat

    16、istical terms:,mode: value that occurs most frequently in a distribution(usually the highest point of curve)may have more than one mode in a dataset,median: value midway in the frequency distributionhalf the area of curve is to right and other to left,mean: arithmetic averagesum of all observations

    17、divided by # of observations,poor measure of central tendency in skewed distributions,range: measure of dispersion about mean(maximum minus minimum),when max and min are unusual values, range may bea misleading measure of dispersion,Distribution vs. Sample Size,histogram is a useful graphic represen

    18、tation of information content of sample or parent population,many statistical tests assume values are normally distributed,not always the case! examine data prior to processing,from: Jensen, 1996,Deviations,The deviation, di, of any measurement xi from the mean m of the parent distribution is define

    19、d as the difference between xi and m,Average deviation, a, is defined as the average of the magnitudes of the deviations, which is given by the absolute value of the deviations.,variance: average squared deviation of all possible observationsfrom a sample mean (calculated from sum of squares),standa

    20、rd deviation: positive square root of the variancesmall std dev: observations are clustered tightlyaround a central valuelarge std dev: observations are scattered widelyabout the mean,s2i = lim 1/N S (xi - )2,i=1,n,s2i =,S (xi - )2,i=1,n,N - 1,where: is the mean,xi is observed value, andN is the num

    21、ber of observations,N-,Number decreased from N to N - 1for the “sample” variance as is used in the calculation,Sample Mean and Standard Deviation,Sample Mean,Our best estimate of the standard deviation s would be from:,Sample Variance,But we cannot know the true parent mean so the best estimate of t

    22、he sample variance and standard deviation would be:,Distributions,Binomial Distribution: Allows us to define the probability, p, of observing x a specific combination of n items, which is derived from the fundamental formulas for the permutations and combinations.Permutations: Enumerate the number o

    23、f permutations, Pm(n,x), of coin flips, when we pick up the coins one at a time from a collection of n coins and put x of them into the “heads” box.,Distributions - cont.,Combinations: Relates to the number of ways we can combine the various permutations enumerated above from our coin flip experimen

    24、t. Thus the number of combinations is equal to the number of permutations divided by the degeneracy factor x! of the permutations.,Probability and the Binomial Distribution,Coin Toss Experiment: If p is the probability of success (landing heads up) is not necessarily equal to the probability q = 1 -

    25、 p for failure (landing tails up) because the coins may be lopsided!The probability for each of the combinations of x coins heads up and n -x coins tails up is equal to pxqn-x. The binomial distribution can be used to calculate the probability:,The coefficients PB(x,n,p) are closely related to the b

    26、inomial theorem for the expansion of a power of a sum:,Mean and Variance: Binomial Distribution,The mean of the binomial distribution is evaluated by combining the definition of with the function that defines the probability, yielding:,The average of the number of successes will approach a mean valu

    27、e given by the probability for success of each item p times the number of items. For the coin toss experiment p=1/2, half the coins should land heads up on average.,If the the probability for a single success p is equal to the probability for failure p=q=1/2, the final distribution is symmetric abou

    28、t the mean and mode and median equal the mean. The variance, s2 = m/2.,Other Probability Distributions: Special Cases,Poisson Distribution: An approximation to the binomial distribution for the special case when the average number of successes is very much smaller than the possible number i.e. n bec

    29、ause p 1. Important for the study of such phenomena as radioactive decay. Distribution is NOT necessarily symmetric! Data are usually bounded on one side and not the other. Advantage is that s2 = m., = 1.67 s = 1.29, = 10.0 s = 3.16,Gaussian or Normal Error Distribution Details,Gaussian Distribution

    30、: Most important probability distribution in the statistical analysis of experimental data. functional form is relatively simple and the resultant distribution is reasonable. Again this is a special limiting case to the binomial distribution where the number of possible different observations, n, be

    31、comes infinitely large yielding np 1. Most probable estimate of the mean from a random sample of observations is the average of those observations!,G = 2.354s,Tangent along the steepest portion of the probability curve intersects at e-1/2 and intersects x axis at the points x = 2s,Probable Error (P.

    32、E.) is defined as the absolute value of the deviation such that PG of the deviation of any random observation is 1/2,P.E. = 0.6745s = 0.2865 G,For gaussian or normal error distributions:Total area underneath curve is 1.00 (100%)68.27% of observations lie within 1 std dev of mean95% of observations l

    33、ie within 2 std dev of mean99% of observations lie within 3 std dev of mean,Variance, standard deviation, probable error, mean, and weighted root mean square error are commonly used statistical terms in geodesy.,compare (rather than attach significance to numerical value),Gaussian Details, cont.,The

    34、 probability function for the Gaussian distribution is defined as:,The integral probability evaluated between the limits of zs, Where z is the dimensionless range z = |x -|/s is:,Gaussian Density vs. Distribution Functions,Lorentzian or Cauchy Distribution,Lorentzian Distribution: Similar distributi

    35、on function but unrelated to binomial distribution. Useful for describing data related to resonance phenomena, with particular applications in nuclear physics (e.g. Mssbauer effect). Distribution is symmetric about . Distinctly different probability distribution from Gaussian function. Mean and standard deviation not simply defined.,G=Full Width at Half-Maximum,


    注意事项

    本文(Basic Statistics - Concepts and Examples.ppt)为本站会员(figureissue185)主动上传,麦多课文档分享仅提供信息存储空间,仅对用户上传内容的表现方式做保护处理,对上载内容本身不做任何修改或编辑。 若此文所含内容侵犯了您的版权或隐私,请立即通知麦多课文档分享(点击联系客服),我们立即给予删除!




    关于我们 - 网站声明 - 网站地图 - 资源地图 - 友情链接 - 网站客服 - 联系我们

    copyright@ 2008-2019 麦多课文库(www.mydoc123.com)网站版权所有
    备案/许可证编号:苏ICP备17064731号-1 

    收起
    展开