Skip to main content
Engineering LibreTexts

7.4: Analysis with finite statistics

  • Page ID
    122624
  • \( \newcommand{\vecs}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}} } \)

    \( \newcommand{\vecd}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash {#1}}} \)

    \( \newcommand{\dsum}{\displaystyle\sum\limits} \)

    \( \newcommand{\dint}{\displaystyle\int\limits} \)

    \( \newcommand{\dlim}{\displaystyle\lim\limits} \)

    \( \newcommand{\id}{\mathrm{id}}\) \( \newcommand{\Span}{\mathrm{span}}\)

    ( \newcommand{\kernel}{\mathrm{null}\,}\) \( \newcommand{\range}{\mathrm{range}\,}\)

    \( \newcommand{\RealPart}{\mathrm{Re}}\) \( \newcommand{\ImaginaryPart}{\mathrm{Im}}\)

    \( \newcommand{\Argument}{\mathrm{Arg}}\) \( \newcommand{\norm}[1]{\| #1 \|}\)

    \( \newcommand{\inner}[2]{\langle #1, #2 \rangle}\)

    \( \newcommand{\Span}{\mathrm{span}}\)

    \( \newcommand{\id}{\mathrm{id}}\)

    \( \newcommand{\Span}{\mathrm{span}}\)

    \( \newcommand{\kernel}{\mathrm{null}\,}\)

    \( \newcommand{\range}{\mathrm{range}\,}\)

    \( \newcommand{\RealPart}{\mathrm{Re}}\)

    \( \newcommand{\ImaginaryPart}{\mathrm{Im}}\)

    \( \newcommand{\Argument}{\mathrm{Arg}}\)

    \( \newcommand{\norm}[1]{\| #1 \|}\)

    \( \newcommand{\inner}[2]{\langle #1, #2 \rangle}\)

    \( \newcommand{\Span}{\mathrm{span}}\) \( \newcommand{\AA}{\unicode[.8,0]{x212B}}\)

    \( \newcommand{\vectorA}[1]{\vec{#1}}      % arrow\)

    \( \newcommand{\vectorAt}[1]{\vec{\text{#1}}}      % arrow\)

    \( \newcommand{\vectorB}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}} } \)

    \( \newcommand{\vectorC}[1]{\textbf{#1}} \)

    \( \newcommand{\vectorD}[1]{\overrightarrow{#1}} \)

    \( \newcommand{\vectorDt}[1]{\overrightarrow{\text{#1}}} \)

    \( \newcommand{\vectE}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash{\mathbf {#1}}}} \)

    \( \newcommand{\vecs}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}} } \)

    \( \newcommand{\vecd}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash {#1}}} \)

    \(\newcommand{\avec}{\mathbf a}\) \(\newcommand{\bvec}{\mathbf b}\) \(\newcommand{\cvec}{\mathbf c}\) \(\newcommand{\dvec}{\mathbf d}\) \(\newcommand{\dtil}{\widetilde{\mathbf d}}\) \(\newcommand{\evec}{\mathbf e}\) \(\newcommand{\fvec}{\mathbf f}\) \(\newcommand{\nvec}{\mathbf n}\) \(\newcommand{\pvec}{\mathbf p}\) \(\newcommand{\qvec}{\mathbf q}\) \(\newcommand{\svec}{\mathbf s}\) \(\newcommand{\tvec}{\mathbf t}\) \(\newcommand{\uvec}{\mathbf u}\) \(\newcommand{\vvec}{\mathbf v}\) \(\newcommand{\wvec}{\mathbf w}\) \(\newcommand{\xvec}{\mathbf x}\) \(\newcommand{\yvec}{\mathbf y}\) \(\newcommand{\zvec}{\mathbf z}\) \(\newcommand{\rvec}{\mathbf r}\) \(\newcommand{\mvec}{\mathbf m}\) \(\newcommand{\zerovec}{\mathbf 0}\) \(\newcommand{\onevec}{\mathbf 1}\) \(\newcommand{\real}{\mathbb R}\) \(\newcommand{\twovec}[2]{\left[\begin{array}{r}#1 \\ #2 \end{array}\right]}\) \(\newcommand{\ctwovec}[2]{\left[\begin{array}{c}#1 \\ #2 \end{array}\right]}\) \(\newcommand{\threevec}[3]{\left[\begin{array}{r}#1 \\ #2 \\ #3 \end{array}\right]}\) \(\newcommand{\cthreevec}[3]{\left[\begin{array}{c}#1 \\ #2 \\ #3 \end{array}\right]}\) \(\newcommand{\fourvec}[4]{\left[\begin{array}{r}#1 \\ #2 \\ #3 \\ #4 \end{array}\right]}\) \(\newcommand{\cfourvec}[4]{\left[\begin{array}{c}#1 \\ #2 \\ #3 \\ #4 \end{array}\right]}\) \(\newcommand{\fivevec}[5]{\left[\begin{array}{r}#1 \\ #2 \\ #3 \\ #4 \\ #5 \\ \end{array}\right]}\) \(\newcommand{\cfivevec}[5]{\left[\begin{array}{c}#1 \\ #2 \\ #3 \\ #4 \\ #5 \\ \end{array}\right]}\) \(\newcommand{\mattwo}[4]{\left[\begin{array}{rr}#1 \amp #2 \\ #3 \amp #4 \\ \end{array}\right]}\) \(\newcommand{\laspan}[1]{\text{Span}\{#1\}}\) \(\newcommand{\bcal}{\cal B}\) \(\newcommand{\ccal}{\cal C}\) \(\newcommand{\scal}{\cal S}\) \(\newcommand{\wcal}{\cal W}\) \(\newcommand{\ecal}{\cal E}\) \(\newcommand{\coords}[2]{\left\{#1\right\}_{#2}}\) \(\newcommand{\gray}[1]{\color{gray}{#1}}\) \(\newcommand{\lgray}[1]{\color{lightgray}{#1}}\) \(\newcommand{\rank}{\operatorname{rank}}\) \(\newcommand{\row}{\text{Row}}\) \(\newcommand{\col}{\text{Col}}\) \(\renewcommand{\row}{\text{Row}}\) \(\newcommand{\nul}{\text{Nul}}\) \(\newcommand{\var}{\text{Var}}\) \(\newcommand{\corr}{\text{corr}}\) \(\newcommand{\len}[1]{\left|#1\right|}\) \(\newcommand{\bbar}{\overline{\bvec}}\) \(\newcommand{\bhat}{\widehat{\bvec}}\) \(\newcommand{\bperp}{\bvec^\perp}\) \(\newcommand{\xhat}{\widehat{\xvec}}\) \(\newcommand{\vhat}{\widehat{\vvec}}\) \(\newcommand{\uhat}{\widehat{\uvec}}\) \(\newcommand{\what}{\widehat{\wvec}}\) \(\newcommand{\Sighat}{\widehat{\Sigma}}\) \(\newcommand{\lt}{<}\) \(\newcommand{\gt}{>}\) \(\newcommand{\amp}{&}\) \(\definecolor{fillinmathshade}{gray}{0.9}\)
    1. Finite Statistics with Student \(t\) distribution
      1. Normal distribution assumes an infinite population, but only limited samples are available for calculating the mean and variance of a sample set
        1. Sample mean \[ \bar{x} = \frac{1}{N} \sum_{i=1}^{N} x_{i} \] \[ \bar{x} \neq x' \ \mathrm{explicitly} \]
        2. Sample variance \[ S_{x}^{2} = \frac{1}{N-1}\sum_{i=1}^{N}(x_{i}-\bar{x})^{2} \]
        3. Since another statistical term was used in \(S_{x}\), the degree of freedom is considered \[\nu = N-c\] where \(N\) is number of samples and \(c\) number of statistical constraints used in calculations
      2. If accidentally substituting sample statistics into Normal distribution \[ p(x) = \frac{1}{S_{x}\sqrt{2\pi}}\mathrm{e}^{\frac{-(x-\bar{x})^{2}}{2S_{x}^{2}}} \] then
        1. Comparison of the distribution of normal (blue) and finite statistics (red)T_distribution_1df_enhanced.svg.png
        2. An overestimate of probability near sample mean
        3. Underestimates the probability far from the tail
        4. More complex function so two-sided Table format is provided
      3. Normalizing the \(t\) variable (similar to \(z_{1}\))
        1. Non-dimensional term \[ t = \frac{x-\bar{x}}{S_{x}} \] depends on two parameters
          1. \(\nu\) degree of freedom (i.e. # of samples)
          2. \(P\) confidence interval of probability
        2. Method of prediction of next selection of random variable \[ x_{N+1} = \bar{x} \pm t_{\nu,P}S_{x}\  (\%\ P)\]
          1. Same confidence interval \(\%\ P\) but fewer samples increases range of uncertainty
          2. Same sample size but smaller \(\%\ P\) reduces range of uncertainty
          3. As \(N\rightarrow \infty\), then \(t_{\nu, P}\rightarrow z_{p}\) since the sample statistics approach infinite terms
      4. Example
        Exercise \(\PageIndex{1}\)

        Example of finite version: Consider the same capacitors but mean (\(\bar{x} = 145\ \mu\)F) and standard deviation (\(S_{x} = 33\ \mu\)F) statistics are determined from 31 samples.

        • What is expected value of the next capacitor with 95% confidence?
        • How would fewer data points \(N<31\) affect that range for the prediction?
        Answer

        From the set of \(\nu = N - 1 = 30\) and confidence interval, the \(t_{30,95\%} = 2.042\). Therefore \[x_{32} = 145 \pm 2.042(33) = 77.6 \longleftrightarrow 212.4\ \mu\mathrm{F}\] 

        If the same mean and standard deviation statistics were collected from \(N=24\) data points, \(t_{23,95\%} = 2.069\) then the results would indicate \[ 76.7 < x_{25} < 213.3\ \mu\mathrm{F}\]

      5. Using the Standard Deviation of the Means
        1. The sample statistics \(\bar{x}\) and \(S_{x}\) provide insight on the true properties \(x'\) and \(\sigma\), respectively
        2. Using the \(N\) samples, the central limit theorem offers a range on the prediction of the true mean: \[x' = \bar{x} \pm t_{\nu,P}S_{\bar{x}} \] where \[S_{\bar{x}} = \frac{S_{x}}{\sqrt{N}}\]
        3. Note the difference between next measurement versus and indication of the true mean value with confidence interval
        4. Offers an uncertainty of \(x'\) from the statistics instead of the sensor
        5. Exercise \(\PageIndex{2}\)

          From the prior example, how large is uncertainty of the true mean value from 24 samples at 99.5\% confidence?

          Answer

          The \(t\) multiplier is found from \(t_{23,99.5\%} = 3.104 \). Hence, the range of the true mean would be \[x' = 145 \pm 3.104\left(\frac{33}{\sqrt{24}}\right) = 145 \pm 20.91\ \mu\mathrm{F}\]

          The width of the uncertainty decreases as the confidence level \(\% P\) similarly decreases. 

    2. Design of Experiments
      1. Numerous methods define the mechanism to choose range of multi-variable experiments
      2. Approaches available from NIST
      3. Other other author (following LibreText sections)

    7.4: Analysis with finite statistics is shared under a CC BY-NC 4.0 license and was authored, remixed, and/or curated by LibreTexts.

    • Was this article helpful?