STATISTICS FROM A TO Z -- CONFUSING CONCEPTS CLARIFIED
  • Home
    • Why This Book Is Needed
    • Articles List, Additional Concepts
    • Examples: 1-Page Summaries
    • Examples: Concept Flow Diagram
    • Examples: Compare and Contrast Tables
    • Examples: Cartoons
    • Example: Which to Use When Article
  • Buy
  • Blog
  • Sample Articles
  • Videos
  • Author
  • Communicate
  • Files
  • Errata

Statistics Tip of the Week: Use a Dotplot or a Boxplot to picture Variation

4/27/2017

In last week's Tip of the Week, we said that a Histogram is good for picturing the shape of the data. What a Histogram is not good for is picturing Variation, as measured by Standard Deviation or Variance. The size of the range (the bin) for each bar is arbitrary: larger ranges make for fewer bars and a narrower picture. The width of the bars as drawn can also be varied, making the spread appear wider or narrower.

A Dot Plot can be used to picture Variation if the number of data points is relatively small. Each individual point is shown as a dot, and you can show exactly how many go into each bin.
Picture
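As a rough illustration of the idea, here is a minimal Python sketch of a text Dot Plot. It assumes whole-number data, and the function name dot_plot and the data values are made up for this example. A Dot Plot is usually drawn with dots stacked above a number line; this sketch turns it on its side, with one row per value and one "o" per data point. Values with no data points still get a row, so the spread is shown to scale.

```python
from collections import Counter


def dot_plot(values):
    """Return a sideways text Dot Plot: one row per integer value, one 'o' per data point."""
    counts = Counter(values)
    rows = []
    # Include every integer from the smallest to the largest value,
    # so that gaps in the data stay visible.
    for value in range(min(values), max(values) + 1):
        rows.append(f"{value:>4} | {'o' * counts[value]}".rstrip())
    return "\n".join(rows)


# Hypothetical data, made up for illustration
data = [3, 5, 5, 6, 6, 6, 7, 7, 10]
print(dot_plot(data))
```

The longer the column of rows between the first and last dots, the greater the spread of the data.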
Boxplots, also known as Box and Whiskers Plots, can very effectively provide a detailed picture of Variation. In our Nov. 10, 2016 Tip of the Week, we showed how several Box and Whiskers Plots can enable you to visually choose the most effective of several treatments. Here's an illustration of the anatomy of a Box and Whiskers Plot:
Picture
In the example above, the IQR box represents the InterQuartile Range, which is a useful measure of Variation. This plot shows us that 50% of the data points (those between the 25th and 75th Percentiles) were within the range of 40 – 60 centimeters. 25% were below 40 and 25% were above 60. The Median, denoted by the vertical line in the box, is about 48 cm.

Any data point more than 1.5 box lengths (1.5 times the IQR) beyond either end of the box is called an Outlier. Here, the Outlier with a value of 2 cm is shown by a circle. Although not shown above, some plots define an Extreme Outlier as a point more than 3 box lengths beyond the box. Those can be shown by an asterisk.
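The numbers behind a Box and Whiskers Plot can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the data values are made up (chosen to resemble the plot described above), the function name box_plot_summary is hypothetical, and the quartiles use the "inclusive" method of Python's statistics module. Other software may compute quartiles slightly differently.

```python
import statistics


def box_plot_summary(values):
    """Compute the quartiles, the IQR, and the Outliers used by a Box and Whiskers Plot."""
    q1, median, q3 = statistics.quantiles(values, n=4, method="inclusive")
    iqr = q3 - q1  # the "box length"
    # Outlier: more than 1.5 box lengths beyond either end of the box
    outliers = [v for v in values if v < q1 - 1.5 * iqr or v > q3 + 1.5 * iqr]
    # Extreme Outlier: more than 3 box lengths beyond either end of the box
    extreme = [v for v in values if v < q1 - 3 * iqr or v > q3 + 3 * iqr]
    return {
        "q1": q1,
        "median": median,
        "q3": q3,
        "iqr": iqr,
        "outliers": outliers,
        "extreme_outliers": extreme,
    }


# Hypothetical measurements in cm, made up for illustration
data = [2, 35, 40, 44, 48, 52, 60, 65, 72]
summary = box_plot_summary(data)
print(summary)
```

With this made-up data, the box runs from 40 to 60, the Median is 48, and the value of 2 is flagged as an Outlier but not as an Extreme Outlier.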

New Statistics Video: ANOVA Part 4 (of 4): 2-Way (aka 2-Factor)

4/24/2017

The 4th in a playlist of 6 videos about ANOVA and related subjects.  Here is a one-page summary of the Keys to Understanding this concept.

View the video
Picture
See the videos page on this website for a list of available and planned videos.

My interview on Statistics Views

4/22/2017

Picture

Statistics Views is an award-winning website created for professional statisticians, analysts, students, and any user of statistics in interdisciplinary subjects. It provides new articles, interviews, data sources, training materials, links to leading statistics research publications and new blogs, and job opportunities.

Statistics Views recently interviewed me about my book, Statistics from A to Z -- Confusing Concepts Clarified.


Statistics Tip of the Week: What's the difference between a Bar Chart and a Histogram?

4/20/2017

Picture
Both Bar Charts and Histograms use the height of bars (rectangles of the same width) to visually depict data. So, they look similar.
 
But, they
  1. differ in whether the bars are separated or placed together (contiguous)
  2. depict different types of data, and
  3. are used for different purposes
 
1. Separated or contiguous
  • Bar Charts: separated
  • Histograms: contiguous (the example pictured appears to not be contiguous only because the count is zero for the bins 10-19 and 20-29).
 
2. Types of data
  • Bar Charts: Counts or Percentages of Nominal (also known as "Categorical") Variables. These Variables have names as their values (e.g. "chocolate", "vanilla", "strawberry"), not numbers.
  • Histograms: Counts or Percentages or Probabilities of the number of data points within a Range 
 
3. How Used
  • Bar Charts are used to display relative sizes (of the counts of the named values). 
  • Histograms are used to display the shape of the Distribution of the data. The Histogram pictured above tells us that the data is roughly Normally Distributed.
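To make the difference in the types of data concrete, here is a small Python sketch. It is only an illustration: the function names and the data are made up, and the Histogram part assumes whole-number data no smaller than the starting value. A Bar Chart counts named values, while a Histogram counts data points within contiguous numeric ranges, including ranges whose count is zero.

```python
from collections import Counter


def bar_chart_counts(labels):
    """Bar Chart data: a count for each named (Nominal) value."""
    return dict(Counter(labels))


def histogram_counts(values, bin_width, start=0):
    """Histogram data: counts within contiguous bins [lower, lower + bin_width)."""
    counts = {}
    lower = start
    # Create every bin up to the largest value, so empty bins are kept
    while lower <= max(values):
        counts[(lower, lower + bin_width)] = 0
        lower += bin_width
    for v in values:
        bin_lower = start + ((v - start) // bin_width) * bin_width
        counts[(bin_lower, bin_lower + bin_width)] += 1
    return counts


# Hypothetical data, made up for illustration
flavors = ["chocolate", "vanilla", "chocolate", "strawberry", "chocolate"]
ages = [3, 7, 31, 35, 38, 42]

print(bar_chart_counts(flavors))
print(histogram_counts(ages, bin_width=10))
```

In the Histogram output, the bins starting at 10 and 20 have a count of zero. Drawn as bars, they would leave a gap even though the bins themselves are contiguous.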

#Statistics Tip of the Week: Increase Sample Size to Reduce Sampling Errors

4/13/2017

Picture
All other things being equal, an increase in Sample Size (n) reduces all types of Sampling Errors, including Alpha and Beta Errors and the Margin of Error.

A Sampling "Error" is not a mistake. It is simply the reduction in accuracy to be expected when one makes an estimate based on a portion – a Sample – of the data in a Population or Process. There are several types of Sampling Error.
Two types of Sampling Errors are described in terms of their Probabilities:
  • p is the Probability of an Alpha Error, the Probability of a False Positive.
  • β is the Probability of a Beta Error, the Probability of a False Negative.
A third type, the Margin of Error (MOE), is the width of an interval in the units of the data. It is half the width of a 2-sided Confidence Interval.
All three types of Sampling Error are reduced when the Sample Size is increased.
 
This makes intuitive sense, because a very small Sample is less likely to be representative of the larger Population or Process. The values of Statistics calculated from a much larger Sample are likely to be much closer to the values of the corresponding Population or Process Parameters.
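Here is a rough Python sketch of how the Margin of Error shrinks as the Sample Size grows. It rests on assumptions that are not part of this Tip: we are estimating a Mean, the Standard Deviation is known, and the Normal approximation applies, so that MOE = z × (Standard Deviation) / √n. The function name margin_of_error and the Standard Deviation of 10 are made up for illustration.

```python
import math
from statistics import NormalDist


def margin_of_error(std_dev, n, confidence=0.95):
    """MOE for a Mean: half the width of a 2-sided Confidence Interval.

    Assumes a known Standard Deviation and the Normal approximation.
    """
    # Critical value z for a 2-sided interval (about 1.96 for 95% confidence)
    z = NormalDist().inv_cdf(1 - (1 - confidence) / 2)
    return z * std_dev / math.sqrt(n)


# Hypothetical Standard Deviation of 10, made up for illustration
for n in (25, 100, 400):
    print(n, round(margin_of_error(10, n), 2))
```

Under these assumptions, the Sample Size must be multiplied by 4 to cut the Margin of Error in half, because n appears under a square root.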

For more on p, see my video P, the p-value. In the future, there will also be videos on Alpha and Beta Error, the Margin of Error, and Confidence Intervals. You can subscribe to the channel to be notified.

New Video: ANOVA Part 3 (of 4): 1-Way (aka Single Factor) 

4/10/2017

Here are the 4 Keys to Understanding from this video:
Picture
This is the 3rd of 6 videos, based on content from the book, that I plan to make on ANOVA and related concepts:
  1. ANOVA -- Part 1: What it Does
  2. ANOVA -- Part 2: How it Does It
  3. ANOVA -- Part 3: One-Way (aka Single Factor)
  4. ANOVA -- Part 4: 2-Way (aka 2-Factor)
  5. ANOVA vs Regression
  6. ANOM
The Videos page of this website will maintain the latest status on these.

See my channel for other videos based on content from the book.

Statistics Tip of the Week: Common Cause vs. Special Cause Variation

4/6/2017

Picture
All processes have variation. A process can be said to be "under control", "stable", or "predictable" if the variation is
  • confined within a defined range (Control Charts can tell us that)
and is
  • random, showing no pattern (Run Rules determine this)
 
Such Variation is called Common Cause Variation; it is like random "noise" within an under-control process. Variation which is not Common Cause is called Special Cause Variation. It is a signal that factors outside the process are affecting it.
 
Any Special Cause Variation must be eliminated before one can attempt to narrow the range of Common Cause Variation.
 
Until we eliminate Special Cause Variation, we don't have a process that we can improve. There are factors outside the process which affect it, and that changes the actual process that is happening in ways that we don't know.

Once we know that we have Special Cause Variation, we can use various Root Cause Analysis methods to identify the Special Cause, so that we can eliminate it. Only then can we use process/quality improvement methods like Lean Six Sigma to try to reduce the Common Cause Variation.

Here are some examples of Special Causes of Variation:
  • an equipment malfunction causes occasional spikes in the size of holes drilled
  • an out-of-stock condition causes a customer order to be delayed
  • vibration from a passing train causes a chemical reaction to speed up
  • a temporarily opened window causes the temperature to drop
  • an untrained employee temporarily fills in

Here is an example of a Control Chart. Each point is the Mean of a small Sample of data. The Upper Control Limit (UCL) and the Lower Control Limit (LCL) are usually set at 3 Standard Deviations from the Center Line.
Picture
We see that there is one anomalous Sample Mean outside the Control Limits. This is due to Special Cause Variation. So, we need to do some root cause analysis to determine what caused that. And we need to make changes to eliminate it, before we can try to narrow the range of the Control Limits.
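The check against the Control Limits can be sketched in Python as follows. This is a simplified illustration, not how any particular software does it: it assumes the Center Line and the Standard Deviation of the Sample Means are already known from a period when the process was stable, and the function names and data are made up.

```python
def control_limits(center_line, sigma, n_sigmas=3):
    """Return (LCL, UCL), set n_sigmas Standard Deviations from the Center Line."""
    return center_line - n_sigmas * sigma, center_line + n_sigmas * sigma


def points_outside_limits(sample_means, center_line, sigma):
    """Return (position, value) for each Sample Mean outside the Control Limits."""
    lcl, ucl = control_limits(center_line, sigma)
    return [(i, m) for i, m in enumerate(sample_means) if m < lcl or m > ucl]


# Hypothetical values, made up for illustration:
# Center Line of 50 and Standard Deviation of 2 give LCL = 44 and UCL = 56
sample_means = [49.5, 51.2, 50.3, 57.4, 48.8, 50.9]
print(control_limits(50, 2))
print(points_outside_limits(sample_means, center_line=50, sigma=2))
```

Here the fourth Sample Mean, 57.4, falls above the UCL of 56, so it would be the one to investigate with root cause analysis.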

In addition to being within Control Chart limits, the data must be random. There are a number of Run Rules which describe patterns which are not random.  Some patterns are not always easy to spot by eyeballing charts. Fortunately, the same software which produces Control Charts will usually also identify patterns described by the Run Rules.

Here are some common patterns which indicate non-random (Special Cause) Variation. A Sigma is a Standard Deviation.
  • Trend: 6 consecutively increasing or 6 consecutively decreasing points
  • Shift in the Mean: 8 consecutive points on the same side of the Center Line
  • Cycle: 14 consecutive points alternating up and down
  • 2 out of 3 points beyond 2 Sigma and on the same side of the Center Line
  • 4 out of 5 points beyond 1 Sigma and on the same side of the Center Line
  • 15 consecutive points within 1 Sigma of the Center Line
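As an illustration of how software might check the first two patterns above, here is a minimal Python sketch. The function names and the data are made up, and the rules are interpreted in one reasonable way: a Trend means 6 points in a row, each strictly higher (or each strictly lower) than the one before, and a point exactly on the Center Line breaks a run for the Shift rule. Real Control Chart software may define the details differently.

```python
def has_trend(points, run_length=6):
    """True if run_length consecutive points are strictly increasing or strictly decreasing."""
    up = down = 1  # lengths of the current increasing / decreasing runs
    for previous, current in zip(points, points[1:]):
        up = up + 1 if current > previous else 1
        down = down + 1 if current < previous else 1
        if up >= run_length or down >= run_length:
            return True
    return False


def has_shift(points, center_line, run_length=8):
    """True if run_length consecutive points are on the same side of the Center Line."""
    run = 0
    last_side = 0
    for p in points:
        side = (p > center_line) - (p < center_line)  # +1 above, -1 below, 0 on the line
        if side != 0 and side == last_side:
            run += 1
        else:
            run = 1 if side != 0 else 0
        last_side = side
        if run >= run_length:
            return True
    return False


# Hypothetical Sample Means, made up for illustration
trending = [50.1, 50.4, 50.9, 51.3, 51.8, 52.2]
shifted = [50.6, 51.1, 50.3, 50.9, 51.4, 50.2, 50.8, 51.0]
print(has_trend(trending))
print(has_shift(shifted, center_line=50))
```

Both made-up series stay well inside typical Control Limits, which is the point of the Run Rules: a process can be within its limits and still not be random.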
 
Reproduced by permission of John Wiley and Sons, Inc.
from the book, Statistics from A to Z – Confusing Concepts Clarified


You are not alone if you are confused by statistics #9.

4/3/2017

Picture

    Author

    Andrew A. (Andy) Jawlik is the author of the book, Statistics from A to Z -- Confusing Concepts Clarified, published by Wiley.

