STATISTICS FROM A TO Z<br />-- CONFUSING CONCEPTS CLARIFIED
  • Home
    • Why This Book Is Needed
    • Articles List, Additional Concepts
    • Examples: 1-Page Summaries
    • Examples: Concept Flow Diagram
    • Examples: Compare and Contrast Tables
    • Examples: Cartoons
    • Example: Which to Use When Article
  • Buy
  • Blog
  • Sample Articles
  • Videos
  • Author
  • Communicate
  • Files
  • Errata

Statistics Tip of the Week: Nonparametric Methods can use Signs, Ranks, Signed Ranks, etc.

2/24/2018

1 Comment

 
In our Tip of the Week for 12/14/2017, we said that Nonparametric Methods -- like Wilicoxon, Mann-Whitney, and Kruskal-Wallis -- can work with data that Parametric Methods -- like t-tests and ANOVA -- cannot. Nonparametric methods are often called “distribution-free,” because they are free of any assumptions about the source Distribution(s). 

They can do this because they convert the data into things like Signs, Ranks, Signed Ranks, and Ranks Sums. Here's how that works: 


Signs
We’ll be comparing Sample data to a value we specify. It could be a target, a historical value, an industry standard, etc. Let’s say that the historical Median time to complete an operation in an industrial process has been 30 seconds. We collect a Sample of 10 time measurements: 28, 31, 30, 33, 32, 28, 30, 31, 27, 32

  • If a time is less than the Median of 30 seconds, we give it a negative sign.
  • If it is 30 seconds, we give it a zero.
  • If it is greater than 30 seconds, we give it a plus sign. ​
Picture
  • Count of plusses: 5
  • Count of minuses: 3
We can use the Counts of these signs – instead of the original data –
in a Nonparametric method called the Sign Test.

Ranks
Let’s take that same Sample of data, and order it from low to high. Next, assign a Rank from low to high. For ties, split the difference between the values tied. For example, there are two 28’s. These occupy two Ranks after 1 (a 2 and a 3), so we give them both a 2.5. The next Rank would be a 4, but there’s another tie, so we mark the next two as 4.5’s. 
Picture

Signed Ranks
Signed Ranks, as you might guess, combine the concepts of Signs and Ranks. But there is a change in how Signs are assigned, and one step uses absolute values, so we’ll use a different example with some negative numbers.

Let’s say we are doing an analysis of the effect of a training program on employee productivity. If we were doing a Parametric test, we’d use the Paired t-test (aka Dependent Samples t-test.) We count the number of transactions that they process in an hour. For each employee, we subtract their Before Training number from their After Training number. The information we are capturing is the difference.

Instead of plus and minus signs, we’ll use +1 and 0. We compare the data values to a specified value, as we did in our example of the historical Median of 30. Each Sample data value is their After production number minus their Before number.
We’ll be testing the Null Hypothesis that there is zero difference, so the specified value is zero. 


Step 1: Sign: For each data value, assign a Sign:

  • if the data value is greater than the specified value (0 in this example), then the Sign = +1
  •  if it’s less than or equal to the specified value, then:Sign=0

Step 2: Calculate the Absolute Values 

Step 3: Rank the Absolute Values to produce the Absolute Ranks
​

Step 4: Signed Rank: Multiply the Sign times the Absolute Ranks 
Picture
Signed Rank tests are the NP counterpart to the Dependent Samples (aka Paired Samples) t-test. ​
Rank Sum tests are the NP counterpart of the the Independent Samples (aka 2-Samples) t-test. Since the Rank Sum method will require a somewhat lengthy description and this Tip is getting a bit long., We'll save Rank Sums for another Statistics Tip of the Week.
1 Comment

New Video: Sampling Distribution

2/21/2018

0 Comments

 
New video, Sampling Distribution,  on the book's YouTube channel. This is also part of the playlist on Probability Distributions.
Picture
0 Comments

Statistics Tip of the Week: Different Distributions can have Discrete or Continuous probability graphs for Discrete or Continuous Data

2/14/2018

0 Comments

 
These graphs show the difference between a Distribution that has a Discrete data and a Discrete stairstep Probability graph compared to a Distribution with Continuous data and a Continuous smooth curve. 
Picture
For the Discrete data Distribution, the values of the Variable X can only be non-negative integers, because they are Counts. There is no Probability shown for 1.5,  for example, because 1.5 is not an integer, and so it is not a legitimate value for X. The Probabilities for Discrete data Distribution are shown as separate columns. There is nothing between the columns, because there are no values on the horizontal axis between the individual integers.

For Continuous Distributions, values of horizontal-axis Variable are real numbers, and there are an infinite number of them between any two integers. Continuous data are also called Measurement data; examples are length, weight, pressure, etc. The Probabilities for Continuous Distributions are infinitesimal points on smooth curves. 
Picture
​For the first six Distributions described in the table above, the data used to create the values on the horizontal axis come from a single Sample or Population or Process. And the data are either Discrete or Continuous. The F and Chi-Square (𝜒2) Distributions are hybrids. Their horizontal axis Variable is calculated from a ratio of two numbers, and the source data don’t have to be one type or another. Being a ratio, the horizontal axis Variable (F or 𝜒2) is Continuous. The Probability curve is smooth and Continuous. 

For more, see my YouTube video Probability Distributions -- Part 1 (of 3): What They Are. There are also videos on the F Distribution and the Chi-Square Distribution.  See the Videos page of this website for the latest status of available and planned videos.
0 Comments

You are not alone if you are confused by #statistics -- #17

2/12/2018

0 Comments

 
Picture
0 Comments

Statistics Tip of the Week:  Sum of Squares Between (SSB)

2/6/2018

0 Comments

 
​In our Tip of the Week for Jan 25, 2018, we described Sum of Squares Within (SSW) as a measure of Variation in a group (sample, population or process) of data values. In ANOVA, Sum of Squares Between (SSB) is used together with SSW to determine whether there is a Statistically Significant difference among the Means of several groups.
 
Here's a conceptual illustration of Variation Within and Between groups. Each bell-shaped curve represents a group. The widths of the curves represent how much variation there is within each. That is what SSW represents.
Picture
For the variation between Means, ​we calculate the differences between the Means of each group and the Overall Mean.  Then, we square those differences and then we sum those squares. This gives us the Sum of Squares Between, SSB. 
Picture
where X-bar is the Mean of an individual group and x-double-bar is the Overall Mean.
 
ANOVA then calculates a Mean SSB (MSB) and a Mean SSW (MSW). The previous Tip of the Week describes how these are used to calculate the value of the Test Statistic F in the F-test which produces the conclusion from the ANOVA.
 
There is also more on this in my YouTube Video ANOVA-- Part 2, How It Does It. 
0 Comments

    Author

    Andrew A. (Andy) Jawlik is the author of the book, Statistics from A to Z -- Confusing Concepts Clarified, published by Wiley.

    Archives

    March 2021
    December 2020
    November 2020
    October 2020
    September 2020
    August 2020
    May 2020
    March 2020
    February 2020
    January 2020
    December 2019
    November 2019
    October 2019
    September 2019
    July 2019
    June 2019
    May 2019
    April 2019
    March 2019
    February 2019
    January 2019
    December 2018
    November 2018
    October 2018
    September 2018
    August 2018
    July 2018
    June 2018
    May 2018
    April 2018
    March 2018
    February 2018
    January 2018
    December 2017
    November 2017
    October 2017
    September 2017
    August 2017
    July 2017
    June 2017
    May 2017
    April 2017
    March 2017
    February 2017
    January 2017
    December 2016
    November 2016
    October 2016
    September 2016
    August 2016

    Categories

    All
    New Video
    Stats Tip Of The Week
    You Are Not Alone

    RSS Feed

  • Home
    • Why This Book Is Needed
    • Articles List, Additional Concepts
    • Examples: 1-Page Summaries
    • Examples: Concept Flow Diagram
    • Examples: Compare and Contrast Tables
    • Examples: Cartoons
    • Example: Which to Use When Article
  • Buy
  • Blog
  • Sample Articles
  • Videos
  • Author
  • Communicate
  • Files
  • Errata