STATISTICS FROM A TO Z<br />-- CONFUSING CONCEPTS CLARIFIED
  • Home
    • Why This Book Is Needed
    • Articles List, Additional Concepts
    • Examples: 1-Page Summaries
    • Examples: Concept Flow Diagram
    • Examples: Compare and Contrast Tables
    • Examples: Cartoons
    • Example: Which to Use When Article
  • Buy
  • Blog
  • Sample Articles
  • Videos
  • Author
  • Communicate
  • Files
  • Errata

New Video: Fail to Reject the Null Hypothesis

11/28/2016

0 Comments

 
For many people, "Fail to Reject the Null Hypothesis" is one of the most confusing concepts in statistics. That's not surprising, because that phrase is a triple negative. It's like saying, "I don't not have no money".
Picture
This new video uses a methodical approach and creative graphics to dissect the concept and make it understandable and "rememberable".

This is the 5th and final video in the Hypothesis Testing playlist on my ​​Y​ou Tube channel. See the Videos page on this website for what's next.
Picture
0 Comments

Statistics Tip of the Week: the values of a Categorical Variable are names of categories.

11/23/2016

0 Comments

 
Categorical Variables are used in ANOMA, ANOVA, with Proportions, and in the Chi-Square Tests for Independence and Goodness of Fit. Categorical Variables are also known as "Nominal" (named) Variables and "Attributes" Variables.

​The concept can be confusing, because the values of a Categorical Variable are not numbers, but names of categories. The numbers associated with Categorical Variables come from counts of the data values within a named category.  Here's how it works:
Picture

  • In this example there are two Categorical Variables,  "Gender" and "Ice Cream (flavor)". 
  • The values of the two Categorical Variables are the names of the categories for the Variable.  For example, the Categorical Variable "Gender" has 2 possible values: "female" and "male". 
  • If we're going to use these Variables in a Chi-Square Test for Independence, for example, we need to have some numbers. The numbers are the counts of the data values in each category.  For example, the count of persons whose gender is "female" and whose favorite ice cream flavor is "vanilla" is 25.
0 Comments

My book on statistics is in esteemed company ...

11/17/2016

0 Comments

 
... at least on the display of works by local authors at our town library in Ridgefield, CT. Maurice Sendak, maybe the best and best-known children's book author of his time, lived in Ridgefield until his death in 2012.
Picture
The library has a display of painted cutout images of books by local authors. I have no idea how they decide what to place where.  But it's nice to see my book there with his (even though, of course, I have no illusions that it belongs in that esteemed company).
0 Comments

Statistics Tip of the Week: How the selection of a value for Alpha specifies the Critical Value

11/17/2016

0 Comments

 
In Hypothesis Testing, before the data is collected, a value for Alpha, the Level of Significance,  is selected. The person performing the test selects the value. Most commonly, 5% is selected. 
Picture
Alpha is a Cumulative Probability -- the Probability of a range of values. It is shown as a shaded area under the curve of the Distribution of a Test Statistic, such as z.

If we have Distribution of a Test Statistic and a Cumulative Probability at one or both tails of the curve of the Distribution, software or tables will tell us the value of the Test Statistic which forms the boundary of the Cumulative Probability.

In this concept flow diagram, we show how selecting Alpha = 5% for a one-tailed (right tailed) test results in the Critical Value being 1.645.
0 Comments

New Video: Reject the Null Hypothesis

11/15/2016

0 Comments

 
Picture

A new video Reject the Null Hypothesis is now available. This is the 4th video to be uploaded to my channel on You Tube. It will be added to the playlist for Hypothesis Testing. The 5th and final video in the playlist (Fail to Reject the Null Hypothesis) will be uploaded in 2 weeks. 

For many people, the two possible outcomes of a Hypothesis Test: 
  • Reject the Null Hypothesis, and
  • Fail to Reject the Null Hypothesis
are among the most confusing concepts in statistics. This is not surprising, because Reject the Null Hypothesis is a double negative, and Fail to Reject the Null Hypothesis is a triple negative.

In both videos, we use a marriage proposal scenario -- among other things -- to illustrate the concept. The woman responds in the language of a statistician:
Picture

The Videos page on this website has the latest status of available and planned videos.
0 Comments

Statistics Tip of the Week: Use Boxplots to compare Variation in several datasets.

11/10/2016

0 Comments

 
A Boxplot, also known as Box and Whiskers Plot, is a good way to visually depict Variation in a dataset (e.g., a Sample or Population). And showing several Boxplots vertically is useful for comparing Variation among several datasets.
 
The boxes depict the range within which 50% of the data falls for each dataset.
  • The bottom of the box identifies the 25th percentile (25% of the data is below)
  • The line in the middle is the Median (50th percentile)
  • The top of the box is the 75th percentile
  • The line segments (the "whiskers") at the top and bottom extend to the highest and lowest values of the dataset. The whiskers are drawn to extend only as far as 1.5 box lengths. (If there are no data points that far out, the whisker ends at the farthest point.) Points beyond 1.5 box lengths are termed "Outliers". Points beyond 3 box lengths are called "Extremes" or "Extreme Outliers". 
Picture
​In this illustration, a higher score is better. Treatment A has the highest individual score, but it has considerable more Variation in results than Treatments B and C. The Medians for Treatments A, B, and C are fairly close. So, we can see at a glance that Treatment D can be eliminated from consideration. Treatment B has the highest Median and is gives very consistent results (small Variation). So, this plot may be all we need to select B as the best treatment.
0 Comments

New Video: Hypothesis Testing

11/7/2016

0 Comments

 
Picture

Hypothesis Testing is one of the two main methods of Inferential Statistics (statistics involving Samples). I've just uploaded a new video -- on Hypothesis Testing -- to my YouTube channel Statistics from A to Z -- Confusing Concepts Clarified. 

In the last two weeks, two videos on key concepts within Hypothesis Testing, the Null Hypothesis and the Alternative Hypothesis, were uploaded to the channel. In the coming weeks, two  more videos will be uploaded to complete the playlist on Hypothesis Testing. These are Reject the Null Hypothesis and Fail to Reject the Null Hypothesis. 
Picture

0 Comments

Statistics Tip of the Week: Sampling with Replacement is required when using the Binomial Distribution

11/2/2016

4 Comments

 
One of the requirements for using the Binomial Distribution is that each trial must be independent. One consequence of this is that the Sampling must be With Replacement.

To illustrate this, let's say we are doing a study in a small lake to determine the Proportion of lake trout. Each trial consists of catching and identifying 1 fish. If it's a lake trout, we count 1. The population of the fish is finite. We don't know this, but let's say it's 100 total fish 70 lake trout and 30 other fish. 

Each time we catch a fish, we throw it back before catching another fish. This is called Sampling With Replacement. Then, the Proportion of lake trout is remains at 70%. And the Probability for any one trial is 70% for lake trout. 

Picture

If, on the other hand, we keep each fish we catch, then we are Sampling Without Replacement. Let's say that the first 5 fish which we catch (and keep) are lake trout. Then, there are now 95 fish in the lake, of which 65 are lake trout. The percentage of lake trout is now 65/95 =68.4%. This is a change from the original 70%.

So, we don't have the same Probability each time of catching a lake trout. Sampling Without Replacement has caused the trials to not be independent. So, we can't use the Binomial Distribution. We must use the Hypergeometric Distribution instead.
4 Comments

    Author

    Andrew A. (Andy) Jawlik is the author of the book, Statistics from A to Z -- Confusing Concepts Clarified, published by Wiley.

    Archives

    March 2021
    December 2020
    November 2020
    October 2020
    September 2020
    August 2020
    May 2020
    March 2020
    February 2020
    January 2020
    December 2019
    November 2019
    October 2019
    September 2019
    July 2019
    June 2019
    May 2019
    April 2019
    March 2019
    February 2019
    January 2019
    December 2018
    November 2018
    October 2018
    September 2018
    August 2018
    July 2018
    June 2018
    May 2018
    April 2018
    March 2018
    February 2018
    January 2018
    December 2017
    November 2017
    October 2017
    September 2017
    August 2017
    July 2017
    June 2017
    May 2017
    April 2017
    March 2017
    February 2017
    January 2017
    December 2016
    November 2016
    October 2016
    September 2016
    August 2016

    Categories

    All
    New Video
    Stats Tip Of The Week
    You Are Not Alone

    RSS Feed

  • Home
    • Why This Book Is Needed
    • Articles List, Additional Concepts
    • Examples: 1-Page Summaries
    • Examples: Concept Flow Diagram
    • Examples: Compare and Contrast Tables
    • Examples: Cartoons
    • Example: Which to Use When Article
  • Buy
  • Blog
  • Sample Articles
  • Videos
  • Author
  • Communicate
  • Files
  • Errata