Consider the following data set. 4; 5; 6; 6; 6; 7; 7; 7; 7; 7; 7; 8; 8; 8; 9; 10 This data set can be represented by following histogram. Each interval has width one, and each value is located in the middle of an interval. The histogram displays a
symmetrical distribution of data. A distribution is symmetrical if a vertical line can be drawn at some point in the histogram such that the shape to the left and the right of the vertical line are mirror images of each other. The mean, the median, and the mode are each seven for these data. In a perfectly symmetrical distribution, the mean and the median are the same. This example has one mode (unimodal), and the mode is the same as the mean and median. In a
symmetrical distribution that has two modes (bimodal), the two modes would be different from the mean and median. The histogram for the data: 4; 5; 6; 6; 6; 7; 7; 7; 7; 8 is not symmetrical. The right-hand side seems "chopped off" compared to the left side. A distribution of this type is called skewed to the left because it is
pulled out to the left. The mean is 6.3, the
median is 6.5, and the mode is seven. Notice that the mean is less than the median, and they are both less than the mode. The mean and the median both reflect the skewing, but the mean reflects it more so. The histogram for the data: 6; 7; 7; 7; 7; 8; 8; 8; 9; 10, is also not symmetrical. It is skewed to the
right. The mean is 7.7, the median is
7.5, and the mode is seven. Of the three statistics, the mean is the largest, while the mode is the smallest. Again, the mean reflects the skewing the most. Generally, if the distribution of data is skewed to the left, the mean is less than the median, which is often less than the mode. If the distribution of data is skewed to the right, the mode is often less than the median, which is less than the mean. Skewness and symmetry become
important when we discuss probability distributions in later chapters. Example \(\PageIndex{1}\) Statistics are used to compare and sometimes identify authors. The following lists shows a simple random sample that compares the letter counts for three authors. Solution Exercise \(\PageIndex{1}\) Discuss the mean, median, and mode for each of the following problems. Is there a pattern between the shape and measure of the center? a. Figure \(\PageIndex{7}\).b.
c. ReviewLooking at the distribution of data can reveal a lot about the relationship between the mean, the median, and the mode. There are three types of distributions. A left (or negative) skewed distribution has a shape like Figure \(\PageIndex{2}\). A right (or positive) skewed distribution has a shape like Figure \(\PageIndex{3}\). A symmetrical distribution looks like Figure \(\PageIndex{1}\). Use the following information to answer the next three exercises: State whether the data are symmetrical, skewed to the left, or skewed to the right. Exercise 2.7.2 1; 1; 1; 2; 2; 2; 2; 3; 3; 3; 3; 3; 3; 3; 3; 4; 4; 4; 5; 5 Answer The data are symmetrical. The median is 3 and the mean is 2.85. They are close, and the mode lies close to the middle of the data, so the data are symmetrical. Exercise 2.7.3 16; 17; 19; 22; 22; 22; 22; 22; 23 Exercise 2.7.4 87; 87; 87; 87; 87; 88; 89; 89; 90; 91 Answer The data are skewed right. The median is 87.5 and the mean is 88.2. Even though they are close, the mode lies to the left of the middle of the data, and there are many more instances of 87 than any other number, so the data are skewed right. Exercise 2.7.5 When the data are skewed left, what is the typical relationship between the mean and median? Exercise 2.7.6 When the data are symmetrical, what is the typical relationship between the mean and median? Answer When the data are symmetrical, the mean and median are close or the same. Exercise 2.7.7 What word describes a distribution that has two modes? Exercise 2.7.8 Describe the shape of this distribution. Figure \(\PageIndex{9}\)Answer The distribution is skewed right because it looks pulled out to the right. Exercise 2.7.9 Describe the relationship between the mode and the median of this distribution. Figure \(\PageIndex{10}\)Exercise 2.7.10 Describe the relationship between the mean and the median of this distribution. Figure \(\PageIndex{11}\)Answer The mean is 4.1 and is slightly greater than the median, which is four. Exercise 2.7.11 Describe the shape of this distribution. Figure \(\PageIndex{12}\)Exercise 2.7.12 Describe the relationship between the mode and the median of this distribution. Figure \(\PageIndex{13}\)Answer The mode and the median are the same. In this case, they are both five. Exercise 2.7.13 Are the mean and the median the exact same in this distribution? Why or why not? Figure \(\PageIndex{14}\)Exercise 2.7.14 Describe the shape of this distribution. Figure \(\PageIndex{15}\)Answer The distribution is skewed left because it looks pulled out to the left. Exercise 2.7.15 Describe the relationship between the mode and the median of this distribution. Figure \(\PageIndex{16}\)Exercise 2.7.16 Describe the relationship between the mean and the median of this distribution. Figure \(\PageIndex{17}\)Answer The mean and the median are both six. Exercise 2.7.17 The mean and median for the data are the same. 3; 4; 5; 5; 6; 6; 6; 6; 7; 7; 7; 7; 7; 7; 7 Is the data perfectly symmetrical? Why or why not? Exercise 2.7.18 Which is the greatest, the mean, the mode, or the median of the data set? 11; 11; 12; 12; 12; 12; 13; 15; 17; 22; 22; 22 Answer The mode is 12, the median is 12.5, and the mean is 15.1. The mean is the largest. Exercise 2.7.19 Which is the least, the mean, the mode, and the median of the data set? 56; 56; 56; 58; 59; 60; 62; 64; 64; 65; 67 Exercise 2.7.20 Of the three measures, which tends to reflect skewing the most, the mean, the mode, or the median? Why? Answer The mean tends to reflect skewing the most because it is affected the most by outliers. Exercise 2.7.21 In a perfectly symmetrical distribution, when would the mode be different from the mean and median? What is located in the middle of all scores?The median is the middle value in distribution when the values are arranged in ascending or descending order. The median divides the distribution in half (there are 50% of observations on either side of the median value).
Is the score that occurs most often in a data set?If a data set has an even number of values, the median is the mean of the two middle values. The mode of a data set is the value that occurs most often.
What are the two types of measures numbers can be grouped into?Data at the highest level: qualitative and quantitative
In short, it's a collection of measurements or observations, divided into two different types: qualitative and quantitative.
What two pieces of information does a deviation score convey?The two pieces of information that a deviation score conveys about a raw score are: i) The sign of the deviation score depicts if the raw score lies above or below the mean. ii) The deviation score's magnitude gives the raw score's distance from the mean.
|