Strand 1 — Statistics & Probability
Descriptive Statistics
5th Year · 6th Year (Leaving Cert)
- ✓By the end of this lesson students will be able to calculate and interpret measures of central tendency (mean, median, mode) for both ungrouped and grouped data.
- ✓By the end of this lesson students will be able to calculate and interpret measures of spread (range, standard deviation, interquartile range) for both ungrouped and grouped data.
- ✓By the end of this lesson students will be able to determine quartiles (Q1, Q2, Q3) for a given data set.
- ✓By the end of this lesson students will be able to construct and interpret box plots using the five-number summary.
- ✓By the end of this lesson students will be able to choose appropriate descriptive statistics to summarise and compare data sets.
Key concepts
The mean is the sum of all values in a data set divided by the number of values. It is a measure of central tendency. For grouped data, the mid-interval value of each class is used to represent the data in that class.
Grouped: \(\bar{x} = \frac{\sum fx}{\sum f}\)
The median is the middle value of a data set when the data is arranged in ascending or descending order. If there is an even number of values, the median is the average of the two middle values. It is a measure of central tendency that is less affected by outliers than the mean. For grouped data, the median class is the class interval containing the \((\frac{n}{2})^{th}\) value in the cumulative frequency.
The mode is the value that appears most frequently in a data set. A data set can have one mode (unimodal), more than one mode (multimodal), or no mode. For grouped data, the modal class is the class interval with the highest frequency.
The range is the difference between the maximum and minimum values in a data set. It is a simple measure of spread.
The standard deviation is a measure of the average amount of variation or dispersion of data points around the mean. A low standard deviation indicates that data points tend to be close to the mean, while a high standard deviation indicates that data points are spread out over a wider range of values. It is calculated as the square root of the variance.
Grouped: \(\sigma = \sqrt{\frac{\sum f(x - \bar{x})^2}{\sum f}} \text{ or } \sigma = \sqrt{\frac{\sum fx^2}{\sum f} - (\bar{x})^2}\)
Quartiles divide an ordered data set into four equal parts. Q1 (Lower Quartile) is the median of the lower half of the data (25th percentile). Q2 is the median of the entire data set (50th percentile). Q3 (Upper Quartile) is the median of the upper half of the data (75th percentile). The Interquartile Range (IQR) is the difference between the upper and lower quartiles, representing the spread of the middle 50% of the data.
A box plot is a graphical representation of the five-number summary of a data set: minimum value, Q1, median (Q2), Q3, and maximum value. It provides a visual summary of the centre, spread, and skewness of the data.
Key facts to remember
- 1Always order data in ascending or descending order before finding the median or quartiles.
- 2The mean is sensitive to extreme values (outliers), while the median is resistant to them.
- 3The standard deviation measures the typical distance of data points from the mean.
- 4The Interquartile Range (IQR) measures the spread of the middle 50% of the data, making it robust to outliers.
- 5For grouped data, use the mid-interval value to represent each class when calculating mean and standard deviation.
- 6A box plot visually summarises the minimum, Q1, median, Q3, and maximum values, providing insight into data distribution and skewness.
- 7The sum of deviations from the mean, \(\sum (x - \bar{x})\), is always zero.
Worked examples
Example 1
The following are the scores of 10 students in a maths test: 65, 72, 88, 60, 75, 92, 68, 80, 75, 70. Calculate the mean, median, mode, range, and standard deviation for this data set.
Answer
Mean = 74.5, Median = 73.5, Mode = 75, Range = 32, Standard Deviation \(\approx 9.426\)
Always order the data first for median and quartiles. Use your calculator's statistical functions to check your mean and standard deviation.
Example 2
The table below shows the distribution of heights (in cm) of 50 students. Calculate the mean height and the standard deviation of the heights.
Answer
Mean height \(\approx 162.3\) cm, Standard Deviation \(\approx 5.252\) cm
Ensure you use the mid-interval values (x) for calculations involving grouped data, not the class boundaries.
Example 3
The number of goals scored by a football team in their last 15 matches are: 1, 3, 0, 2, 4, 1, 1, 3, 2, 0, 5, 1, 2, 3, 1. Find the five-number summary and draw a box plot for this data.
Answer
Five-number summary: Minimum = 0, Q1 = 1, Median = 2, Q3 = 3, Maximum = 5. IQR = 2. (Box plot would be drawn as described in step 9).
When finding quartiles, if 'n' is odd, the median (Q2) is included in neither the lower nor upper half for finding Q1 and Q3. If 'n' is even, the data is simply split into two equal halves.
Common mistakes
- ✗Not ordering data before calculating the median or quartiles.
- ✗Confusing the formula for population standard deviation with sample standard deviation (though NCCA typically focuses on population unless specified).
- ✗Incorrectly calculating mid-interval values for grouped data.
- ✗Errors in using calculator functions for statistics, or not showing manual steps when required.
- ✗Drawing box plots without a clear, labelled scale or incorrectly plotting the five-number summary points.
Exam tips
- ★Read the question carefully to determine if the data is ungrouped or grouped, as different formulas and methods apply.
- ★Always show your working step-by-step, even if you use a calculator for the final answer. This is crucial for gaining full marks.
- ★Familiarise yourself with your scientific calculator's statistical functions (e.g., 'STAT' mode) to quickly verify your manual calculations for mean and standard deviation.
- ★When drawing graphs like box plots, ensure all axes are labelled, a clear scale is used, and all points of the five-number summary are accurately represented.
Ready to practise?
Try a problem on this topic
Snap a photo or type a question — get step-by-step working instantly.
