Lesson Notes By Weeks and Term v5 - Grade 11

Data handling: summarising and interpreting data – Week 2 focus

Download the Lessonotes Mobile South Africa app for faster lesson access on Android and iPhone.

Subject: Mathematical Literacy

Class: Grade 11

Term: Term 4

Week: 2

Theme: General lesson support

Lesson Video

This page supports the lesson note with a companion video and a short classroom-ready summary.

For class groups and homework, share this lesson page so learners also get the summary, objectives, and full lesson context.

Performance objectives

Lesson summary

This week, we delve deeper into the world of data handling, focusing on how to summarise and interpret data effectively. In South Africa, understanding data is crucial for making informed decisions in all aspects of life, from understanding government statistics about unemployment to interpreting medical information during a pandemic, or even understanding personal finances and household budgets. Without these skills, people can be easily misled by incorrect assumptions and statistics, impacting their life choices significantly. This week's focus is on going beyond simple calculations and concentrating on what the data means and how to communicate that meaning.

Lesson notes

This week will build upon concepts learned in Grade 10, focusing on more in-depth interpretation and application. 2.1 Measures of Central Tendency: Mean (Average): The sum of all values divided by the number of values.

Ungrouped Data: Sum of values / Number of values.

Grouped Data: Σ(midpoint of interval frequency) / Σ frequency. We use the midpoint of each interval as an approximation for the value of each data point within that interval.

Why it Matters:* The mean gives a sense of the "typical" value.

However, it can be significantly affected by outliers (extreme values).

How to Use it:* Consider the distribution of data before choosing the mean. If outliers exist, the median may be a better representation.

Median (Middle Value): The middle value when the data is arranged in ascending order. If there are an even number of data points, the median is the average of the two middle values.

Ungrouped Data: Arrange the data in order. The median is the (n+1)/2-th term where n is the number of values.

Grouped Data: Use the cumulative frequency to determine the median class (the class containing the median). The median can be estimated using linear interpolation within the median class: L + [((N/2) - CF) / f] w where L is the lower boundary of the median class, N is the total number of data, CF is the cumulative frequency of the class before the median class, f is the frequency of the median class, and w is the width of the median class.

Why it Matters:* The median is resistant to outliers, making it a better measure of central tendency for skewed data.

How to Use it:* Use the median when outliers might distort the mean.

Mode (Most Frequent Value): The value that appears most often in the data set.

Ungrouped Data: Simply count the frequency of each value.

Grouped Data: The modal class is the class with the highest frequency.

Why it Matters:* The mode identifies the most common value or category.

How to Use it:* The mode is particularly useful for categorical data or identifying the most popular option.

Example 1: Mean, Median and Mode (Ungrouped) The following are the monthly water bills (in Rands) of 10 households in a township: 120, 150, 130, 180, 140, 150, 160, 170, 150,

2

5

0. Mean: (120+150+130+180+140+150+160+170+150+250)/10 = R160 Median: First, arrange the data in ascending order: 120, 130, 140, 150, 150, 150, 160, 170, 180,

2

5

0. Since there are 10 values, the median is the average of the 5th and 6th values: (150+150)/2 = R150 Mode: R150 (appears 3 times)

Example 2: Mean (Grouped) The following table shows the distribution of ages of people attending a community health clinic: | Age Group | Frequency | |---|---| | 0-10 | 20 | | 11-20 | 35 | | 21-30 | 45 | | 31-40 | 30 | | 41-50 | 20 | To calculate the mean, we need to find the midpoint of each interval: | Age Group | Frequency | Midpoint | Midpoint * Frequency | |---|---|---|---| | 0-10 | 20 | 5 | 100 | | 11-20 | 35 | 15.5 | 542.5 | | 21-30 | 45 | 25.5 | 1147.5 | | 31-40 | 30 | 35.5 | 1065 | | 41-50 | 20 | 45.5 | 910 | Total Frequency = 150 Sum of (Midpoint * Frequency) = 3765 Mean Age = 3765/150 = 25.1 years 2.2 Measures of Dispersion: These measures describe the spread or variability of data.

Range: The difference between the highest and lowest values in the data set.

Why it Matters:* The range gives a quick indication of the overall spread.

How to Use it:* Useful for a simple overview, but sensitive to outliers.

Quartiles: Values that divide the data into four equal parts when arranged in ascending order.

Q1 (First Quartile): 25th percentile. 25% of the data falls below this value.

Q2 (Second Quartile): 50th percentile. This is the same as the median.

Q3 (Third Quartile): 75th percentile. 75% of the data falls below this value.

Interquartile Range (IQR): The difference between the third quartile (Q3) and the first quartile (Q1). IQR = Q3 - Q1 Why it Matters:* The IQR measures the spread of the middle 50% of the data, making it resistant to outliers.

How to Use it:* A robust measure of spread, particularly when outliers are present.

Percentiles: Values that divide the data into 100 equal parts when arranged in ascending order. For example, the 90th percentile is the value below which 90% of the data falls.

Example 3: Range, Quartiles and IQR (Ungrouped) Using the same water bill data from Example 1: 120, 150, 130, 180, 140, 150, 160, 170, 150,

2

5

0. Ordered data: 120, 130, 140, 150, 150, 150, 160, 170, 180, 250 Range: 250 - 120 = R130 Q1: The median of the lower half of the data (excluding the overall median if the number of data points is odd).

The lower half is: 120, 130, 140, 150,

1

5

0. The median of this is R

1

4

0. So, Q1 = R

1

4

0. Q3: The median of the upper half of the data.

The upper half is: 160, 170, 180,

2

5

0. The median is (160+170)/2 = R

1

7

5. So, Q3 = R

1

7

5. IQR: 175 - 140 = R35 2.3 Box and Whisker Plots: A box and whisker plot (or boxplot) is a visual representation of the data that shows the minimum value, Q1, median (Q2), Q3, and the maximum value.