Data handling: critiquing reports, graphs and media – Week 2 focus
Download the Lessonotes Mobile South Africa app for faster lesson access on Android and iPhone.
Subject: Mathematical Literacy
Class: Grade 12
Term: 3rd Term
Week: 2
Theme: General lesson support
This page supports the lesson note with a companion video and a short classroom-ready summary.
For class groups and homework, share this lesson page so learners also get the summary, objectives, and full lesson context.
This week, we delve into the crucial skill of critically evaluating reports, graphs, and media that present data. In today’s world, we are bombarded with information, often presented visually or through media reports. Simply accepting this information at face value can be misleading. Understanding how data can be manipulated, misinterpreted, or selectively presented is essential for making informed decisions about our lives, communities, and the country. This ability empowers you to be a discerning consumer of information, challenge biased reporting, and advocate for fair representation of data.
Bias in Data: Bias refers to a systematic error in the data collection or presentation process that leads to an inaccurate or unfair representation of reality.
Bias can arise from various sources: Sampling Bias: Occurs when the sample is not representative of the population. For instance, surveying only residents of wealthy suburbs to gauge national poverty levels would introduce sampling bias.
Example: A survey on cellphone preferences conducted only among university students won't reflect the views of the entire South African population, including the elderly or those in rural areas with limited access.
Response Bias: Occurs when respondents provide inaccurate or untruthful answers. This can be due to social desirability bias (answering in a way that is seen as socially acceptable), recall bias (difficulty remembering past events accurately), or interviewer bias (the interviewer influencing the respondent's answers).
Example: Asking people if they always wear their seatbelts is likely to result in inflated positive responses due to social desirability.
Measurement Bias: Occurs when the measurement instrument or process is flawed.
Example: A questionnaire asking about income brackets that are too broad might not capture income disparities effectively.
Publication Bias: Occurs when studies with positive or statistically significant results are more likely to be published than studies with negative or inconclusive results. This can skew the overall picture of a topic.
Misleading Graphs: Graphs are powerful tools for visualizing data, but they can also be manipulated to mislead viewers.
Truncated Axes: Starting the vertical axis at a value other than zero can exaggerate differences between data points.
Example: A graph showing a slight increase in crime rates that starts the y-axis at 95% instead of 0% will make the increase appear much larger than it actually is.
Inconsistent Scales: Using different scales on different parts of a graph can distort the visual representation of the data.
Selective Data Presentation: Choosing to present only certain data points while omitting others can create a biased picture.
Inappropriate Graph Type: Using a pie chart to represent data that does not sum to 100% or using a bar graph when a line graph would be more appropriate can be misleading.
Missing Labels and Titles: Failing to provide clear labels for axes, units of measurement, or a descriptive title makes it difficult to interpret the graph accurately. Measures of Central Tendency and Dispersion: Mean (Average): The sum of all values divided by the number of values. Sensitive to outliers (extreme values).
Median: The middle value when the data is ordered. Less sensitive to outliers than the mean.
Mode: The most frequent value in the data set.
Range: The difference between the highest and lowest values. A simple measure of dispersion but very sensitive to outliers.
Example: Consider the monthly salaries of 5 employees at a small spaza shop: R2000, R2000, R2500, R3000, R15000 (the owner). Mean = (2000+2000+2500+3000+15000)/5 = R4900 Median = R2500 (the middle value when arranged in ascending order) Mode = R2000 Range = R15000 - R2000 = R13000 In this case, the mean is heavily influenced by the owner's salary and doesn't accurately reflect the typical employee's salary. The median provides a better representation. This illustrates how the choice of measure of central tendency can be manipulated.
Sampling Methods: Random Sampling: Each member of the population has an equal chance of being selected. Minimizes bias.
Stratified Sampling: The population is divided into subgroups (strata), and a random sample is taken from each stratum. Ensures representation of all subgroups.
Convenience Sampling: Selecting participants who are easily accessible. Highly susceptible to bias.
Example: Surveying people at a shopping mall about their spending habits.
Systematic Sampling: Selecting participants at regular intervals from a list.
Example: Choosing every 10th person on a voter registration list.
Cluster Sampling: Dividing the population into clusters and randomly selecting clusters to sample. Useful when the population is geographically dispersed.
Example: Randomly selecting a few schools in a province and surveying all students in those schools.