Data handling: collecting and representing data – Week 5 focus
Download the Lessonotes Mobile South Africa app for faster lesson access on Android and iPhone.
Subject: Mathematical Literacy
Class: Grade 10
Term: Term 4
Week: 5
Theme: General lesson support
This page supports the lesson note with a companion video and a short classroom-ready summary.
For class groups and homework, share this lesson page so learners also get the summary, objectives, and full lesson context.
Data handling is a crucial skill in the 21st century, and especially relevant in South Africa, where understanding statistics and information is essential for informed decision-making. From interpreting crime statistics in your neighbourhood to understanding election results or managing household budgets, the ability to collect, represent, and interpret data empowers you to participate more effectively in society. This week, we will focus on the fundamental aspects of data handling: the various methods of collecting data and different ways to represent this data visually and numerically.
This week focuses on the foundational elements of data handling: collecting raw data and then representing it in meaningful ways. 2.1 Data Collection Methods: Data collection is the process of gathering information. The method you choose depends on what you want to find out.
Here are some common methods: Surveys: Surveys involve asking a group of people questions. They can be done face-to-face (interviews), by phone, by mail, or online.
Advantages:* Can collect a large amount of data quickly. Can be tailored to specific questions. Relatively inexpensive.
Disadvantages: Response rates can be low. People may not answer truthfully (social desirability bias). Questions must be carefully worded to avoid bias. It's crucial to consider the target population (the group you want to learn about) and the sample (the group you actually survey). If the sample is not representative of the target population, your results will be biased.
Example: A municipality wants to know how residents feel about a proposed new bus route. They conduct a telephone survey of 500 randomly selected households within the municipality.
Observations: Observations involve watching and recording behavior. This can be done in a natural setting (e.g., observing traffic patterns at an intersection) or in a controlled environment (e.g., observing how students interact in a classroom).
Advantages:* Can provide rich, detailed data about behavior. Can be used to study phenomena that are difficult to measure in other ways.
Disadvantages:* Can be time-consuming. Observer bias can be a problem (the observer's expectations can influence what they see). Ethical considerations (e.g., privacy) must be addressed.
Example: A researcher observes the number of people using a community library at different times of the day to determine peak usage hours.
Experiments: Experiments involve manipulating one or more variables to see what effect they have on another variable. This is often done in a laboratory setting, but can also be done in the field.
Advantages:* Can establish cause-and-effect relationships.
Disadvantages:* Can be difficult to control all variables. Ethical considerations (e.g., ensuring participants are not harmed) must be addressed. Experiments can be expensive and time consuming.
Example: A farmer tests the effectiveness of different fertilizers on maize yields by applying each fertilizer to different plots of land and comparing the yields at harvest.
Using Existing Data: Sometimes, the data you need already exists! This could include government statistics, census data, school records, or company reports.
Advantages:* Saves time and money. Access to large datasets.
Disadvantages:* Data may not be exactly what you need. You have no control over how the data was collected. Data may be outdated or inaccurate.
Example: A student researches unemployment rates in different provinces using Statistics South Africa (Stats SA) data. 2.2 Data Representation: Once you've collected data, you need to present it in a way that is easy to understand.
Here are some common methods: Bar Graphs: Used to compare categorical data (data that can be divided into groups). The height of each bar represents the frequency or amount for each category.
Example: A bar graph showing the number of learners in each grade at a school.
Pie Charts: Used to show parts of a whole. Each slice represents a proportion of the total. The size of each slice is proportional to the percentage it represents.
Example: A pie chart showing the percentage of a household's income spent on different categories (rent, food, transportation, etc.).
Histograms: Used to show the distribution of continuous data (data that can take on any value within a range). The data is grouped into intervals (bins), and the height of each bar represents the frequency of data within that interval.
Example: A histogram showing the distribution of heights of students in a class.
NOTE: Histograms are different from bar graphs. Histograms show the distribution of data over a continuous range, while bar graphs compare discrete categories. In histograms, the bars touch each other to emphasize the continuous nature of the data.
Line Graphs: Used to show trends over time. The x-axis represents time, and the y-axis represents the value of the variable being measured.
Example: A line graph showing the monthly rainfall in a particular city over a year.
Tables: Organized way to present data using rows and columns.
Example: A table showing the number of reported crimes in different police precincts over a year. 2.3 Measures of Central Tendency: These are ways to describe the "center" of a data set: Mean: The average of all the numbers in a data set. To calculate the mean, add up all the numbers and divide by the total number of numbers.
Formula: Mean = (Sum of all values) / (Number of values)
Example: Find the mean of the following test scores: 70, 80, 90, 60, 100.