Master the three pillars of data analysis — Mean, Median & Mode — for grouped data, cumulative frequency distributions, and ogives. Everything you need to ace your board exam.
| 1. Introduction 2. Mean of Grouped Data 3. Three Methods for Mean 4. Mode of Grouped Data |
5. Median of Grouped Data 6. Cumulative Frequency & Ogive 7. Worked Examples (10+) 8. Practice Sets A–D |
| 📖 | Section 1 — Introduction |
In your earlier classes, you learned to organise raw data into frequency distributions — both ungrouped and grouped. You also studied how to display data through bar graphs, histograms, and frequency polygons.
You were introduced to three special numbers that summarise a dataset:
| Measure | What it tells you | Symbol |
|---|---|---|
| Mean | The arithmetic average of all values | x̄ |
| Median | The middle value when data is arranged in order | M |
| Mode | The value that appears most frequently | Mo |
In this chapter, we extend these concepts to grouped data — where data is organised into class intervals rather than listed individually. We also explore the cumulative frequency distribution and how to draw curves called ogives.
| 📐 | Section 2 — Mean of Grouped Data |
The mean (or arithmetic average) is the sum of all observed values divided by the total number of observations.
For ungrouped data with values x₁, x₂, …, xₙ having frequencies f₁, f₂, …, fₙ:
When data is grouped into class intervals, we cannot use exact values. Instead, we use the class mark — the midpoint of each interval — to represent all values in that class.
| 🔢 | Section 3 — Three Methods to Calculate Mean |
Simply multiply each class mark xᵢ by its frequency fᵢ, sum all products, then divide by total frequency.
When to use: When the values of xᵢ and fᵢ are small enough to compute easily.
Choose any class mark as an assumed mean ‘a’ (usually the central one). Calculate deviations dᵢ = xᵢ − a for each class.
When to use: When class marks are large numbers — deviations are smaller and easier to work with.
A further simplification: divide the deviations by the class size h to get smaller numbers uᵢ.
When to use: Best when all deviations dᵢ share a common factor (which equals h, the class size).
| 🏆 | Section 4 — Mode of Grouped Data |
The mode is the value that occurs most frequently. For grouped data, we first identify the modal class — the class interval with the highest frequency.
| Symbol | Meaning |
|---|---|
| l | Lower boundary of the modal class |
| f₁ | Frequency of the modal class (highest frequency) |
| f₀ | Frequency of the class before the modal class |
| f₂ | Frequency of the class after the modal class |
| h | Width (size) of the class interval |
| ⚖️ | Section 5 — Median of Grouped Data |
The median divides an ordered dataset into two equal halves. For grouped data, we use cumulative frequencies to locate the median class — the class containing the n/2-th observation.
| Step | What to do |
|---|---|
| 1 | Calculate the cumulative frequency (cf) for each class |
| 2 | Find n/2 (where n = total number of observations) |
| 3 | Identify the class whose cumulative frequency first exceeds n/2 → this is the median class |
| 4 | Apply the median formula |
| Symbol | Meaning |
|---|---|
| l | Lower boundary of the median class |
| n | Total number of observations |
| cf | Cumulative frequency of the class before the median class |
| f | Frequency of the median class |
| h | Class width (size of the interval) |
| 📉 | Section 6 — Cumulative Frequency & Ogive |
A cumulative frequency is a running total — it tells you how many observations fall below a certain value.
|
Less-Than Ogive
Plot points: (upper class limit, cumulative frequency) → Rising curve from bottom-left to top-right |
More-Than Ogive
Plot points: (lower class limit, cumulative frequency) → Falling curve from top-left to bottom-right |
| cf ↑ |
|
||||||||||||||||||||||||||||||
| Values (Class Boundaries) → |
| ✏️ | Section 7 — Worked Examples (10 Solved Problems) |
| 📝 | Section 8 — Practice Sets A, B, C & D |
1. Find the mean of the following distribution:
| Class | 0–8 | 8–16 | 16–24 | 24–32 | 32–40 |
|---|---|---|---|---|---|
| f | 6 | 10 | 14 | 8 | 2 |
2. The mean score is 35. The frequencies for class 20–30 and 30–40 are 10 and f respectively, with total = 50. Find f.
3. Calculate the mean electricity consumption (units) from the table: 0–50: 10 families, 50–100: 20, 100–150: 15, 150–200: 5.
1. x̄ = (6×4 + 10×12 + 14×20 + 8×28 + 2×36) ÷ 40 = (24+120+280+224+72)÷40 = 720÷40 = 18
2. Using x̄ = 35: Σfᵢxᵢ = 1750. Work out to find f = 15
3. x̄ = (10×25 + 20×75 + 15×125 + 5×175) ÷ 50 = 4500÷50 = 90 units
1. Find the mode: Class intervals 10–20, 20–30, 30–40, 40–50, 50–60 with frequencies 4, 10, 22, 12, 6.
2. For a data set, the modal class is 25–35 with f₁ = 18, f₀ = 12, f₂ = 10, h = 10. Find the mode.
3. In a shoe size survey, modal class is size 7–8, f₁ = 40, f₀ = 20, f₂ = 15, h = 1. Find the modal size.
1. Modal class = 30–40 (f₁=22, f₀=10, f₂=12, h=10, l=30). Mode = 30 + [(22−10)÷(44−10−12)]×10 = 30 + (12÷22)×10 = 35.45
2. Mode = 25 + [(18−12)÷(36−12−10)]×10 = 25 + (6÷14)×10 = 25 + 4.29 = 29.29
3. Mode = 7 + [(40−20)÷(80−20−15)]×1 = 7 + 20/45 = 7.44
1. Find the median for: Classes 0–20, 20–40, 40–60, 60–80, 80–100 with frequencies 6, 14, 16, 10, 4 (n = 50).
2. The median of a distribution is 35.4. The cf before the median class is 20, f = 25, h = 10, l = 30. Verify.
3. From the ogive, median is found where both curves intersect at x = 42. State the median.
1. n/2 = 25. cf: 6, 20, 36→ Median class = 40–60. Median = 40 + [(25−20)÷16]×20 = 40 + 6.25 = 46.25
2. Median = 30 + [(25−20)÷25]×10 = 30 + 2 = 32 ≠ 35.4 (check data values)
3. Median = 42 (read directly from ogive intersection)
1. If Mean = 48 and Mode = 45, use the empirical formula to find Median.
2. A data set has Median = 31.5 and Mean = 33. Find the Mode.
3. In a class of 30 students, the frequency distribution of heights (cm) is: 140–145: 5, 145–150: 8, 150–155: 10, 155–160: 5, 160–165: 2. Find (a) mean height (b) modal height.
1. 3M = 45 + 2(48) = 141 → Median = 47
2. Mode = 3(31.5) − 2(33) = 94.5 − 66 = 28.5
3a. x̄ = [5(142.5)+8(147.5)+10(152.5)+5(157.5)+2(162.5)]÷30 = 4530÷30 = 151 cm
3b. Modal class = 150–155 (f=10, f₀=8, f₂=5, h=5). Mode = 150 + [(10−8)÷(20−8−5)]×5 = 150 + (2/7)×5 = 151.43 cm
| 📚 | Chapter Summary — Statistics at a Glance |
|
📊 Mean Formulas
Direct: x̄ = Σfᵢxᵢ ÷ Σfᵢ
Assumed Mean: x̄ = a + Σfᵢdᵢ ÷ Σfᵢ Step-Deviation: x̄ = a + h(Σfᵢuᵢ ÷ Σfᵢ) 📈 Ogive
Less-than: plot (upper limit, cf)
More-than: plot (lower limit, cf) Intersection → Median |
🏆 Mode Formula
Mode = l + [(f₁−f₀) ÷ (2f₁−f₀−f₂)] × h
Modal class = class with highest frequency ⚖️ Median Formula
Median = l + [(n/2 − cf) ÷ f] × h
Empirical: 3 Median = Mode + 2 Mean |
| 🎯 | Exam Quick-Check — 8 Must-Know Points |
| 1 | The class mark is always the midpoint of a class interval = (upper + lower) ÷ 2 |
| 2 | All three methods (direct, assumed mean, step-deviation) give the same mean |
| 3 | For mode: f₁ is the modal class frequency, f₀ is the one before it, f₂ is after it |
| 4 | For median: find n/2, locate the class whose cumulative frequency first exceeds n/2 |
| 5 | Empirical relationship: 3 Median = Mode + 2 Mean (holds approximately) |
| 6 | For ogive: less-than plots upper limits; more-than plots lower limits — both on y-axis cumulative frequency |
| 7 | Mode formula requires equal class widths — always check before applying |
| 8 | Mean is best for comparing distributions; Median is best for skewed data; Mode shows the most common value |
| ✗ | Using actual data values instead of class marks in the mean formula for grouped data |
| ✗ | Forgetting to check if class intervals are continuous before finding mode/median |
| ✗ | Using n instead of n/2 to locate the median class |
| ✗ | Confusing “cf” in median formula — it is cumulative frequency of the class BEFORE the median class, not the median class itself |