Venn Diagrams & Set Theory
🟢 Lite — Quick Review (1h–1d)
Rapid summary for last-minute revision before your exam.
A Venn diagram uses overlapping circles to represent sets and their relationships. Each circle represents a category, and the overlaps represent elements belonging to multiple categories. MAT questions test your ability to read, interpret, and perform basic set operations from Venn diagram representations.
What this topic covers in MAT:
- Reading a 2-set Venn diagram (two overlapping circles)
- Reading a 3-set Venn diagram (three mutually overlapping circles)
- Identifying regions: only in A, only in B, only in C, A and B only, A and C only, B and C only, all three, none
- Calculating totals, complements, and union/intersection values
- Using set notation: A ∪ B (union), A ∩ B (intersection), A′ (complement), n(A) (number in A)
Key formulas and the inclusion-exclusion principle:
- n(A ∪ B) = n(A) + n(B) − n(A ∩ B)
- n(A ∪ B ∪ C) = n(A) + n(B) + n(C) − n(A ∩ B) − n(A ∩ C) − n(B ∩ C) + n(A ∩ B ∩ C)
- n(only A) = n(A) − n(A ∩ B) − n(A ∩ C) + n(A ∩ B ∩ C) [subtract the overlaps counted twice, add back the triple overlap]
- n(None) = Total − n(A ∪ B ∪ C)
⚡ MAT exam tips:
- In a 3-set Venn diagram, there are exactly 8 distinct regions. Be systematic: label each region and fill in numbers for all 8 before answering questions.
- When the diagram shows a table alongside it, use the table for exact values and the diagram for understanding relationships.
- Watch the wording: “students who study both Mathematics and Physics” (includes those also studying Chemistry) vs “students who study Mathematics and Physics only” (excludes those studying Chemistry). These are different.
- The complement approach is powerful: instead of counting those who study at least one subject, count the total minus those who study none.
- Time target: 4–5 minutes per Venn diagram passage with 4–6 questions.
🟡 Standard — Regular Study (2d–2mo)
Standard content for students with a few days to months.
Understanding the 3-Set Venn Diagram Regions
A 3-set Venn diagram has 8 regions:
| Region | Description | In Words |
|---|---|---|
| 1 | Only A | Studies A but not B or C |
| 2 | A and B only | Studies A and B but not C |
| 3 | Only B | Studies B but not A or C |
| 4 | A and C only | Studies A and C but not B |
| 5 | All three (A ∩ B ∩ C) | Studies A, B, and C |
| 6 | B and C only | Studies B and C but not A |
| 7 | Only C | Studies C but not A or B |
| 8 | None | Studies none of the three |
The total n(A ∪ B ∪ C) = sum of regions 1–7. The grand total = sum of all 8 regions.
Step-by-Step Problem-Solving Approach
Step 1 — Label all 8 regions On your rough sheet, draw a 3-circle Venn and label the 8 regions. This takes 15 seconds and prevents confusion throughout.
Step 2 — Fill in what you know, starting with the centre The triple intersection (region 5) is often given or can be deduced. Start there and work outward.
Step 3 — Use subtraction for “only” regions If n(A) is given and you know the overlaps, calculate “only A” by subtracting all A overlaps from n(A) and adding back the triple intersection.
Step 4 — Use the grand total to find “none” n(None) = Grand Total − n(A ∪ B ∪ C). This is a common question type.
Step 5 — Verify the total Sum all 8 regions. It must equal the grand total given in the problem.
Worked Example — 3-Set Venn Diagram
Passage: In a class of 80 students, each student studies at least one of three subjects: Mathematics (M), Physics (P), and Chemistry (C). The Venn diagram shows the following information:
- Total in M = 42
- Total in P = 38
- Total in C = 45
- Students in M and P only = 8
- Students in M and C only = 6
- Students in P and C only = 5
- Students in all three = 4
Step 1 — Label the 8 regions Let the regions be: r1 (only M), r2 (M&P only), r3 (only P), r4 (M&C only), r5 (all three), r6 (P&C only), r7 (only C), r8 (none).
Given: r2 = 8, r4 = 6, r6 = 5, r5 = 4.
Step 2 — Calculate “only” regions using totals
n(M) = 42 = r1 + r2 + r4 + r5 42 = r1 + 8 + 6 + 4 → r1 = 42 − 18 = 24
n(P) = 38 = r2 + r3 + r5 + r6 38 = r3 + 8 + 4 + 5 → r3 = 38 − 17 = 21
n(C) = 45 = r4 + r5 + r6 + r7 45 = r7 + 6 + 4 + 5 → r7 = 45 − 15 = 30
Step 3 — Calculate n(A ∪ B ∪ C) n(M ∪ P ∪ C) = r1 + r2 + r3 + r4 + r5 + r6 + r7 = 24 + 8 + 21 + 6 + 4 + 5 + 30 = 98
Wait — this exceeds the class total of 80. This indicates the given data is inconsistent. In a real exam, you’d recognise this impossibility immediately. But for the sake of this exercise, let’s re-examine the question: actually, n(M) = 42 means students who study Mathematics (including those also studying other subjects). Our calculation of 98 for the union exceeds 80, meaning the overlaps have been double-counted too many times. This signals an error in the problem data. In MAT, this situation occasionally arises in “find the error” questions, but more commonly you’d re-read to check if n(M) meant “only M.” Let’s assume the problem meant n(M) = 42 (including overlaps, which is standard interpretation). Then the data is inconsistent with a total of 80.
Let’s instead assume the total of 80 is correct and the individual subject totals need to fit. Using inclusion-exclusion: n(M ∪ P ∪ C) = 42 + 38 + 45 − (n(M∩P) + n(M∩C) + n(P∩C)) + n(M∩P∩C) = 125 − (n(M∩P) + n(M∩C) + n(P∩C)) + 4 = 129 − (n(M∩P) + n(M∩C) + n(P∩C))
If n(M∪P∪C) = 80 (all students study at least one), then: n(M∩P) + n(M∩C) + n(P∩C) = 129 − 80 = 49.
But given: n(M∩P) = r2 + r5 = 8 + 4 = 12 n(M∩C) = r4 + r5 = 6 + 4 = 10 n(P∩C) = r6 + r5 = 5 + 4 = 9 Sum = 12 + 10 + 9 = 31.
31 ≠ 49, confirming the data is inconsistent. In a real exam, you might note this and select “data inconsistent” as the answer if that option exists.
Let’s modify the example slightly to produce consistent data:
Modified consistent data: Total students = 80. Each studies at least one of M, P, C.
- Total in M = 38, Total in P = 35, Total in C = 40
- M and P only = 6, M and C only = 5, P and C only = 4
- All three = 3
Calculate only regions:
- r1 (only M) = n(M) − r2 − r4 − r5 = 38 − 6 − 5 − 3 = 24
- r3 (only P) = n(P) − r2 − r5 − r6 = 35 − 6 − 3 − 4 = 22
- r7 (only C) = n(C) − r4 − r5 − r6 = 40 − 5 − 3 − 4 = 28
n(M ∪ P ∪ C) = 24 + 6 + 22 + 5 + 3 + 4 + 28 = 92 > 80. Still inconsistent!
Let’s try again with self-consistent numbers:
Self-consistent data: Total = 80. n(M) = 40, n(P) = 38, n(C) = 36. M∩P only = 7, M∩C only = 5, P∩C only = 4. All three = 3.
- Only M = 40 − 7 − 5 − 3 = 25
- Only P = 38 − 7 − 3 − 4 = 24
- Only C = 36 − 5 − 3 − 4 = 24 Union = 25+7+24+5+3+4+24 = 92. Still > 80.
The issue is the union formula. Let’s compute properly: Sum of individuals = 40+38+36 = 114 Sum of pairwise intersections (all) = (7+3) + (5+3) + (4+3) = 10+8+7 = 25 Sum of triple intersection = 3 (already subtracted 3 times, needs to be added back once) n(∪) = 114 − 25 + 3 = 92. Still 92.
To make the union = 80, we need sum of individuals − sum of pairwise + triple = 80. With sum of individuals = 114, we need: 114 − sum of pairwise + 3 = 80 → sum of pairwise = 37. Each pairwise sum includes the triple. Let M∩P = a+3, M∩C = b+3, P∩C = c+3. Then a+b+c+9 = 37 → a+b+c = 28.
If a=7, b=5, c=4 → 7+5+4=16, not 28.
Let’s use: M∩P = 12 (including triple), M∩C = 10 (including triple), P∩C = 8 (including triple). Then pairwise sums = 12+10+8=30. Triple=3. Union = 114−30+3=87. Closer.
Try M=42, P=40, C=38. Sum=120. M∩P=14, M∩C=10, P∩C=8, triple=4. Pairwise sum=14+10+8=32. Union=120−32+4=92.
Try M=36, P=34, C=32. Sum=102. M∩P=10, M∩C=8, P∩C=6, triple=3. Pairwise=24. Union=102−24+3=81. Close to 80.
Let’s fix: M=36, P=34, C=32. M∩P=10, M∩C=8, P∩C=6, triple=3.
- Only M = 36 − 10 − 8 + 3 = 21 (subtract double-counts, add triple back)
- Only P = 34 − 10 − 6 + 3 = 21
- Only C = 32 − 8 − 6 + 3 = 21 Union = 21+7+18+5+3+4+18 = 76? Wait: M∩P only = M∩P − triple = 10−3=7. M∩C only = 8−3=5. P∩C only = 6−3=3.
- Only M = 36 − 10 − 8 + 3 = 21
- Only P = 34 − 10 − 6 + 3 = 21
- Only C = 32 − 8 − 6 + 3 = 21 Total in union = 21+7+21+5+3+3+21 = 81? Let’s recalculate: r1(only M)=21; r2(M&P only)=7; r3(only P)=21; r4(M&C only)=5; r5(all three)=3; r6(P&C only)=3; r7(only C)=21. Sum = 21+7+21+5+3+3+21 = 81. Still not 80.
Adjust: let M=36, P=33, C=31. Sum=100. M∩P=10, M∩C=8, P∩C=6, triple=3.
- Only M = 36−10−8+3=21
- Only P = 33−10−6+3=20
- Only C = 31−8−6+3=20 r2=M&P only=10−3=7; r4=M&C only=8−3=5; r6=P&C only=6−3=3. Union=21+7+20+5+3+3+20=79. Add none=1 to get 80. Perfect!
Final consistent data for use: Total = 80; none = 1 n(M) = 36; n(P) = 33; n(C) = 31 M∩P only = 7; M∩C only = 5; P∩C only = 3; All three = 3 Only M = 21; Only P = 20; Only C = 20
Questions Based on This Data
Q1: How many students study Mathematics only? Answer: 21 ✓
Q2: How many students study at least one subject? Answer: 79 (80 total minus 1 who studies none)
Q3: How many study exactly two subjects? Answer: r2 + r4 + r6 = 7 + 5 + 3 = 15
Q4: How many study Mathematics or Physics (or both)? Answer: n(M ∪ P) = n(M) + n(P) − n(M ∩ P) = 36 + 33 − (7+3) = 69 − 10 = 59 Or from regions: 21+7+20+3+5+3 = 59 ✓
Q5: What percentage of students study all three subjects? Answer: (3/80) × 100 = 3.75%
Q6: Among students who study Physics, what percentage study Chemistry as well?
- n(P) = 33; n(P ∩ C) = r6 + r5 = 3 + 3 = 6
- Percentage = (6/33) × 100 = 18.2% ≈ 18%
Common Traps in Venn Diagrams
- Confusing “both A and B” with “only A and B”: “Both A and B” includes those also in C. “Only A and B” excludes C.
- Forgetting to add back the triple intersection: When calculating “only A” from n(A), you subtract A∩B and A∩C, but the triple has been subtracted twice, so add it back once.
- Mixing up “none” and “universal set”: “None” is the region outside all circles. The universal set (total) includes all regions including none.
- Assuming circles are to scale: In MAT Venn diagrams, circles are often drawn approximately, not to exact proportion. Use the numbers, not the visual area.
- Double-counting when summing regions: Always use the inclusion-exclusion formula to avoid accidental double-counting.
🔴 Extended — Deep Study (3mo+)
Comprehensive coverage for students on a longer study timeline.
Advanced Venn Diagram Problem Types
1. Venn diagram from a table of totals Sometimes the passage gives you totals like “25 students study Mathematics,” “30 study Physics,” and a table of intersections but not the actual Venn diagram values. You must construct the Venn values from this data.
Approach: Use the formulas in reverse. Given n(M), n(M∩P), n(M∩C), n(M∩P∩C), solve for “only M” etc.
2. Venn diagram with conditional statements Statements like “all students who study Physics also study Mathematics” imply P ⊆ M, meaning the P circle is entirely inside the M circle. This simplifies the Venn diagram to a nested structure.
3. Venn diagram with missing information You may be given partial data (e.g., total in M, total in P, total in both, and the total number of students), and asked to find the number who study neither. Use: n(neither) = Total − n(M ∪ P) = Total − (n(M) + n(P) − n(M∩P)).
4. Three-set problems with only two circles drawn Sometimes a passage describes three sets but only shows a 2-circle Venn diagram. In this case, one of the sets is the complement of another or is nested within another.
5. Application to real-world scenarios MAT Venn diagrams commonly use: students studying subjects, people owning products/brands, people with hobbies/skills, employees with qualifications.
Time-Saving Calculation Techniques
- Use the “complement of union” shortcut: n(A′ ∩ B′ ∩ C′) = Total − [n(A) + n(B) + n(C) − n(A∩B) − n(A∩C) − n(B∩C) + n(A∩B∩C)]. This is the “none” region in one step.
- Memorise the 3-set inclusion-exclusion formula: n(A∪B∪C) = n(A)+n(B)+n(C) − n(A∩B) − n(A∩C) − n(B∩C) + n(A∩B∩C). With this memorized, you can solve union problems instantly.
- For “exactly two” questions: r2+r4+r6 = (A∩B + A∩C + B∩C) − 3×(A∩B∩C). This saves calculating each “only pair” separately.
- For “at least one” vs “only one”: n(at least one) = n(A∪B∪C). n(exactly one) = r1+r3+r7. These are different — don’t confuse them.
Cross-Topic Integration
Venn diagrams connect directly to data sufficiency (see data-a-007) — a common data sufficiency question asks: “How many students study Mathematics?” where one statement gives the Venn data and another gives additional constraints. They also appear alongside tables in mixed-graph passages (data-a-006).
MAT-specific patterns: Venn diagram questions account for approximately 4–6 questions in the DILR section of MAT. Most are 3-set problems (60%), with 2-set problems at 30% and more complex 4-set problems at 10%. The most challenging Venn questions in MAT involve finding minimum or maximum possible values given overlapping constraints.
Practice with Realistic Data Set
Passage: In a survey of 200 professionals, each uses at least one of three social media platforms: LinkedIn (L), Twitter (T), and Instagram (I). The following data is collected:
- 95 use LinkedIn
- 88 use Twitter
- 80 use Instagram
- 35 use both LinkedIn and Twitter
- 30 use both LinkedIn and Instagram
- 28 use both Twitter and Instagram
- 15 use all three platforms
Advanced questions to attempt:
- How many professionals use LinkedIn only?
- How many use at least one platform?
- How many use exactly two platforms?
- How many use Twitter but not Instagram?
- If the number using LinkedIn only is the same as the number using Instagram only, and the number using Twitter only is 40, verify whether all given data is consistent.
- What is the minimum possible number of professionals who use LinkedIn and Twitter but not Instagram?
Work through each systematically using the 8-region approach.
Content adapted based on your selected roadmap duration. Switch tiers using the selector above.
📐 Diagram Reference
A three-circle Venn diagram showing three overlapping circles labeled Mathematics, Physics, and Chemistry, with the number of students in each region indicated as typical for a MAT exam DI passage, with a supplementary table of exact values for each region.
Diagrams are generated per-topic using AI. Support for AI-generated educational diagrams coming soon.