Skip to main content
Commerce Stream 3% exam weight

Statistics for Business

Part of the A/L Examination (Sri Lanka) study roadmap. Commerce Stream topic commer-013 of Commerce Stream.

Statistics for Business

🟢 Lite — Quick Review (1h–1d)

Rapid summary for last-minute revision before your exam.

Statistics for Business — Key Facts for Sri Lanka A/L Examination

Measures of Central Tendency:

MeasureFormulaBest Use
MeanSum of values / nSymmetric data, no outliers
MedianMiddle value (after sorting)Skewed data, ordinal data
ModeMost frequent valueCategorical data, peak value

Dispersion Measures:

MeasureFormulaWhat it measures
RangeMax - MinSimplest, ignores distribution
VarianceΣ(x-x̄)²/(n-1)Average squared deviation
Standard Deviation√VarianceSame unit as data
Coefficient of Variation(SD/Mean)×100Relative variability

A/L Exam Tip: When a Sri Lankan business question mentions “average,” always ask: mean, median, or mode? They can give very different answers with skewed data!


🟡 Standard — Regular Study (2d–2mo)

Standard content for students with a few days to months.

Statistics for Business — Detailed Study Guide

Data Classification and Presentation

Types of Data:

ClassificationTypeExample
By sourcePrimarySurvey results from Sri Lankan consumers
By sourceSecondaryCensus data, past exam results
By natureQuantitativeMonthly sales in Rs.
By natureQualitativeProduct type (textile, food)
By timeTime seriesTea export prices 2020-2024
By timeCross-sectionalSales of different shops on one day

Classifying Raw Data:

Raw Data: Randomly recorded observations
Frequency Distribution: Summary table showing frequencies

Steps:
1. Find range = Maximum - Minimum
2. Decide number of classes (usually 5-15)
3. Class width = Range / Number of classes
4. Set class boundaries (avoid gaps)
5. Tally and count frequencies

Example - Sri Lankan Business Scenario:

Monthly sales (in Rs. '000) for 40 small businesses:
45, 52, 38, 61, 55, 47, 63, 58, 42, 49
55, 67, 72, 48, 53, 60, 44, 56, 65, 51
58, 63, 71, 49, 54, 46, 62, 57, 43, 50
59, 64, 69, 53, 47, 56, 61, 54, 48, 60

Range = 72 - 38 = 34
Classes (6 classes): 36-41, 42-47, 48-53, 54-59, 60-65, 66-71, 72-77
Width = 34/6 ≈ 6 (use width 6)

Class     | Tally           | Frequency
----------|-----------------|----------
36 - 41   | I               | 1
42 - 47   | IIII I          | 6
48 - 53   | IIII IIII       | 9
54 - 59   | IIII IIII I     | 11
60 - 65   | IIII IIII       | 9
66 - 71   | II              | 3
72 - 77   | I               | 1
          |                 | ---
          |                 | 40

Graphical Presentation:

ChartUseSri Lankan Example
HistogramContinuous data frequencyDistribution of tea export weights
Frequency PolygonComparing distributionsMonthly sales comparison
Bar ChartCategorical dataExports by province
Pie ChartParts of a wholeSri Lanka export composition
OgiveCumulative frequencyBelow/above threshold analysis
Line ChartTime seriesColombo Stock Exchange index over 5 years

Measures of Central Tendency

Mean (Arithmetic Average):

Ungrouped: x̄ = Σx / n
Grouped: x̄ = Σ(f×x) / Σf

Where x = class midpoint, f = frequency

Example (Ungrouped):

Monthly salary (Rs. '000) of 10 employees:
40, 45, 50, 55, 60, 65, 70, 80, 120, 200
Mean = (40+45+50+55+60+65+70+80+120+200) / 10 = 785/10 = Rs. 78,500

Example (Grouped):

Class (Rs.'000) | Midpoint (x) | Freq (f) | f×x
----------------|--------------|----------|------
30 - 39         | 34.5         | 5        | 172.5
40 - 49         | 44.5         | 12       | 534.0
50 - 59         | 54.5         | 18       | 981.0
60 - 69         | 64.5         | 10       | 645.0
70 - 79         | 74.5         | 5        | 372.5
                |              | Σf = 50 | Σfx = 2,705

Mean = 2,705 / 50 = 54.1 (Rs. '000) = Rs. 54,100

Median (Middle Value):

Ungrouped: Arrange data, find middle position
If n is odd: Median = value at position (n+1)/2
If n is even: Median = average of values at n/2 and n/2+1

Grouped: Median = L + [(n/2 - cummf) / f] × w

Where:
L = Lower boundary of median class
cummf = Cumulative frequency before median class
f = Frequency of median class
w = Class width

Example (Grouped):

Same distribution, find median:
Σf = 50, so n/2 = 25th value
Cumulative frequencies: 5, 17, 35, 45, 50
Median class = 50 - 59 (cumulative 17 to 35)

L = 49.5, cummf = 17, f = 18, w = 10
Median = 49.5 + [(25 - 17) / 18] × 10
        = 49.5 + (8/18) × 10
        = 49.5 + 4.44 = Rs. 53,940

Mode:

Ungrouped: Most frequently occurring value

Grouped: Mode = L + [(f1 - f0) / (2f1 - f0 - f2)] × w

Where:
L = Lower boundary of modal class
f1 = Frequency of modal class
f0 = Frequency before modal class
f2 = Frequency after modal class
w = Class width

A/L Key: The mode formula is complex — for A/L, students can identify the modal class (highest frequency) and estimate from there. Exact calculation often not required in basic questions.

Relationship Between Mean, Median, Mode:

DistributionRelationshipSri Lankan Example
Symmetric/NormalMean ≈ Median ≈ ModeTea leaf weights
Positively skewed (right)Mode < Median < MeanIncome distribution
Negatively skewed (left)Mean < Median < ModeFailure rates

Measures of Dispersion

Range:

Range = Maximum Value - Minimum Value

Variance and Standard Deviation:

Population Variance: σ² = Σ(x-μ)² / N
Sample Variance: s² = Σ(x-x̄)² / (n-1)
Standard Deviation: σ or s = √Variance

Shortcut formula for ungrouped data:
s² = [Σx² - (Σx)²/n] / (n-1)

Example:

Sales (Rs. '000) for 5 shops: 50, 55, 60, 65, 70
Mean = 300/5 = 60

Deviations: -10, -5, 0, +5, +10
Squared deviations: 100, 25, 0, 25, 100
Sum = 250

s² = 250 / (5-1) = 62.5
s = √62.5 = 7.91 (Rs. '000)

Interpretation: On average, sales deviate from mean by Rs. 7,910

Coefficient of Variation (CV):

CV = (Standard Deviation / Mean) × 100

Used to compare variability of different datasets
Lower CV = More consistent/stable
Higher CV = More variable/risky

Example:

Shop A: Mean = Rs. 500,000, SD = Rs. 50,000 → CV = 10%
Shop B: Mean = Rs. 200,000, SD = Rs. 40,000 → CV = 20%
Shop A is more consistent despite larger absolute variation

Probability

Basic Probability Rules:

P(A) = Favourable outcomes / Total outcomes
P(A) = 0 to 1 (0 = impossible, 1 = certain)

Addition Rule: P(A or B) = P(A) + P(B) - P(A and B)
Multiplication Rule: P(A and B) = P(A) × P(B) (independent events)

Example - Sri Lankan Business Application:

A garment factory has two production lines:
- Line A produces 60% of output, 2% defective
- Line B produces 40% of output, 5% defective

P(Defective) = P(A)×P(Defective|A) + P(B)×P(Defective|B)
             = (0.60 × 0.02) + (0.40 × 0.05)
             = 0.012 + 0.020 = 0.032 = 3.2%

Expected Value:

Expected Value = Σ(P × Outcome)
E(X) = Σ(xi × Pi)

Used for decision-making under uncertainty

Example:

A Sri Lankan investor is evaluating a business venture:
Outcome      | Probability | Profit (Rs.)
-------------|-------------|--------------
High demand  | 0.25        | 500,000
Medium demand| 0.50        | 150,000
Low demand   | 0.25        | -100,000

EV = (0.25 × 500,000) + (0.50 × 150,000) + (0.25 × -100,000)
   = 125,000 + 75,000 - 25,000 = Rs. 175,000

Decision: Positive EV suggests the venture is profitable on average

🔴 Extended — Deep Study (3mo+)

Comprehensive coverage for students on a longer study timeline.

Statistics for Business — Complete Notes for A/L Sri Lanka

Time Series Analysis

Components of Time Series:

ComponentDescriptionSri Lankan Example
Trend (T)Long-term directionGradual increase in Sri Lanka tea exports
Seasonal (S)Regular patterns within a yearHigher clothing sales Dec-Jan
Cyclical (C)Business cycle fluctuationsGDP growth cycles 5-7 years
Irregular/Random (I)Unpredictable variationsEffect of 2019 Easter attacks on tourism

Additive Model: Y = T + S + C + I Multiplicative Model: Y = T × S × C × I

Moving Averages (for trend extraction):

3-Period Moving Average:
Period 1-3 Average → centres at period 2
Period 2-4 Average → centres at period 3
And so on...

Centered Moving Average (for even periods):
For 4-period MA, must center by averaging two successive MAs

Example:

Year | Sales (Rs.M) | 3-Period MA
-----|--------------|-------------
2020 | 45           | -
2021 | 52           | (45+52+58)/3 = 51.7
2022 | 58           | (52+58+63)/3 = 57.7
2023 | 63           | (58+63+55)/3 = 58.7
2024 | 55           | (63+55+70)/3 = 62.7
2025 | 70           | (55+70+75)/3 = 66.7
2026 | 75           | -

Seasonal Variation (using multiplicative model):

Step 1: Calculate trend (moving average)
Step 2: Calculate Ratio-to-Moving-Average = Actual / Trend
Step 3: Average the ratios for each season (removes irregular)
Step 4: Adjust so seasonal indices average to 1 (or 100)

Seasonal Index > 1: Above trend in that season
Seasonal Index < 1: Below trend in that season

Example - Quarterly Tea Exports:

Quarter | 2023 Ratio | 2024 Ratio | 2025 Ratio | Average | Seasonal Index
--------|------------|------------|------------|---------|--------------
Q1      | 0.85       | 0.88       | 0.90       | 0.877   | 0.88
Q2      | 0.95       | 0.97       | 0.94       | 0.953   | 0.95
Q3      | 1.05       | 1.08       | 1.06       | 1.063   | 1.07
Q4      | 1.15       | 1.12       | 1.18       | 1.150   | 1.15

Interpretation: Q4 shows highest tea export demand (harvest season)
Q1 shows lowest (off-peak period)

Correlation and Regression

Scatter Diagram:

PatternCorrelationSri Lankan Example
Upward slopingPositiveAdvertising spend vs. sales
Downward slopingNegativeDebt ratio vs. profitability
No patternZeroUnrelated variables
Perfect linePerfect correlationRare in real data

Karl Pearson’s Coefficient of Correlation (r):

r = Σ(x-x̄)(y-ȳ) / √[Σ(x-x̄)² × Σ(y-ȳ)²]
r = [nΣxy - ΣxΣy] / √[(nΣx² - (Σx)²)(nΣy² - (Σy)²)]

Interpretation:

r valueInterpretation
+1.0Perfect positive correlation
+0.7 to +0.99Strong positive
+0.4 to +0.69Moderate positive
+0.1 to +0.39Weak positive
0No correlation
-0.1 to -0.39Weak negative
-0.4 to -0.69Moderate negative
-0.7 to -0.99Strong negative
-1.0Perfect negative correlation

Example - Sri Lankan Business:

Advertising (Rs.M) vs. Sales (Rs.M):
x = Advertising: 10, 15, 20, 25, 30
y = Sales: 50, 65, 70, 85, 100

n = 5
Σx = 100, Σy = 370, Σxy = 8,000, Σx² = 2,250, Σy² = 29,450

r = [5(8000) - 100(370)] / √[(5×2250 - 10000)(5×29450 - 136900)]
  = [40000 - 37000] / √[(11250-10000)(147250-136900)]
  = 3000 / √[1250 × 10350]
  = 3000 / √12937500
  = 3000 / 3596.87 = 0.834 (Strong positive correlation)

Linear Regression:

Regression Line: y = a + bx

b = [nΣxy - ΣxΣy] / [nΣx² - (Σx)²]
a = ȳ - b × x̄

Where b = slope (change in y for each unit change in x)
a = intercept (y value when x = 0)

Example:

Using same data:
b = [5(8000) - 100(370)] / [5(2250) - 10000]
  = 3000 / 1250 = 2.4

a = 370/5 - 2.4 × 100/5 = 74 - 2.4 × 20 = 74 - 48 = 26

Regression equation: y = 26 + 2.4x

Interpretation: For every Rs. 1 million increase in advertising,
sales increase by Rs. 2.4 million. With no advertising, base sales = Rs. 26M.

A/L Exam Tip: In A/L questions, regression is often tested with prediction: “If advertising is Rs. 35 million, predict sales.” Sub x=35 into y = a + bx.

Index Numbers

Simple Index:

Price Index (Simple) = (Pn / P0) × 100
Quantity Index (Simple) = (Qn / Q0) × 100

Weighted Index Numbers:

Laspeyres Index (uses base year quantities — common):

PL = Σ(Pn × Q0) / Σ(P0 × Q0) × 100

Advantages: Uses fixed weights, easier to compute
Disadvantages: May overstate price changes (doesn't reflect substitution)

Paasche Index (uses current year quantities):

PP = Σ(Pn × Qn) / Σ(P0 × Qn) × 100

Advantages: Reflects current consumption patterns
Disadvantages: Weights change each period, harder to compare

Fisher’s Ideal Index (geometric mean of Laspeyres and Paasche):

Fisher = √(Laspeyres × Paasche)

Satisfies factor reversal test and time reversal test
Theoretically ideal but rarely used in practice

Example - Sri Lanka Tea Export Prices:

Product  | Base Year         | Current Year
         | Price (Rs./kg) Qty (M kg) | Price (Rs./kg) Qty (M kg)
----------|------------------|--------------------
Tea A    | 400       50     | 500        55
Tea B    | 300       80     | 350        90
Tea C    | 200       40     | 280        35

PL = [(500×50) + (350×80) + (280×40)] / [(400×50) + (300×80) + (200×40)] × 100
   = [25000 + 28000 + 11200] / [20000 + 24000 + 8000] × 100
   = 64200 / 52000 × 100 = 123.46

PP = [(500×55) + (350×90) + (280×35)] / [(400×55) + (300×90) + (200×35)] × 100
   = [27500 + 31500 + 9800] / [22000 + 27000 + 7000] × 100
   = 68800 / 56000 × 100 = 122.86

Fisher = √(123.46 × 122.86) = √15173.8 = 123.18

DeFLation (using index numbers):

Real Value = Nominal Value / Price Index × 100

Used to convert nominal GDP to real GDP
Removes inflation effect

Example:

Sri Lanka GDP:
Nominal GDP 2024: Rs. 25 trillion
Price Index 2024 (base 2015): 180

Real GDP (2015 prices) = 25,000 / 180 × 100 = Rs. 13.89 trillion
This allows comparison with GDP in 2015 base year prices

A/L Key: The difference between Laspeyres and Paasche and knowing when to use each is a classic A/L question. If weights are not changing significantly (or if you’re comparing many periods), Laspeyres is typically used. If consumption patterns change significantly, Paasche may be preferred.

Normal Distribution

Properties:

  • Bell-shaped, symmetric about the mean
  • Mean = Median = Mode
  • Total area under curve = 1 (or 100%)
  • 68% of values within ±1 SD
  • 95% of values within ±2 SD
  • 99.7% of values within ±3 SD

Standard Normal Distribution:

Z = (X - μ) / σ

Where X = value, μ = mean, σ = standard deviation
Z score tells you how many standard deviations from mean

Example:

Monthly revenue of Sri Lankan garment exporters:
Mean = Rs. 50 million, SD = Rs. 8 million

Find probability revenue exceeds Rs. 60 million:
Z = (60 - 50) / 8 = 10/8 = 1.25

From Z table: P(Z > 1.25) = 0.1056
So 10.56% probability revenue exceeds Rs. 60 million

Find probability revenue between Rs. 40M and Rs. 60M:
Z for 40M = (40-50)/8 = -1.25
Z for 60M = 1.25
P(-1.25 < Z < 1.25) = 1 - 2(0.1056) = 0.7888
So 78.88% probability revenue is in this range

A/L Exam Tip: Z-table values are often provided in the A/L exam paper. Always convert to Z-score first, then use the table. The sign of Z matters — positive Z looks up in positive column, negative Z uses symmetry.


Content adapted based on your selected roadmap duration. Switch tiers using the selector above.