Skip to main content
PSM 3% exam weight

Epidemiology and Study Designs

Part of the NEET PG study roadmap. PSM topic psm-002 of PSM.

Epidemiology and Study Designs

Overview

Epidemiology is the core discipline of public health and a high-weightage topic in NEET PG PSM. Questions from this chapter appear consistently every year, with emphasis on study designs, measures of disease frequency, and screening test characteristics. A thorough understanding of epidemiological principles is also essential for answering clinical research-based questions across all subjects.


Definition and Scope of Epidemiology

Definition

“The study of the distribution and determinants of health-related states or events in specified populations, and the application of this study to control health problems.”
— John Last (Dictionary of Epidemiology)

Key Words in the Definition

TermMeaning
DistributionPatterns of disease by person, place, and time
DeterminantsCauses and risk factors of disease
Health-related statesDisease, injury, disability, wellness
Specified populationsGroups defined by geography, occupation, age, etc.
Application/ControlUsing findings to improve population health

Scope of Epidemiology

  • Descriptive epidemiology — person, place, time characteristics
  • Analytical epidemiology — identifying determinants and testing hypotheses
  • Clinical epidemiology — application to individual patient care
  • Environmental/occupational epidemiology — study of environmental hazards
  • Molecular epidemiology — genetic/molecular markers in disease
  • Social epidemiology — social determinants of health

Epidemiological Triad (In Detail)

The epidemiological triad explains disease causation through the interaction of Agent, Host, and Environment.

Components

ComponentRoleExamples
AgentCause of diseaseBacteria, virus, toxin, nutritional deficiency
HostPerson who gets the diseaseAge, sex, genetics, immunity, behavior
EnvironmentExternal factors affecting agent/hostClimate, sanitation, vectors, social factors

Dynamics of Interaction

  • Agent must be present in sufficient quantity and virulence
  • Host must be susceptible (lack immunity or protective factors)
  • Environment must facilitate transmission
  • All three components must interact for disease to occur

Chain of Infection

Reservoir → Portal of Exit → Mode of Transmission → Portal of Entry → Susceptible Host
  • Reservoir — natural habitat of the agent (human, animal, environment)
  • Portal of Exit — how agent leaves (respiratory secretions, blood, feces)
  • Mode of Transmission — direct/indirect contact, vectors, airborne, etc.
  • Portal of Entry — how agent enters new host (same as portal of exit sites)

NEET PG Tip: Breaking any link in the chain can prevent disease transmission.


Measures of Disease Frequency

This is one of the most frequently tested areas in NEET PG. Questions on incidence, prevalence, and their calculations appear regularly.

Incidence

New cases of a disease occurring in a population during a specific time period.

MeasureFormula
Incidence RateNew cases / Population at risk × 1000 (or 100,000)
Risk (Cumulative Incidence)New cases / Total population at start × 100
  • Numerator = New cases
  • Denominator = Population at risk (excluding those already affected)
  • Unit = Per time period (usually per year)
  • Reflects velocity of disease spread

Prevalence

All cases (old and new) existing in a population at a given point in time or period.

MeasureFormula
Point PrevalenceAll cases at a point / Total population × 1000
Period PrevalenceCases during period / Mid-interval population × 1000
  • Numerator = All existing cases (old + new)
  • Denominator = Total population
  • Reflects disease burden in the population
  • Affected by: Incidence rate + Duration of disease

Relationship Between Incidence and Prevalence

Prevalence ≈ Incidence × Average Duration of Disease
  • For acute diseases (short duration): Prevalence << Incidence
  • For chronic diseases (long duration): Prevalence can approach or exceed incidence
  • Increasing prevalence with stable incidence suggests improved survival (e.g., HIV/AIDS with antiretroviral therapy)

Memory Aid: I-ncidence = I = New cases
P-revalence = P = All cases (Past + Present)


Attack Rate and Secondary Attack Rate

Attack Rate

Used during disease outbreaks to measure the proportion of exposed people who become ill.

FormulaInterpretation
New cases among exposed / Total exposed × 100Proportion of exposed individuals affected
  • Denominator = Total exposed population (not total population)
  • Used in outbreak investigations and food poisoning episodes
  • Usually expressed as a percentage

Secondary Attack Rate

The proportion of contacts (family members, household contacts) who develop disease after exposure to a primary case.

FormulaInterpretation
New cases among contacts / Total contacts × 100Transmission within household

Key Points:

  • Numerator = Secondary cases (not the primary case)
  • Denominator = Total susceptible contacts (excluding the primary case)
  • High secondary attack rate indicates high infectivity of the disease
  • Used to calculate reproductive number (R0)

NEET PG Memory Aid:
Attack Rate = exposed people who get sick
Secondary Attack Rate = contacts of cases who get sick


Study Designs

Classification of Study Designs

Descriptive Studies
├── Case reports/series
├── Cross-sectional studies (prevalence studies)
└── Ecological studies

Analytical Studies
├── Observational
│   ├── Case-control studies
│   └── Cohort studies (prospective & retrospective)

Experimental Studies
├── Randomized Controlled Trials (RCTs)
├── Field trials
└── Community trials

Descriptive Studies

Case Report/Case Series

  • Case report = Description of a single patient
  • Case series = Description of a group of patients with similar disease
  • Purpose: Generate hypotheses; document rare conditions
  • Limitation: No control group, cannot establish causality

Cross-Sectional Study (Prevalence Study)

  • Measures exposure and outcome simultaneously at one point in time
  • Purpose: Estimate prevalence, describe population characteristics
  • Data collection: Survey, health examination survey
  • Limitation: Cannot establish temporal relationship (cause before effect)
  • Cannot calculate incidence directly

Ecological Study

  • Analyzes data at the population/group level, not individual level
  • Purpose: Generate hypotheses, study population-level associations
  • Limitation: Ecological fallacy — cannot infer individual-level relationships

NEET PG Key: Ecological studies are weakest for establishing causality due to ecological fallacy.


Analytical Studies

Case-Control Study

Retrospective design starting with outcome (disease) and looking backward for exposure.

FeatureDescription
DirectionOutcome → Exposure (retrospective)
Starts withCases (diseased) and Controls (non-diseased)
MeasuresOdds Ratio (OR)
Best forRare diseases, diseases with long latency
StrengthQuick, inexpensive, small sample needed
WeaknessProne to recall bias, selection bias

Case-Control Study Calculations

Disease +Disease -
Exposedab
Not Exposedcd
  • Odds Ratio (OR) = ad / bc
  • OR = 1 → No association
  • OR > 1 → Positive association (risk factor)
  • OR < 1 → Negative association (protective factor)

NEET PG Questions Pattern

Common question: “Which study design is best for studying a rare disease like vinyl chloride-induced hemangiosarcoma?”Case-control study


Cohort Study

Prospective or Retrospective design starting with exposure and following forward to observe outcome.

FeatureDescription
DirectionExposure → Outcome (prospective)
Starts withExposed and unexposed groups
MeasuresRelative Risk (RR), Risk Ratio, Rate Ratio
Best forCommon diseases, establishing temporal sequence
StrengthEstablishes temporal relationship; less bias than case-control
WeaknessExpensive, time-consuming, large sample needed

Cohort Study Calculations

Disease +Disease -
Exposedab
Not Exposedcd
  • Relative Risk (RR) = [a/(a+b)] / [c/(c+d)]
  • Attributable Risk (AR) = Incidence in exposed - Incidence in unexposed
  • RR = 1 → No association
  • RR > 1 → Risk factor
  • RR < 1 → Protective factor

Prospective vs Retrospective Cohort

ProspectiveRetrospective
TimingForward from exposureBackward from records
DataCollected as study proceedsAlready available in records
Time requiredLong (years)Short
CostHighLower

NEET PG Memory Aid:
Case-Control = Control for Bias, start from Disease (Backward) → CBD
Cohort = Chronological/Forward, starts from Exposure → Common Exposure


Experimental Studies

Randomized Controlled Trial (RCT)

Gold standard for therapeutic interventions. Participants are randomly assigned to intervention or control group.

FeatureDescription
RandomizationEliminates selection bias
Control groupComparison with standard/placebo
BlindingSingle, double, triple (reduces observer bias)
MeasuresRelative Risk Reduction (RRR), Absolute Risk Reduction (ARR), NNT
StrengthStrongest evidence for causality
WeaknessExpensive, ethical issues, limited generalizability

Key Measurements in RCTs

MeasureFormula
Relative Risk Reduction (RRR)(ARR / Control event rate) × 100
Absolute Risk Reduction (ARR)Control rate - Treatment rate
Number Needed to Treat (NNT)1 / ARR

Sensitivity, Specificity, PPV, and NPV

This is a must-know area for NEET PG, frequently tested in both PSM and Medicine.

2×2 Table

Disease +Disease -
Test +True Positive (TP)False Positive (FP)
Test -False Negative (FN)True Negative (TN)

Definitions

MeasureFormulaMeaning
SensitivityTP / (TP+FN)Ability to detect disease among those with disease
SpecificityTN / (TN+FP)Ability to identify non-disease among those without disease
PPVTP / (TP+FP)Probability of disease given positive test
NPVTN / (TN+FN)Probability of no disease given negative test

Key Properties

PropertyWhat it measuresBest use
SensitivityCorrectly identifies TRUE POSITIVESScreening tests; rules OUT disease (high SnOut)
SpecificityCorrectly identifies TRUE NEGATIVESConfirmatory tests; rules IN disease (high SpIn)

Memory Aid: Sensitivity → Screen (used for screening)
Specificity → Specify/Confirm (used for confirmation)

Effect of Prevalence on Predictive Values

PrevalenceEffect
High prevalencePPV increases, NPV decreases
Low prevalencePPV decreases, NPV increases
Sensitivity and SpecificityIndependent of prevalence

NEET PG Tip: In low-prevalence populations, even a highly sensitive and specific test can have low PPV — many positive results will be false positives.

Likelihood Ratios

MeasureFormulaInterpretation
Positive LRSensitivity / (1 - Specificity)Higher = better; >10 is excellent
Negative LR(1 - Sensitivity) / SpecificityLower = better; <0.1 is excellent

Screening Tests: Criteria and Principles

Wilson and Jungner Criteria for Screening

  1. The condition should be an important health problem
  2. There should be a recognized latent stage
  3. There should be a suitable test for early detection
  4. The test should be acceptable to the population
  5. Early treatment should be beneficial
  6. There should be a clear policy on who to treat
  7. There should be an agreed upon screening program
  8. The cost of case finding should be economically balanced

Types of Screening

TypeDescriptionExample
Mass screeningScreen entire populationNewborn screening
High-risk screeningScreen high-risk groupsMammography in >40 women
Multiphasic screeningMultiple tests simultaneouslyHealth check-up camps
Opportunistic screeningScreen during routine careBP check during OPD visit

Characteristics of a Good Screening Test

  • Valid (sensitive and specific)
  • Reliable (consistent results on repeat testing)
  • Simple (easy to perform and interpret)
  • Acceptable to the population
  • Inexpensive
  • Safe (no harm to screened individual)

NEET PG Exam Tips

High-Yield Points

  1. Incidence vs Prevalence — incidence = new cases, prevalence = all cases
  2. Case-control → Odds Ratio — cohort → Relative Risk
  3. Sensitivity and Specificity are independent of prevalence
  4. PPV and NPV change with prevalence
  5. RCT is gold standard for therapeutic trials
  6. Ecological studies are weakest for causality
  7. High sensitivity = good for screening (rules out disease — SnOut)
  8. High specificity = good for confirmation (rules in disease — SpIn)
  9. Secondary attack rate — used for household contacts
  10. Attributable risk — excess risk in exposed group

Question Patterns

  • “Which study design is best for rare disease?” → Case-control
  • “Which study design establishes temporal sequence?” → Cohort
  • “Which test should be used for screening?” → High sensitivity
  • “Which test should be used for confirmation?” → High specificity
  • “PPV increases when prevalence _____?” → Increases

Memory Mnemonics

Sensitivity and Specificity independence: Sensitivity and Specificity don’t Preval (independent of prevalence)

Cohort: Common diseases, Establishes Relationship Forward = C-E-R-F

Case-Control: Common in Beginning, Disease Backward = C-B-D-B


Summary Table

ConceptKey FormulaUse
IncidenceNew cases / Population at riskMeasure disease occurrence (risk)
PrevalenceAll cases / Total populationDisease burden
Attack RateCases among exposed / Total exposed × 100Outbreak investigation
Secondary Attack RateSecondary cases / Total contacts × 100Household transmission
Relative Risk(a/(a+b)) / (c/(c+d))Cohort studies
Odds Ratioad/bcCase-control studies
SensitivityTP/(TP+FN)Screening tests
SpecificityTN/(TN+FP)Confirmatory tests
PPVTP/(TP+FP)Probability of disease with + test
NPVTN/(TN+FN)Probability of no disease with - test

Chapter: Epidemiology and Study Designs | Subject: PSM | Exam: NEET PG