Epidemiology and Study Designs
Overview
Epidemiology is the core discipline of public health and a high-weightage topic in NEET PG PSM. Questions from this chapter appear consistently every year, with emphasis on study designs, measures of disease frequency, and screening test characteristics. A thorough understanding of epidemiological principles is also essential for answering clinical research-based questions across all subjects.
Definition and Scope of Epidemiology
Definition
“The study of the distribution and determinants of health-related states or events in specified populations, and the application of this study to control health problems.”
— John Last (Dictionary of Epidemiology)
Key Words in the Definition
| Term | Meaning |
|---|---|
| Distribution | Patterns of disease by person, place, and time |
| Determinants | Causes and risk factors of disease |
| Health-related states | Disease, injury, disability, wellness |
| Specified populations | Groups defined by geography, occupation, age, etc. |
| Application/Control | Using findings to improve population health |
Scope of Epidemiology
- Descriptive epidemiology — person, place, time characteristics
- Analytical epidemiology — identifying determinants and testing hypotheses
- Clinical epidemiology — application to individual patient care
- Environmental/occupational epidemiology — study of environmental hazards
- Molecular epidemiology — genetic/molecular markers in disease
- Social epidemiology — social determinants of health
Epidemiological Triad (In Detail)
The epidemiological triad explains disease causation through the interaction of Agent, Host, and Environment.
Components
| Component | Role | Examples |
|---|---|---|
| Agent | Cause of disease | Bacteria, virus, toxin, nutritional deficiency |
| Host | Person who gets the disease | Age, sex, genetics, immunity, behavior |
| Environment | External factors affecting agent/host | Climate, sanitation, vectors, social factors |
Dynamics of Interaction
- Agent must be present in sufficient quantity and virulence
- Host must be susceptible (lack immunity or protective factors)
- Environment must facilitate transmission
- All three components must interact for disease to occur
Chain of Infection
Reservoir → Portal of Exit → Mode of Transmission → Portal of Entry → Susceptible Host
- Reservoir — natural habitat of the agent (human, animal, environment)
- Portal of Exit — how agent leaves (respiratory secretions, blood, feces)
- Mode of Transmission — direct/indirect contact, vectors, airborne, etc.
- Portal of Entry — how agent enters new host (same as portal of exit sites)
NEET PG Tip: Breaking any link in the chain can prevent disease transmission.
Measures of Disease Frequency
This is one of the most frequently tested areas in NEET PG. Questions on incidence, prevalence, and their calculations appear regularly.
Incidence
New cases of a disease occurring in a population during a specific time period.
| Measure | Formula |
|---|---|
| Incidence Rate | New cases / Population at risk × 1000 (or 100,000) |
| Risk (Cumulative Incidence) | New cases / Total population at start × 100 |
- Numerator = New cases
- Denominator = Population at risk (excluding those already affected)
- Unit = Per time period (usually per year)
- Reflects velocity of disease spread
Prevalence
All cases (old and new) existing in a population at a given point in time or period.
| Measure | Formula |
|---|---|
| Point Prevalence | All cases at a point / Total population × 1000 |
| Period Prevalence | Cases during period / Mid-interval population × 1000 |
- Numerator = All existing cases (old + new)
- Denominator = Total population
- Reflects disease burden in the population
- Affected by: Incidence rate + Duration of disease
Relationship Between Incidence and Prevalence
Prevalence ≈ Incidence × Average Duration of Disease
- For acute diseases (short duration): Prevalence << Incidence
- For chronic diseases (long duration): Prevalence can approach or exceed incidence
- Increasing prevalence with stable incidence suggests improved survival (e.g., HIV/AIDS with antiretroviral therapy)
Memory Aid: I-ncidence = I = New cases
P-revalence = P = All cases (Past + Present)
Attack Rate and Secondary Attack Rate
Attack Rate
Used during disease outbreaks to measure the proportion of exposed people who become ill.
| Formula | Interpretation |
|---|---|
| New cases among exposed / Total exposed × 100 | Proportion of exposed individuals affected |
- Denominator = Total exposed population (not total population)
- Used in outbreak investigations and food poisoning episodes
- Usually expressed as a percentage
Secondary Attack Rate
The proportion of contacts (family members, household contacts) who develop disease after exposure to a primary case.
| Formula | Interpretation |
|---|---|
| New cases among contacts / Total contacts × 100 | Transmission within household |
Key Points:
- Numerator = Secondary cases (not the primary case)
- Denominator = Total susceptible contacts (excluding the primary case)
- High secondary attack rate indicates high infectivity of the disease
- Used to calculate reproductive number (R0)
NEET PG Memory Aid:
Attack Rate = exposed people who get sick
Secondary Attack Rate = contacts of cases who get sick
Study Designs
Classification of Study Designs
Descriptive Studies
├── Case reports/series
├── Cross-sectional studies (prevalence studies)
└── Ecological studies
Analytical Studies
├── Observational
│ ├── Case-control studies
│ └── Cohort studies (prospective & retrospective)
Experimental Studies
├── Randomized Controlled Trials (RCTs)
├── Field trials
└── Community trials
Descriptive Studies
Case Report/Case Series
- Case report = Description of a single patient
- Case series = Description of a group of patients with similar disease
- Purpose: Generate hypotheses; document rare conditions
- Limitation: No control group, cannot establish causality
Cross-Sectional Study (Prevalence Study)
- Measures exposure and outcome simultaneously at one point in time
- Purpose: Estimate prevalence, describe population characteristics
- Data collection: Survey, health examination survey
- Limitation: Cannot establish temporal relationship (cause before effect)
- Cannot calculate incidence directly
Ecological Study
- Analyzes data at the population/group level, not individual level
- Purpose: Generate hypotheses, study population-level associations
- Limitation: Ecological fallacy — cannot infer individual-level relationships
NEET PG Key: Ecological studies are weakest for establishing causality due to ecological fallacy.
Analytical Studies
Case-Control Study
Retrospective design starting with outcome (disease) and looking backward for exposure.
| Feature | Description |
|---|---|
| Direction | Outcome → Exposure (retrospective) |
| Starts with | Cases (diseased) and Controls (non-diseased) |
| Measures | Odds Ratio (OR) |
| Best for | Rare diseases, diseases with long latency |
| Strength | Quick, inexpensive, small sample needed |
| Weakness | Prone to recall bias, selection bias |
Case-Control Study Calculations
| Disease + | Disease - | |
|---|---|---|
| Exposed | a | b |
| Not Exposed | c | d |
- Odds Ratio (OR) = ad / bc
- OR = 1 → No association
- OR > 1 → Positive association (risk factor)
- OR < 1 → Negative association (protective factor)
NEET PG Questions Pattern
Common question: “Which study design is best for studying a rare disease like vinyl chloride-induced hemangiosarcoma?” → Case-control study
Cohort Study
Prospective or Retrospective design starting with exposure and following forward to observe outcome.
| Feature | Description |
|---|---|
| Direction | Exposure → Outcome (prospective) |
| Starts with | Exposed and unexposed groups |
| Measures | Relative Risk (RR), Risk Ratio, Rate Ratio |
| Best for | Common diseases, establishing temporal sequence |
| Strength | Establishes temporal relationship; less bias than case-control |
| Weakness | Expensive, time-consuming, large sample needed |
Cohort Study Calculations
| Disease + | Disease - | |
|---|---|---|
| Exposed | a | b |
| Not Exposed | c | d |
- Relative Risk (RR) = [a/(a+b)] / [c/(c+d)]
- Attributable Risk (AR) = Incidence in exposed - Incidence in unexposed
- RR = 1 → No association
- RR > 1 → Risk factor
- RR < 1 → Protective factor
Prospective vs Retrospective Cohort
| Prospective | Retrospective | |
|---|---|---|
| Timing | Forward from exposure | Backward from records |
| Data | Collected as study proceeds | Already available in records |
| Time required | Long (years) | Short |
| Cost | High | Lower |
NEET PG Memory Aid:
Case-Control = Control for Bias, start from Disease (Backward) → CBD
Cohort = Chronological/Forward, starts from Exposure → Common Exposure
Experimental Studies
Randomized Controlled Trial (RCT)
Gold standard for therapeutic interventions. Participants are randomly assigned to intervention or control group.
| Feature | Description |
|---|---|
| Randomization | Eliminates selection bias |
| Control group | Comparison with standard/placebo |
| Blinding | Single, double, triple (reduces observer bias) |
| Measures | Relative Risk Reduction (RRR), Absolute Risk Reduction (ARR), NNT |
| Strength | Strongest evidence for causality |
| Weakness | Expensive, ethical issues, limited generalizability |
Key Measurements in RCTs
| Measure | Formula |
|---|---|
| Relative Risk Reduction (RRR) | (ARR / Control event rate) × 100 |
| Absolute Risk Reduction (ARR) | Control rate - Treatment rate |
| Number Needed to Treat (NNT) | 1 / ARR |
Sensitivity, Specificity, PPV, and NPV
This is a must-know area for NEET PG, frequently tested in both PSM and Medicine.
2×2 Table
| Disease + | Disease - | |
|---|---|---|
| Test + | True Positive (TP) | False Positive (FP) |
| Test - | False Negative (FN) | True Negative (TN) |
Definitions
| Measure | Formula | Meaning |
|---|---|---|
| Sensitivity | TP / (TP+FN) | Ability to detect disease among those with disease |
| Specificity | TN / (TN+FP) | Ability to identify non-disease among those without disease |
| PPV | TP / (TP+FP) | Probability of disease given positive test |
| NPV | TN / (TN+FN) | Probability of no disease given negative test |
Key Properties
| Property | What it measures | Best use |
|---|---|---|
| Sensitivity | Correctly identifies TRUE POSITIVES | Screening tests; rules OUT disease (high SnOut) |
| Specificity | Correctly identifies TRUE NEGATIVES | Confirmatory tests; rules IN disease (high SpIn) |
Memory Aid: Sensitivity → Screen (used for screening)
Specificity → Specify/Confirm (used for confirmation)
Effect of Prevalence on Predictive Values
| Prevalence | Effect |
|---|---|
| High prevalence | PPV increases, NPV decreases |
| Low prevalence | PPV decreases, NPV increases |
| Sensitivity and Specificity | Independent of prevalence |
NEET PG Tip: In low-prevalence populations, even a highly sensitive and specific test can have low PPV — many positive results will be false positives.
Likelihood Ratios
| Measure | Formula | Interpretation |
|---|---|---|
| Positive LR | Sensitivity / (1 - Specificity) | Higher = better; >10 is excellent |
| Negative LR | (1 - Sensitivity) / Specificity | Lower = better; <0.1 is excellent |
Screening Tests: Criteria and Principles
Wilson and Jungner Criteria for Screening
- The condition should be an important health problem
- There should be a recognized latent stage
- There should be a suitable test for early detection
- The test should be acceptable to the population
- Early treatment should be beneficial
- There should be a clear policy on who to treat
- There should be an agreed upon screening program
- The cost of case finding should be economically balanced
Types of Screening
| Type | Description | Example |
|---|---|---|
| Mass screening | Screen entire population | Newborn screening |
| High-risk screening | Screen high-risk groups | Mammography in >40 women |
| Multiphasic screening | Multiple tests simultaneously | Health check-up camps |
| Opportunistic screening | Screen during routine care | BP check during OPD visit |
Characteristics of a Good Screening Test
- Valid (sensitive and specific)
- Reliable (consistent results on repeat testing)
- Simple (easy to perform and interpret)
- Acceptable to the population
- Inexpensive
- Safe (no harm to screened individual)
NEET PG Exam Tips
High-Yield Points
- Incidence vs Prevalence — incidence = new cases, prevalence = all cases
- Case-control → Odds Ratio — cohort → Relative Risk
- Sensitivity and Specificity are independent of prevalence
- PPV and NPV change with prevalence
- RCT is gold standard for therapeutic trials
- Ecological studies are weakest for causality
- High sensitivity = good for screening (rules out disease — SnOut)
- High specificity = good for confirmation (rules in disease — SpIn)
- Secondary attack rate — used for household contacts
- Attributable risk — excess risk in exposed group
Question Patterns
- “Which study design is best for rare disease?” → Case-control
- “Which study design establishes temporal sequence?” → Cohort
- “Which test should be used for screening?” → High sensitivity
- “Which test should be used for confirmation?” → High specificity
- “PPV increases when prevalence _____?” → Increases
Memory Mnemonics
Sensitivity and Specificity independence: Sensitivity and Specificity don’t Preval (independent of prevalence)
Cohort: Common diseases, Establishes Relationship Forward = C-E-R-F
Case-Control: Common in Beginning, Disease Backward = C-B-D-B
Summary Table
| Concept | Key Formula | Use |
|---|---|---|
| Incidence | New cases / Population at risk | Measure disease occurrence (risk) |
| Prevalence | All cases / Total population | Disease burden |
| Attack Rate | Cases among exposed / Total exposed × 100 | Outbreak investigation |
| Secondary Attack Rate | Secondary cases / Total contacts × 100 | Household transmission |
| Relative Risk | (a/(a+b)) / (c/(c+d)) | Cohort studies |
| Odds Ratio | ad/bc | Case-control studies |
| Sensitivity | TP/(TP+FN) | Screening tests |
| Specificity | TN/(TN+FP) | Confirmatory tests |
| PPV | TP/(TP+FP) | Probability of disease with + test |
| NPV | TN/(TN+FN) | Probability of no disease with - test |
Chapter: Epidemiology and Study Designs | Subject: PSM | Exam: NEET PG