Mental Health Screening Tools: PHQ-9, GAD-7, and Other Validated Instruments

Validated screening instruments are among the most consequential pieces of paper in a clinical visit — short enough to complete in a waiting room, specific enough to change the entire direction of a conversation. The PHQ-9, GAD-7, and a handful of other standardized questionnaires form the backbone of mental health screening across primary care, specialty settings, and population health programs in the United States. This page covers what these tools measure, how clinicians interpret them, and where their usefulness ends.


Definition and scope

A mental health screening tool is a standardized questionnaire designed to detect the probability that a person meets criteria for a specific condition — not to diagnose it. That distinction matters more than it might sound. Screening identifies people who warrant further evaluation; diagnosis requires clinical judgment, full history, and often multiple encounters.

The Patient Health Questionnaire-9 (PHQ-9) is the most widely deployed depression screening instrument in US primary care. Developed by Drs. Robert Spitzer, Janet Williams, and Kurt Kroenke with Pfizer funding in the late 1990s, it maps directly to DSM criteria for major depressive disorder across 9 items, each scored 0–3. The maximum score is 27.

The Generalized Anxiety Disorder-7 (GAD-7), published by Spitzer et al. in JAMA Internal Medicine in 2006, applies the same scoring logic to generalized anxiety disorder — 7 items, maximum score 21.

Neither tool belongs exclusively to psychiatry. The United States Preventive Services Task Force (USPSTF) recommends screening adults for depression in clinical settings that have adequate systems in place to support follow-up care. That qualifier is doing real work: a score without a pathway is just a number.

The broader landscape of validated instruments extends well beyond these two. The Columbia Suicide Severity Rating Scale (C-SSRS) is used in crisis settings and emergency mental health to stratify suicide risk. The Edinburgh Postnatal Depression Scale (EPDS) is the standard for maternal mental health screening, with a clinically significant threshold typically set at a score of 10 or higher. The AUDIT-C screens for alcohol use, and the PC-PTSD-5 is a 5-item gateway screen for PTSD and trauma-related disorders.


How it works

The PHQ-9 asks how often, over the past two weeks, the respondent has been bothered by nine symptoms: depressed mood, anhedonia, sleep disturbance, fatigue, appetite changes, feelings of worthlessness, concentration difficulties, psychomotor changes, and suicidal ideation. Each answer runs from 0 ("not at all") to 3 ("nearly every day").

Score interpretation follows a tiered structure:

  1. 1–4: Minimal depression — monitor, no treatment generally indicated
  2. 5–9: Mild depression — watchful waiting or patient education
  3. 10–14: Moderate depression — treatment plan warranted; consider psychotherapy or medication
  4. 15–19: Moderately severe — active treatment, referral typically indicated
  5. 20–27: Severe depression — immediate treatment, possibly inpatient evaluation

The GAD-7 uses identical scoring mechanics. Scores of 5, 10, and 15 represent mild, moderate, and severe anxiety thresholds respectively, per the original Spitzer et al. validation study (Archives of Internal Medicine, 2006, Vol. 166, No. 10).

Both tools report sensitivity and specificity figures in the range of 80–90% for their respective target conditions at the validated cut-points — which means they miss roughly 1 in 10 true cases even under ideal conditions. That limitation is baked into how they're meant to be used: as a first filter, not a final answer.


Common scenarios

Screening typically surfaces in three distinct contexts, and the same tool behaves differently across each.

In primary care, the PHQ-9 and GAD-7 are often administered at annual wellness visits or when a patient presents with vague somatic complaints — fatigue, headaches, GI symptoms — that may have a psychological dimension. The relationship between physical and mental health conditions means primary care physicians frequently encounter depression and anxiety before any mental health professional does.

In pediatric and adolescent settings, the PHQ-A (adolescent version) is validated for ages 12–17 and is recommended by the USPSTF for adolescents aged 12–18 years. The standard adult PHQ-9 is not validated for pediatric use, a distinction that's easy to miss and clinically consequential. Mental health in children and adolescents requires age-appropriate instruments throughout.

In population health and workplace programs, tools like the PHQ-2 — a 2-item ultra-brief precursor to the full PHQ-9 — serve as rapid first-pass screens when time or context doesn't permit a full assessment. A PHQ-2 score of 3 or higher triggers administration of the complete PHQ-9, creating a two-stage funnel that balances efficiency with sensitivity.


Decision boundaries

Every screening instrument has a validity perimeter, and clinicians and patients both benefit from knowing where it sits.

The PHQ-9 and GAD-7 were validated primarily in primary care and obstetric populations. Their psychometric properties in incarcerated populations, people with significant cognitive impairment, or individuals from cultural backgrounds where somatic symptom expression predominates are less well-established. The USPSTF evidence review notes that screening instruments developed and validated in predominantly White samples may perform differently across racial and ethnic minority populations.

Screening tools also carry a fundamental limitation: they measure the past two weeks, not a longitudinal pattern. A person in a two-week window of relative stability may score in the minimal range while living with a chronic condition that significantly affects daily function. Conversely, someone experiencing acute situational distress — a bereavement, a job loss — may score in the moderate range without meeting criteria for a diagnosable disorder.

Item 9 of the PHQ-9 asks specifically about thoughts of death or self-harm. A score of 1 or higher on that item warrants direct clinical follow-up regardless of the total score — a point reinforced in suicide prevention best practices. The C-SSRS or a structured clinical interview typically follows any positive response on this item, because a single screening question cannot adequately stratify suicidal ideation severity.

Finally, a high score is not a mandate for any specific treatment. It is the beginning of a clinical conversation — a door opened, not a decision already made. Finding a mental health provider and establishing what support is feasible remains the step that actually changes outcomes.

References