Medicare inpatient claims, summarized by procedure and hospital.
The Medicare Inpatient Hospitals by Provider and Service dataset — commonly called the DRG dataset — is published annually by CMS and captures what hospitals billed to Medicare, what Medicare paid, and how many patients were treated, broken down by diagnosis-related group (DRG). Each row represents one hospital performing one procedure in one year.
The dataset covers over 500 DRG codes across thousands of hospitals. Combined with over a decade of annual releases, it builds a detailed picture of where procedures are concentrated, how prices vary across hospitals for identical diagnoses, and how inpatient volume has shifted over time.
A diagnosis-related group is how Medicare pays hospitals.
Medicare does not pay hospitals based on what they charge or how long a patient stays. It pays a fixed amount per admission based on the patient's diagnosis and procedure — the DRG. Every inpatient Medicare claim is assigned one.
Every hospital-DRG pair carries the same three measurements.
Each record combines a volume count with two price points — what the hospital asked for, and what Medicare actually paid. The gap between those two numbers is where a significant portion of the analysis lives.
What hospitals charge and what they receive are two different numbers.
For any given DRG, a hospital might submit a charge of $85,000 and receive a Medicare payment of $18,000. That spread is not a mistake. Submitted charges are a hospital's internal list price — a number largely disconnected from actual cost or expected reimbursement. Medicare payment is determined by the DRG system and adjusted for hospital-specific factors. The gap between them varies widely across hospitals treating the same diagnosis, which makes charge-to-payment ratio a meaningful marker of pricing behavior rather than a simple billing artifact.
Volume and price together answer questions neither answers alone.
Discharge counts and payment data become most useful when combined — across hospitals for the same DRG, or across DRGs within a single hospital.
Three things claims data makes visible.
Medicare inpatient claims going back to 2013.
The dataset covers Medicare fee-for-service inpatient hospitalizations only. Medicare Advantage enrollees, Medicaid, and commercially insured patients are not included. For hospitals with high Medicare Advantage penetration, the dataset will undercount total inpatient volume — but the DRG mix and pricing patterns it reveals remain representative of the hospital's inpatient activity.
Rows with fewer than 11 discharges for a given hospital-DRG-year combination are suppressed by CMS for patient privacy, consistent with the suppression approach used across Medicare public datasets.