Try free →
FHIR / Population

FHIR Bulk Data Access 2026: population-level data for AI scribe analytics, quality reporting, and risk adjustment

May 8, 2026 · 6 min read

FHIR Bulk Data Access (also called Flat FHIR or "$export") lets a clinical organization or authorized analytics vendor pull large slices of patient data efficiently — not one patient at a time, but cohorts, panels, or entire patient populations in a single asynchronous job.

For AI scribe organizations and value-based care teams, this is the data-engineering layer that makes population-level intelligence work. Per-encounter scribe output is one half of the value; population analytics over those structured outputs is the other half.

What FHIR Bulk Data Access provides

The FHIR Bulk Data Access specification (HL7 FHIR R4 / R5) defines three operation patterns:

The output is typically NDJSON (newline-delimited JSON) over HTTPS, asynchronously generated and downloaded once ready. Standard FHIR resources are available: Patient, Encounter, Condition, Observation, MedicationStatement, AllergyIntolerance, Procedure, DocumentReference, etc.

The AI scribe analytics layer

If your organization is running AI scribes at scale, here's what FHIR Bulk Data unlocks:

Use caseFHIR Bulk pullAnalytics output
HCC suspect list generationCondition + Observation (last 24 mo)For each patient: HCCs billed last year not yet billed this year, plus conditions implied by labs
Quality measure performanceProcedure + Observation + DocumentReferenceCancer screening, diabetes A1c, BP control rates by panel
Documentation gap analysisCondition + DocumentReferenceWhich dx are billed but lack supporting note text? Which conditions are mentioned in notes but not coded?
SDoH coverageCondition (Z-codes) + DocumentReferenceWhich patients have SDoH content in notes but no Z-code? Which Z-codes are stale?
AI scribe quality assuranceDocumentReference (AI-tagged) + EncounterNote completion rate, edit-distance from AI draft to signed note, reviewer flag rate
Panel risk stratificationCondition + Observation + ProcedureRAF distribution, HCC capture rate, projected revenue lift from documentation improvement

Why this matters more in 2026

Three reasons:

  1. HCC v28 stricter specificity. RADV audits sample 30 charts per contract and extrapolate. Population-level documentation gap analysis means you find weak charts before the audit does.
  2. Health Equity Index. The 2027 payment year HEI bonus is calculated on 2025-2026 data. Population SDoH coverage is a documentation analytics question, not a per-encounter question.
  3. Value-based contracts. Most MA, ACO, and risk-bearing arrangements in 2026 require quarterly performance reporting. Without bulk FHIR, this is a manual extract job each cycle.

Authorization and security

FHIR Bulk Data Access uses the SMART on FHIR backend services authorization pattern:

BAA chain: practice / organization + EHR vendor + analytics vendor (3-party) at minimum.

The DIY analytics stack on FHIR Bulk

For an organization wanting to run their own analytics on AI-scribed encounters:

  1. FHIR Bulk client (Python / Node). Standard libraries exist for SMART backend services + bulk pull (e.g., python-fhirclient).
  2. NDJSON download + parse. Straightforward streaming parse; resources by type into a relational store or columnar store.
  3. Storage. Postgres or DuckDB for moderate scale; BigQuery or Snowflake for larger.
  4. Analytics. SQL queries for HCC gap, quality measures, SDoH coverage. Dashboards in Metabase / Superset / Looker.
  5. LLM-driven analysis. Periodic LLM passes over note text + structured data to generate clinician-facing reports.

For a 5-clinician practice, this stack is buildable in 4-6 weeks of one developer + analyst time. The output is panel-level intelligence the practice's payer contracts already require.

Vendor matrix — FHIR Bulk Data + AI scribe analytics 2026

VendorWhat they doCost model
InnovaccerPopulation health data platform with FHIR Bulk ingestionEnterprise
Health CatalystHealthcare analytics platformEnterprise
DatavantCross-system data linking and qualityPer-record / enterprise
NavinaHCC-focused analytics with AI scribe overlayPer-PMPM (panel size dependent)
ReveleerRisk adjustment + quality gap closureEnterprise / per-record
DIY (Python + Postgres + Metabase)Build it yourselfDeveloper + analyst time + infrastructure

Limitations and gotchas

Strategic angle for AI scribe organizations

If you're building or running AI scribe at any meaningful scale (more than 50 encounters/day per provider), the per-encounter scribe output is just the input layer. The population analytics layer over those outputs is where ROI is demonstrated to leadership and to payer contracts.

The FHIR Bulk path is the data plumbing for that analytics layer. Without it, you're stuck running per-patient queries that don't scale.

What's coming in 2026 H2

EHR vendor support for FHIR Bulk Data continues to expand through 2026 H2 as part of the ONC-mandated USCDI v3+ adoption. Expect broader resource coverage, more reliable Group-level exports, and tighter SMART backend services standardization. By 2027 most major EHRs should have production-grade Bulk endpoints — making this layer a baseline expectation rather than a differentiator.

Population analytics + AI scribe stack on LessRec

$0.05/min Whisper transcription as the encounter layer. Build your population analytics over FHIR Bulk Data exports for value-based contracts. First 10 minutes free.

Try LessRec free →