Data Sources
Built on official government data. Updated daily.
ProviderSignal aggregates 8 government data sources into one unified provider intelligence platform. Every record traces back to an authoritative origin: CMS, state dental boards, HRSA, Census Bureau, and NPDB. No scraping. No user-submitted data. No guesswork.
NPI/NPPES Registry
Centers for Medicare & Medicaid Services (CMS)
The National Plan and Provider Enumeration System is the federal registry of every healthcare provider in the United States. Each provider receives a unique 10-digit NPI number that serves as the universal cross-reference key across all other data sources. ProviderSignal extracts dental-specific records using taxonomy codes (122300000X and subtypes) and loads the full provider profile: name, address, specialty, entity type (Individual vs Organization), sole proprietor status, parent organization, authorized official, secondary specialties, enumeration date, and state license numbers embedded in the 50 Other Provider Identifier fields.
Methodology: Bulk CSV download (~9.3GB extracted). State license numbers are parsed from Other Provider Identifier slots into JSONB for direct matching against state board records. Weekly diffs detect deactivations, reactivations, and address changes.
TSBDE (Texas State Board of Dental Examiners)
Texas State Board of Dental Examiners
State dental board data provides the enrichment layer that transforms a basic NPI directory entry into an actionable provider intelligence record. TSBDE publishes daily CSV files for dentists, hygienists, and dental assistants. ProviderSignal ingests the dentist file and matches records to NPI entries using a six-tier strategy: exact license number match (from NPI Other Provider Identifier fields), name + ZIP fallback, fuzzy name + city with nickname expansion, relaxed multi-provider address match, name-only with confirmation signals, and a former last name retry for name-change cases.
Methodology: Key fields extracted: license number, status (active/expired/revoked/suspended), expiration date, birth year, graduation year, dental school, county, disciplinary action flag, anesthesia permit levels (I-IV) with dates, nitrous oxide permit, practice description, practice types, and remedial plan status. Daily diff engine compares snapshots to generate license_events for the alert system.
Medicaid Fee Schedules
Texas Medicaid & Healthcare Partnership (TMHP)
State Medicaid fee schedules provide the actual reimbursement rates dental providers receive for procedures billed to Medicaid. Unlike Medicare, which pays $0 for dental (Medicare does not cover dental services), Medicaid programs set state-specific fee-for-service rates for CDT procedure codes. ProviderSignal loads the Texas TMHP fee schedule and the CMS Geographic Practice Cost Index (GPCI) data for all 110 localities nationwide.
Methodology: Rates are stored with facility and non-facility distinctions. GPCI components (work, practice expense, malpractice) are loaded for rate adjustment calculations. The CMS conversion factor ($33.40 for 2026) is applied to RVU values where applicable. Revenue potential estimates combine fee schedule rates with utilization volume from Medicare Part B claims.
NPDB (National Practitioner Data Bank)
U.S. Department of Health and Human Services (HHS)
The National Practitioner Data Bank collects reports on medical malpractice payments and adverse actions taken against healthcare practitioners. The NPDB Public Use Data File is anonymized by federal law and contains no NPI numbers, names, or individually identifiable information. ProviderSignal processes the raw file into aggregate statistics: malpractice payment counts and amounts by state and year, adverse action trends, and basis-of-action breakdowns.
Methodology: Individual provider-level discipline data comes from state dental boards (TSBDE disciplinary action flags), not NPDB. The aggregate NPDB data provides regional risk context: Texas accounts for 6,356 adverse actions across the dataset. State-level trends are displayed on provider profiles as contextual malpractice climate indicators.
HRSA HPSA Designations
Health Resources & Services Administration (HRSA)
Dental Health Professional Shortage Area (HPSA) designations identify geographic areas, populations, or facilities with a shortage of dental providers. HPSA designations carry significant implications: providers practicing in HPSAs may qualify for National Health Service Corps loan repayment, Medicare bonus payments, and J-1 visa waivers. For DSO expansion planning, HPSAs represent underserved markets with built-in federal incentives.
Methodology: Downloaded from data.hrsa.gov CSV export. Each designation includes the HPSA name, score (higher = more severe shortage), designation type (geographic, population, facility), status, and county/ZIP mapping. HPSA overlays appear on the interactive map and factor into whitespace analysis.
Census ACS Demographics
U.S. Census Bureau
The American Community Survey 5-Year Estimates provide ZIP Code Tabulation Area (ZCTA) level demographics essential for market analysis. ProviderSignal loads population counts, median household income, median age, and age distribution breakdowns for every ZIP in the target geography. This data powers the whitespace analysis engine and demographic map overlays.
Methodology: Fetched via Census Bureau API using ACS 5-Year Detailed Tables. Population and income data are combined with provider density calculations to produce provider-per-capita ratios at the ZIP level. ZIPs with high population and low provider counts surface as underserved market opportunities.
CMS Medicare Part B Claims
Centers for Medicare & Medicaid Services (CMS)
Provider-level utilization and payment data from the Medicare Part B program. While most dental services are not covered by Medicare, oral surgeons and medical-benefit dental procedures (e.g., jaw fracture repair, tumor excision) generate Medicare claims. These records serve as a volume proxy: providers who appear in Medicare Part B data are actively treating complex cases and generating insurance-billable procedures.
Methodology: Auto-download pipeline builds URLs for 2021-2023 datasets from data.cms.gov. Records include NPI, HCPCS/CPT code, place of service, beneficiary count, service count, average submitted charge, average Medicare payment, and average allowed amount. Useful for estimating practice volume and procedure mix.
Grants.gov + SAM.gov
Federal grant and contract databases
Grants.gov aggregates federal grant opportunities across all agencies. ProviderSignal searches for dental and oral health-related opportunities and surfaces active funding for community health centers, dental education programs, workforce development, and underserved population outreach. SAM.gov provides federal contract opportunities including dental service contracts for VA facilities, military bases, and federal prisons.
Methodology: Grants.gov REST API v1 (search2 endpoint) filtered for dental/oral health keywords. SAM.gov Opportunities API v2 filtered for dental NAICS codes and keywords. Both are stored in the grant_opportunities table with title, agency, funding amount, application deadline, and direct link to the opportunity page.
Data Freshness & Pipeline Orchestration
ProviderSignal tracks 8 data sources through an automated pipeline orchestrator running on GitHub Actions. Each source has a defined freshness threshold (daily for TSBDE, weekly for NPI, quarterly for CMS and NPDB). When a source exceeds its threshold, the orchestrator automatically triggers a refresh.
The data_source_status table records the last successful ingest timestamp, record count, and error state for every source. The dashboard Data Status page shows real-time pipeline health. All downloads use exponential backoff with jitter and 3 retries. Government servers frequently rate-limit or refuse datacenter IP ranges.