Skip to main content

Data Dictionary

Schemas for the eight latest_* tables produced by Stage 6 of the ssp_pricing DAG. These are the canonical downstream read targets; intermediate / export_* tables are documented in Methodology.

All tables live in tq_dev.internal_dev_csong_ssp (the DAG's schema param default). Every row is tagged with a version column matching the DAG's ssp_version param. The Stage 6 upsert does DELETE … WHERE version = ssp_version then re-inserts, so each pipeline run replaces its own version's rows without touching other versions.

Except for latest_ssp_descriptions and latest_metadata, the latest_* tables carry only the generic-library row (provider_id = '0' in the underlying export_* table — the national-median Google Sheet view). Per-provider rows stay in the export_* tables.


Column Naming Convention

All price columns follow a consistent prefix + suffix scheme:

Prefix — identifies what is included in the price:

PrefixIncluded componentsFormula
inst_Institutional / facilityinst_price
prof_Institutional + Professional Feeinst_price + prof_professional_price
prof_with_optional_Institutional + Professional Fee + Optional Feeinst_price + prof_all_fees_price

Suffix — identifies the metric:

SuffixDescription
_priceVolume-weighted average commercial canonical rate
_medicare_priceMedicare reference rate from the highest-volume code
_weightprice / base_rate (single base_rate = $500 applied to both IP and OP — DAG param)
_medicare_weightmedicare_price / base_rate
_list_chargeBilled-charge-style estimate (commercial × gross-charge markup ratio)
_avg_chargeAverage line charge from Komodo claims

latest_ssp_descriptions

Grain: one row per (version, ssp_grouper).

SSP-level metadata — descriptions, classification, and encounter / revenue share within the SSP universe. Source: export_ssp_descriptions_v3.

ColumnTypeDescription
versionvarcharSSP pipeline version (ssp_version DAG param)
ssp_groupervarcharLegacy SSP identifier (e.g. CA.0.PCI)
ssp_grouper_idvarcharNew-crosswalk SSP ID
parent_descriptionvarcharParent-level description (from crosswalk)
parent_ssp_short_descriptionvarcharShort label for the parent SSP
parent_xl_descriptionsvarcharExtra-long crosswalk description
consumer_descriptionvarcharConsumer-facing description
disease_areavarcharDisease-area label (from ssp_groupings)
statusvarcharSSP lifecycle status (from ssp_groupings)
posvarcharPlace of service: IP or OP
categoryvarcharSSP category (e.g. I&S, ns_)
shoppabilityvarcharShoppability tier label
medicare_apcvarcharHighest-volume APC / DRG used for Medicare pricing
encounter_pctdoubleShare of total anchored encounters across all SSPs (from 1%-sampled ssp_encounter_volume)
revenue_pctdoubleShare of total estimated revenue across all SSPs

latest_ssp_prices

Grain: one row per (version, ssp_grouper) — generic-library (provider_id = '0') only.

Top-line SSP price summary surfaced in the Google Sheet view. Source: export_ssp_prices_v3 filtered to provider_id = '0'.

ColumnTypeDescription
versionvarcharSSP pipeline version
ssp_groupervarcharLegacy SSP identifier
ssp_grouper_idvarcharNew-crosswalk SSP ID
inst_weightdoubleInstitutional price / base_rate
inst_avg_chargedecimal(38,2)Komodo average institutional line charge for the SSP
prof_weightdouble(Institutional + Professional Fee) price / base_rate
total_weightdouble(Institutional + Professional + Optional) price / base_rate
inst_list_chargedecimal(38,2)Institutional list charge (billed-charge estimate)
primary_list_chargedoublePrimary surgeon list charge
anes_list_chargedoubleAnesthesia list charge
labpath_list_chargedoubleLab/Path list charge
radiology_list_chargedoubleRadiology list charge
provider_namevarcharProvider name (null for the generic row)
provider_cityvarcharProvider city (null for the generic row)
provider_statevarcharProvider state (null for the generic row)
health_system_namevarcharHealth system name (null for the generic row)

latest_facility_rc_families

Grain: one row per (version, ssp_grouper, revenue_code_family) — generic-library only.

How the institutional price is split across revenue-code families (drug/implant carve-outs are flagged). Source: export_facility_rc_families_v3 filtered to provider_id = '0'.

ColumnTypeDescription
versionvarcharSSP pipeline version
ssp_groupervarcharLegacy SSP identifier
ssp_grouper_idvarcharNew-crosswalk SSP ID
revenue_code_familyvarcharRC family (e.g. 0360-0369, 0450-0459)
family_descriptionvarcharHuman-readable family name (e.g. Operating Room Services)
is_carveoutbooleanTRUE if this family is priced as an additive carve-out (implants or qualifying pharmacy)
association_ratedecimal(38,17)Komodo-derived encounter association for this family within the SSP
avg_chargedecimal(18,2)Average line charge used for the carve-out price or the proportional allocation weight
line_codesvarcharComma-joined list of line codes that fed this family

latest_professional_line_items

Grain: one row per (version, ssp_grouper, service_type, line_code) — generic-library only.

Per-line-code professional detail. Source: export_professional_line_items_v3 filtered to provider_id = '0'.

ColumnTypeDescription
versionvarcharSSP pipeline version
ssp_groupervarcharLegacy SSP identifier
ssp_grouper_idvarcharNew-crosswalk SSP ID
service_typevarcharProfessional, Anesthesia, Lab/Path, or Radiology
line_codevarcharHCPCS / CPT line code
association_ratedoubleEncounter share within the anchor (NULL for sub_package_contents rows)
unitsdoubleUnit multiplier applied to the rate (anesthesia time-based, anchor-cap, etc.)
weightdoubleCommercial rate × units / base_rate
list_chargedoubleCommercial rate × markup × units
ccs_categoryvarcharCCS (Clinical Classification Software) category
ncci_groupintegerNCCI cluster index — codes sharing an id are mutually exclusive
line_code_descriptionvarcharHuman-readable description from services_spines_cleaned

latest_professional_conveners

Grain: one row per (version, ssp_grouper, component) — generic-library only.

Professional fee rolled up by convener — the delivering specialty class. Source: export_professional_conveners_v3 filtered to provider_id = '0'.

ColumnTypeDescription
versionvarcharSSP pipeline version
ssp_groupervarcharLegacy SSP identifier
ssp_grouper_idvarcharNew-crosswalk SSP ID
componentvarcharConvener bucket: primary, anes, crna, labpath, radiology, assistant_surgeon, assistant_nonsurgeon
encounter_associationdecimal(38,17)Encounter share where this component fires (from ancillary_encounter_proportions)
weightdoubleComponent price / base_rate
list_chargedoubleComponent billed-charge estimate

latest_sub_category

Grain: one row per (version, ssp_grouper, sub_category) — generic-library only.

Subcategory breakdown (IP severity tiers, OP variant codes). Source: export_sub_category_v3 filtered to provider_id = '0'.

ColumnTypeDescription
versionvarcharSSP pipeline version
ssp_groupervarcharLegacy SSP identifier
sub_categoryvarcharSeverity/variant within the SSP (0 = base)
sub_category_idvarcharNew-crosswalk subcategory ID
sub_category_namevarcharSubcategory human-readable name
posvarcharIP or OP
inst_weightdoubleInstitutional price / base_rate at the subcategory level
primary_weightdoublePrimary surgeon price / base_rate at the subcategory level
anchor_codesvarcharComma-joined list of anchor codes in this subcategory

latest_all_line_items

Grain: one row per (version, ssp_grouper, fee_type, code) — generic-library only.

Unified line-item view combining institutional, professional, and revenue-code entries. Source: export_all_line_items_v3 filtered to provider_id = '0'.

ColumnTypeDescription
versionvarcharSSP pipeline version
ssp_groupervarcharLegacy SSP identifier
ssp_grouper_idvarcharNew-crosswalk SSP ID
fee_typevarcharFacility Fee, Professional Fee, or Optional Fee
codevarcharLine code (HCPCS / CPT / MS-DRG / revenue code)
code_typevarcharHCPCS, CPT, MS-DRG, or RC
is_anchorbooleanTRUE when the line code is the SSP's anchor
association_ratedoubleEncounter share within the anchor (NULL for sub_package_contents-native rows)
line_code_descriptionvarcharHuman-readable description

latest_metadata

Grain: one row per (version, key).

Key/value pipeline summary — static params plus row counts from each export_* table. Used by the UI and QA tooling to confirm the latest run completed.

ColumnTypeDescription
versionvarcharSSP pipeline version
keyvarcharMetadata key (see rows emitted, below)
valuevarcharMetadata value

Rows emitted per run:

keyMeaning
ssp_versionDAG ssp_version param
pipeline_dateDAG pipeline_date param
export_datecurrent_date at run time
provider_id'0' — the generic-library row ID used by the latest_* filter
ssp_descriptions_rowsRow count of export_ssp_descriptions_v3
ssp_prices_rowsRow count of export_ssp_prices_v3 filtered to provider_id = '0'
facility_rc_families_rowsRow count of export_facility_rc_families_v3 filtered to provider_id = '0'
professional_line_items_rowsRow count of export_professional_line_items_v3 filtered to provider_id = '0'
professional_conveners_rowsRow count of export_professional_conveners_v3 filtered to provider_id = '0'
sub_category_rowsRow count of export_sub_category_v3 filtered to provider_id = '0'
all_line_items_rowsRow count of export_all_line_items_v3 filtered to provider_id = '0'