Code for "Improving the Performance of Risk Adjustment Systems: Constrained Regressions, Reinsurance, and Variable Selection"
This set of programs predict total spending (or spending including reinsurance) using different sets of input variables and types of constraints/penalties. It outputs predicted values for each scenario, which can be used to summarize performance across estimation methods.
- run_analysis.R - run analysis programs. Calls:
- xfold_regs.R - run x-fold crossvalidation (in parallel) for each estimation method.
- screener.R
- helper_functions.R
- xfold_regs.R - run x-fold crossvalidation (in parallel) for each estimation method.
The programs use the simulated data found in the "data" folder. This simulated data contains information on individual sex$\times$5-year age buckets, health condition categories (HCCs), drug-related condition categories (RXCs), four cchronic condition groups (cancer, diab, heart, mental), annual health spending (totpay), and annual health spending with reinsurance included (totpay_r).
Each output dataset contains the predicted outcomes for the various methods:
- pred_ols_og: original risk adjustment formula predicting total spending (totpay) given inputs
- pred_ols: risk adjustment formula predicting spending given inputs, with reinsurance payments included
- pred_constraint: risk adjustment formula constraining predicted group spend = mean group spend
- pred_constraint2: risk adjustment formula constraining predicted group spend to levels to preset predictive ratios
- pred_penaltymixed: risk adjustment formula penalizing misprediction of groups.
The output also includes indictors for membership in one of the four chronic condition groups (cancer, diab, heart, mental) that were used in the constraints and penalties.