US Medical Expenditure Panel Survey (MEPS)

tempData <- read.csv(url("https://laurencipriano.github.io/IveyBusinessStatistics/Datasets/mepsData.csv"), header = TRUE)

summary(tempData)
>   Observation      Person_ID          FluVaccination       Age      
>  Min.   :    1   Min.   :2290001101   Min.   :0.000   Min.   : 0.0  
>  1st Qu.: 7616   1st Qu.:2294946102   1st Qu.:0.000   1st Qu.:18.0  
>  Median :15231   Median :2320170102   Median :0.000   Median :38.0  
>  Mean   :15231   Mean   :2310116491   Mean   :0.429   Mean   :39.1  
>  3rd Qu.:22846   3rd Qu.:2325003103   3rd Qu.:1.000   3rd Qu.:59.0  
>  Max.   :30461   Max.   :2329687103   Max.   :1.000   Max.   :85.0  
>                                       NA's   :11290   NA's   :375   
>       Sex        RaceEthnicity  HealthInsurance NotAffordHealthCare
>  Min.   :0.000   Min.   :1.00   Min.   :1.00    Min.   :0.0000     
>  1st Qu.:0.000   1st Qu.:2.00   1st Qu.:1.00    1st Qu.:0.0000     
>  Median :0.000   Median :2.00   Median :1.00    Median :0.0000     
>  Mean   :0.479   Mean   :2.13   Mean   :1.48    Mean   :0.0473     
>  3rd Qu.:1.000   3rd Qu.:2.00   3rd Qu.:2.00    3rd Qu.:0.0000     
>  Max.   :1.000   Max.   :5.00   Max.   :3.00    Max.   :1.0000     
>                                                 NA's   :599        
>  FamIncome_Continuous  MentalHealth  FamIncome_Categorical
>  Min.   :     0       Min.   :1.00   Min.   :1.00         
>  1st Qu.: 26895       1st Qu.:1.00   1st Qu.:3.00         
>  Median : 56532       Median :2.00   Median :4.00         
>  Mean   : 75267       Mean   :2.04   Mean   :3.55         
>  3rd Qu.:103882       3rd Qu.:3.00   3rd Qu.:5.00         
>  Max.   :583219       Max.   :5.00   Max.   :5.00         
>  NA's   :14           NA's   :536                         
>  FamIncome_PercentPoverty  HealthStatus   HaveProvider    CensusRegion 
>  Min.   :   0             Min.   :1.00   Min.   :0.000   Min.   :1.00  
>  1st Qu.: 138             1st Qu.:1.00   1st Qu.:1.000   1st Qu.:2.00  
>  Median : 271             Median :2.00   Median :1.000   Median :3.00  
>  Mean   : 365             Mean   :2.22   Mean   :0.773   Mean   :2.73  
>  3rd Qu.: 494             3rd Qu.:3.00   3rd Qu.:1.000   3rd Qu.:3.00  
>  Max.   :3020             Max.   :5.00   Max.   :1.000   Max.   :4.00  
>  NA's   :14               NA's   :530    NA's   :1423    NA's   :375   
>  TotalHealthExpenditure HasHypertension  HasDiabetes          BMI       
>  Min.   :     0         Min.   :0.000   Min.   :0.0000   Min.   : 0.1   
>  1st Qu.:   213         1st Qu.:0.000   1st Qu.:0.0000   1st Qu.:23.7   
>  Median :  1179         Median :0.000   Median :0.0000   Median :27.3   
>  Mean   :  6094         Mean   :0.348   Mean   :0.0954   Mean   :28.2   
>  3rd Qu.:  4903         3rd Qu.:1.000   3rd Qu.:0.0000   3rd Qu.:31.9   
>  Max.   :807611         Max.   :1.000   Max.   :1.0000   Max.   :71.1   
>                         NA's   :7725    NA's   :202      NA's   :12209

The data set mepsData is a longitudinal study which surveys and measures a particular group of people over time. In the case of MEPS, each household that is surveyed is surveyed a total of five times (or 5 interview rounds) over a period of two years.

Health Data Variables and Descriptions
Variable Description
Person_ID Unique ID
Age In Years
Sex Self-Reported Gender 1: Male 0: Female
RaceEthnicity Race/Ethnicity 1: Hispanic 2: Non-Hispanic White Only 3: Non-Hispanic Black Only 4: Non-Hispanic Asian Only 5: Non-Hispanic Other Race or Multiple Race
CensusRegion Respondents Census Region 1: North-East 2: Mid-West 3: South 4: West
HealthStatus How does the respondent describe their own health 1: Excellent 2: Very Good 3: Good 4: Fair 5: Poor
MentalHealth How does the respondent describe their own mental health 1: Excellent 2: Very Good 3: Good 4: Fair 5: Poor
BMI Body Mass Index (Collected for adults)
HasHypertension Has the respondent been diagnosed with high blood pressure? 1: Yes 0: No
HasDiabetes Has the respondent been diagnosed with diabetes? 1: Yes 0: No
Drinks5Day Over the past 12 months, did the respondent have 5 or more alcoholic drinks per day? 1: Yes 0: No
HealthInsurance What type of health insurance, if any, does the respondent have? 1: Any Private 2: Public Only 3: Uninsured
HaveProvider Respondent has a usual healthcare provider 1: Yes 0: No
NotAffordHealthCare In the past 12 months, respondent did not receive or delayed medical care because of cost? 1: Yes 0: No
FamIncome_Continuous Family total income from all sources
FamIncome_PercentPoverty Family income divided by the applicable poverty line based on family size and composition
FamIncome_Categorical Family income as a percent of the poverty line organized as a categorical variable 1: Poor/Negative 2: Near Poor 3: Low Income 4: Middle Income 5: High Income
TotalHealthExpenditure Total health care expenditures ($): all sources (public insurance, private insurance, and out-of-pocket)