US Medical Expenditure Panel Survey (MEPS)

# Load the MEPS extract from the course website into a data frame
tempData <- read.csv(url("https://laurencipriano.github.io/IveyBusinessStatistics/Datasets/mepsData.csv"), header = TRUE)

summary(tempData)
>   Observation      Person_ID          FluVaccination       Age     
>  Min.   :    1   Min.   :2290001101   Min.   :0       Min.   : 0   
>  1st Qu.: 7616   1st Qu.:2294946102   1st Qu.:0       1st Qu.:18   
>  Median :15231   Median :2320170102   Median :0       Median :38   
>  Mean   :15231   Mean   :2310116491   Mean   :0       Mean   :39   
>  3rd Qu.:22846   3rd Qu.:2325003103   3rd Qu.:1       3rd Qu.:59   
>  Max.   :30461   Max.   :2329687103   Max.   :1       Max.   :85   
>                                       NA's   :11290   NA's   :375  
>       Sex        RaceEthnicity  HealthInsurance NotAffordHealthCare
>  Min.   :0.000   Min.   :1.00   Min.   :1.00    Min.   :0          
>  1st Qu.:0.000   1st Qu.:2.00   1st Qu.:1.00    1st Qu.:0          
>  Median :0.000   Median :2.00   Median :1.00    Median :0          
>  Mean   :0.479   Mean   :2.13   Mean   :1.48    Mean   :0          
>  3rd Qu.:1.000   3rd Qu.:2.00   3rd Qu.:2.00    3rd Qu.:0          
>  Max.   :1.000   Max.   :5.00   Max.   :3.00    Max.   :1          
>                                                 NA's   :599        
>  FamIncome_Continuous  MentalHealth FamIncome_Categorical
>  Min.   :     0       Min.   :1     Min.   :1.00         
>  1st Qu.: 26895       1st Qu.:1     1st Qu.:3.00         
>  Median : 56532       Median :2     Median :4.00         
>  Mean   : 75267       Mean   :2     Mean   :3.55         
>  3rd Qu.:103882       3rd Qu.:3     3rd Qu.:5.00         
>  Max.   :583219       Max.   :5     Max.   :5.00         
>  NA's   :14           NA's   :536                        
>  FamIncome_PercentPoverty  HealthStatus  HaveProvider   CensusRegion
>  Min.   :   0             Min.   :1     Min.   :0      Min.   :1    
>  1st Qu.: 138             1st Qu.:1     1st Qu.:1      1st Qu.:2    
>  Median : 271             Median :2     Median :1      Median :3    
>  Mean   : 365             Mean   :2     Mean   :1      Mean   :3    
>  3rd Qu.: 494             3rd Qu.:3     3rd Qu.:1      3rd Qu.:3    
>  Max.   :3020             Max.   :5     Max.   :1      Max.   :4    
>  NA's   :14               NA's   :530   NA's   :1423   NA's   :375  
>  TotalHealthExpenditure HasHypertension  HasDiabetes       BMI       
>  Min.   :     0         Min.   :0       Min.   :0.0   Min.   : 0     
>  1st Qu.:   213         1st Qu.:0       1st Qu.:0.0   1st Qu.:24     
>  Median :  1179         Median :0       Median :0.0   Median :27     
>  Mean   :  6094         Mean   :0       Mean   :0.1   Mean   :28     
>  3rd Qu.:  4903         3rd Qu.:1       3rd Qu.:0.0   3rd Qu.:32     
>  Max.   :807611         Max.   :1       Max.   :1.0   Max.   :71     
>                         NA's   :7725    NA's   :202   NA's   :12209
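The summary above reports missing values one column at a time. As a minimal sketch, the counts can also be collected into a single table sorted from most to least missing. The helper name `missingSummary` and the small `toy` data frame are hypothetical and exist only so the example runs without a download; with the real data, pass `tempData` instead.

```r
# Hypothetical helper: count and percentage of missing values per column.
missingSummary <- function(df) {
  nMissing <- colSums(is.na(df))
  out <- data.frame(
    variable    = names(nMissing),
    n_missing   = as.integer(nMissing),
    pct_missing = round(100 * as.integer(nMissing) / nrow(df), 1),
    stringsAsFactors = FALSE
  )
  # Sort so the columns with the most missing values come first
  out[order(-out$n_missing), ]
}

# Toy stand-in for tempData (values invented for illustration only)
toy <- data.frame(
  Age = c(34, NA, 61, 8),
  BMI = c(27.1, NA, NA, NA),
  Sex = c(0, 1, 1, 0)
)
missingSummary(toy)
```

Calling `missingSummary(tempData)` should surface the columns with the largest NA counts in the summary (for example BMI and FluVaccination) at the top.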

The mepsData data set (loaded above as tempData) comes from the Medical Expenditure Panel Survey (MEPS), a longitudinal study: it surveys and measures the same group of people repeatedly over time. Each household in MEPS is interviewed five times (five interview rounds) over a period of two years.
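Because the survey follows people over time, it can be worth checking whether a given extract holds one row per person or several. The sketch below is one way to do that; `rowsPerPerson` and the `toy` data frame are hypothetical names used for illustration, and nothing here asserts how mepsData itself is organized.

```r
# Hypothetical helper: how many rows does each person contribute?
rowsPerPerson <- function(df, idCol = "Person_ID") {
  counts <- table(df[[idCol]])
  c(persons = length(counts),
    rows = nrow(df),
    max_rows_per_person = max(counts))
}

# Toy stand-in for tempData (IDs and ages invented for illustration only)
toy <- data.frame(
  Person_ID = c(2290001101, 2290001102, 2290001102),
  Age       = c(34, 61, 61)
)
rowsPerPerson(toy)
```

If `max_rows_per_person` is 1 for `tempData`, the file has a single record per person; a larger value would indicate repeated records.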

Health Data Variables and Descriptions
Variable: Description (codes)
Person_ID: Unique ID
Age: Age in years
Sex: Self-reported gender (1: Male, 0: Female)
RaceEthnicity: Race/ethnicity (1: Hispanic, 2: Non-Hispanic White Only, 3: Non-Hispanic Black Only, 4: Non-Hispanic Asian Only, 5: Non-Hispanic Other Race or Multiple Race)
CensusRegion: Respondent's Census region (1: North-East, 2: Mid-West, 3: South, 4: West)
HealthStatus: How the respondent describes their own health (1: Excellent, 2: Very Good, 3: Good, 4: Fair, 5: Poor)
MentalHealth: How the respondent describes their own mental health (1: Excellent, 2: Very Good, 3: Good, 4: Fair, 5: Poor)
BMI: Body Mass Index (collected for adults)
HasHypertension: Has the respondent been diagnosed with high blood pressure? (1: Yes, 0: No)
HasDiabetes: Has the respondent been diagnosed with diabetes? (1: Yes, 0: No)
Drinks5Day: Over the past 12 months, did the respondent have 5 or more alcoholic drinks per day? (1: Yes, 0: No)
HealthInsurance: What type of health insurance, if any, does the respondent have? (1: Any Private, 2: Public Only, 3: Uninsured)
HaveProvider: Does the respondent have a usual healthcare provider? (1: Yes, 0: No)
NotAffordHealthCare: In the past 12 months, did the respondent forgo or delay medical care because of cost? (1: Yes, 0: No)
FamIncome_Continuous: Family total income from all sources
FamIncome_PercentPoverty: Family income as a percentage of the applicable poverty line, based on family size and composition
FamIncome_Categorical: Family income as a percent of the poverty line, organized as a categorical variable (1: Poor/Negative, 2: Near Poor, 3: Low Income, 4: Middle Income, 5: High Income)
TotalHealthExpenditure: Total health care expenditures ($) from all sources (public insurance, private insurance, and out-of-pocket)
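read.csv() imports the coded variables above as plain numbers, which is why the summary reports means for categories such as RaceEthnicity. A minimal sketch of attaching the labels from the table as factor levels follows. The helper `labelCodes`, the `codebook` list, and the `toy` data frame are hypothetical names introduced for illustration; only three of the coded variables are shown, with codes copied from the table.

```r
# Hypothetical helper: convert coded numeric columns to labelled factors.
# `codebook` maps a column name to its codes (levels) and their labels.
labelCodes <- function(df, codebook) {
  for (v in intersect(names(codebook), names(df))) {
    cb <- codebook[[v]]
    df[[v]] <- factor(df[[v]], levels = cb$levels, labels = cb$labels)
  }
  df
}

# Codes and labels taken from the variable table above
codebook <- list(
  Sex = list(levels = c(0, 1), labels = c("Female", "Male")),
  HealthInsurance = list(levels = 1:3,
                         labels = c("Any Private", "Public Only", "Uninsured")),
  CensusRegion = list(levels = 1:4,
                      labels = c("North-East", "Mid-West", "South", "West"))
)

# Toy stand-in for tempData (values invented for illustration only)
toy <- data.frame(
  Sex             = c(0, 1, NA),
  HealthInsurance = c(1, 3, 2),
  CensusRegion    = c(4, NA, 2)
)
labelled <- labelCodes(toy, codebook)
table(labelled$HealthInsurance, useNA = "ifany")
```

With the real data, `labelCodes(tempData, codebook)` would leave missing values as NA, and summary() would then report counts per category instead of quartiles for the converted columns.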