tempData <- read.csv(url("https://laurencipriano.github.io/IveyBusinessStatistics/Datasets/mepsData.csv"), header = TRUE)
summary(tempData)
> Observation Person_ID FluVaccination Age
> Min. : 1 Min. :2290001101 Min. :0 Min. : 0
> 1st Qu.: 7616 1st Qu.:2294946102 1st Qu.:0 1st Qu.:18
> Median :15231 Median :2320170102 Median :0 Median :38
> Mean :15231 Mean :2310116491 Mean :0 Mean :39
> 3rd Qu.:22846 3rd Qu.:2325003103 3rd Qu.:1 3rd Qu.:59
> Max. :30461 Max. :2329687103 Max. :1 Max. :85
> NA's :11290 NA's :375
> Sex RaceEthnicity HealthInsurance NotAffordHealthCare
> Min. :0.000 Min. :1.00 Min. :1.00 Min. :0
> 1st Qu.:0.000 1st Qu.:2.00 1st Qu.:1.00 1st Qu.:0
> Median :0.000 Median :2.00 Median :1.00 Median :0
> Mean :0.479 Mean :2.13 Mean :1.48 Mean :0
> 3rd Qu.:1.000 3rd Qu.:2.00 3rd Qu.:2.00 3rd Qu.:0
> Max. :1.000 Max. :5.00 Max. :3.00 Max. :1
> NA's :599
> FamIncome_Continuous MentalHealth FamIncome_Categorical
> Min. : 0 Min. :1 Min. :1.00
> 1st Qu.: 26895 1st Qu.:1 1st Qu.:3.00
> Median : 56532 Median :2 Median :4.00
> Mean : 75267 Mean :2 Mean :3.55
> 3rd Qu.:103882 3rd Qu.:3 3rd Qu.:5.00
> Max. :583219 Max. :5 Max. :5.00
> NA's :14 NA's :536
> FamIncome_PercentPoverty HealthStatus HaveProvider CensusRegion
> Min. : 0 Min. :1 Min. :0 Min. :1
> 1st Qu.: 138 1st Qu.:1 1st Qu.:1 1st Qu.:2
> Median : 271 Median :2 Median :1 Median :3
> Mean : 365 Mean :2 Mean :1 Mean :3
> 3rd Qu.: 494 3rd Qu.:3 3rd Qu.:1 3rd Qu.:3
> Max. :3020 Max. :5 Max. :1 Max. :4
> NA's :14 NA's :530 NA's :1423 NA's :375
> TotalHealthExpenditure HasHypertension HasDiabetes BMI
> Min. : 0 Min. :0 Min. :0.0 Min. : 0
> 1st Qu.: 213 1st Qu.:0 1st Qu.:0.0 1st Qu.:24
> Median : 1179 Median :0 Median :0.0 Median :27
> Mean : 6094 Mean :0 Mean :0.1 Mean :28
> 3rd Qu.: 4903 3rd Qu.:1 3rd Qu.:0.0 3rd Qu.:32
> Max. :807611 Max. :1 Max. :1.0 Max. :71
> NA's :7725 NA's :202 NA's :12209
The data set mepsData is a longitudinal study which surveys and measures a particular group of people over time. In the case of MEPS, each household that is surveyed is surveyed a total of five times (or 5 interview rounds) over a period of two years.
Variable | Description |
---|---|
Person_ID | Unique ID |
Age | In Years |
Sex | Self-Reported Gender 1: Male 0: Female |
RaceEthnicity | Race/Ethnicity 1: Hispanic 2: Non-Hispanic White Only 3: Non-Hispanic Black Only 4: Non-Hispanic Asian Only 5: Non-Hispanic Other Race or Multiple Race |
CensusRegion | Respondents Census Region 1: North-East 2: Mid-West 3: South 4: West |
HealthStatus | How does the respondent describe their own health 1: Excellent 2: Very Good 3: Good 4: Fair 5: Poor |
MentalHealth | How does the respondent describe their own mental health 1: Excellent 2: Very Good 3: Good 4: Fair 5: Poor |
BMI | Body Mass Index (Collected for adults) |
HasHypertension | Has the respondent been diagnosed with high blood pressure? 1: Yes 0: No |
HasDiabetes | Has the respondent been diagnosed with diabetes? 1: Yes 0: No |
Drinks5Day | Over the past 12 months, did the respondent have 5 or more alcoholic drinks per day? 1: Yes 0: No |
HealthInsurance | What type of health insurance, if any, does the respondent have? 1: Any Private 2: Public Only 3: Uninsured |
HaveProvider | Respondent has a usual healthcare provider 1: Yes 0: No |
NotAffordHealthCare | In the past 12 months, respondent did not receive or delayed medical care because of cost? 1: Yes 0: No |
FamIncome_Continuous | Family total income from all sources |
FamIncome_PercentPoverty | Family income divided by the applicable poverty line based on family size and composition |
FamIncome_Categorical | Family income as a percent of the poverty line organized as a categorical variable 1: Poor/Negative 2: Near Poor 3: Low Income 4: Middle Income 5: High Income |
TotalHealthExpenditure | Total health care expenditures ($): all sources (public insurance, private insurance, and out-of-pocket) |