Health surveys are commonly conducted to evaluate the overall state of affairs in terms of health decisions and trends. Often, the data collected are paired with other extant data in order to see if relationships exist that were not specifically studied or around which data were not specifically collected.
The Centers for Disease Control, of course, collect thousands of data elements from a variety of health settings and surveys. The data set attached, â€œCase Study 3 â€“ States.csvâ€ includes 6 variables:
â€¢ Food hardship rate â€“ the reported rate of persons that experience the inability to purchase the food they need at least once in the last 12 months;
â€¢ Obesity rate â€“ the rate of obese persons, which is defined as having a BMI of 30 or greater;
â€¢ Adult cigarette use â€“ the proportion of persons 16 or older smoking more than 100 cigarettes in their lifetime and who continue to smoke;
â€¢ Child cigarette use â€“ the proportion of persons under 16 smoking more than 100 cigarettes in their lifetime and who continue to smoke;
â€¢ Tax â€“ the number of cents of state tax levied on a pack of 20 cigarettes; and
â€¢ Location â€“ the geographical location in the US to which each state and US is a member.
1.Create a scatterplot of the data for Food Hardship Rate and Obesity Rate. Copy the scatterplot
into this document with axes and chart titles and interpret what the scatterplot tells us about the relationship between the two variables.
2.For the food hardship and obesity variables, determine the strength of any correlation, and
determine whether it is significant at 0.05. Interpret the meaning of the correlation coefficient. A quick p-value calculator for Pearson correlation can be found on Social Science Statistics website (https://www.socscistatistics.com/pvalues/pearsondistribution.aspx). Write a journal entry for the results.
3.For the adult smoking and child smoking variables, determine the strength of any correlation,
and determine whether it is significant at 0.05. Interpret the meaning of the correlation coefficient. A quick p-value calculator for Pearson correlation can be found at https://www.socscistatistics.com/pvalues/pearsondistribution.aspx. Write a journal entry for the results.
4.One method of decreasing the smoking rate is to increase the tax rate on a pack of cigarettes.
Using the mean of the adult and child rate for each state and DC, consider predicting the smoking rate from the tax rate.
a)Calculate the mean of the adult and child smoking rates for each state and DC â€“ insert this in an Excel column.
b)With tax rate as the predictor variable, conduct a simple linear regression analysis on the data for mean adult and child smoking rate. In the analysis, you would typically report: