1. the demographic variables dataset
3. the body measures dataset
Since 1971, the National Center for Health Statistics had been assessing the health and nutritional status of both children and adults in the United States, through periodic National Health and Nutritional Examination Survey (NHANES) surveys. These surveys are an invaluable resource to epidemiological and public health research; the surveys can be used to determine the prevalence of major diseases and risk factors, to assess nutrition and health promotion, and to guide public health policy.
All initial and peer postings should be at least 250-500 words in APA format supported by scholarly sources.
In 2012, the NHANES National Youth Fitness Survey (NNYFS) was conducted in conjunction with NHANES to obtain physical activity and fitness levels of U.S. youths aged 3 through 15. Initial data from the NNYFS were released in 2013 and serve as the basis for this discussion problem.
Begin by downloading the Excel file MHA610_Week 6_Discussion_NNYFS_workingdata.xls. This workbook was created by merging two datasets from the NNYFS: the demographic variables dataset, and the body measures dataset. For the purposes of this discussion, many variables were eliminated from the original datasets, as well as observations with missing data on height and weight. The Excel workbook thus consists of one worksheet, with 1576 rows (the first row contains headers, and the next 1575 rows are observed values for the participants), and 11 columns of variables. The columns in the Excel file are the following:
the respondent sequence number (index for all the files)
gender of the participant, 1 = male, 2 = female
1 = Mexican American 2 = other Hispanic 3 = non-Hispanic white 4 = non-Hispanic black 5 = other
age in years at time of physical exam
annual household income, categorized
annual family income, categorized
ratio of family income to poverty, 0 to 5
weight, in kg
height, in cm
body mass index (kg/m^2)
1 = underweight 2 = normal weight 3 = overweight 4 = obese . = missing
For purposes of this discussion, you are asked to answer the three following questions:
• Does BMI vary significantly between boys and girls?
• Does BMI vary significantly among the racial/ethnic groups?
• Is there any trend to BMI with age?
There are several ways to address these questions. For example, you might take BMXBMI as your outcome variable of interest: it is continuous, so you could then perform a two-sample t test for (1), a one way analysis of variance for (2), and a simple regression analysis (with age as the predictor variable) for (3).
Alternatively, you might reduce the problem to consideration of binomial probabilities: for example, you could classify everyone as obese or not obese (or maybe, overweight/obese vs underweight/normal), then compare binomial outcomes for (1) and (2) (z tests with the normal approximation or contingency tables), and conduct a t test on ages for (3).
Neither approach is wrong—the key is interpreting your findings!
If you prefer to do the analyses in Statdisk, there is a file, MHA610_Week 6_Discussion_NNYFS_workingdata.csv, ready to be read into Statdisk. (It’s the original Excel workbook, saved as csv.) No need to go through any additional steps, unless you wish to restructure the data in Excel.
Incidentally, the income variables are not needed for these questions, but as a bonus, you might want to investigate whether obesity is related to socioeconomic status (as reflected by family income).