STA429/1007 Assignment 5

Quiz on Friday Feb. 16th at 12:10 p.m.


The Poverty Data

For 97 countries in the world, the file poverty.dat contains data for birth rates, death rates, infant death rates, life expectancies for males and females, and Gross National Product. The variables are:

In the next assignment you are going to do multiple regression on these data, but we have some lecture to finish first. So for this assignment, define printing formats for country group, and read the data. You should provide labels for the variables, but please comment them out once you are sure they work, because your correlation matrix will look better without labels.

Create a new variable that is the difference in life expectancy between males and females. If you compute female minus male, the differences will be mostly positive numbers, which for some reason are easier to think about.

Do a correlation matrix of all the quantitative variables, including the new one. Be ready to interpret the correlation coefficients and associated significance tests.
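
To make clear what the correlation matrix and its significance tests are computing, here is a minimal sketch in Python. This is not the software used in the course, and the numbers are made-up toy values, not taken from poverty.dat. The variable names (male, female, diff) are hypothetical. Each entry of the matrix is a Pearson correlation r, and the usual test of zero correlation uses t = r * sqrt((n-2)/(1-r^2)) with n-2 degrees of freedom. Your statistical package will report the corresponding p-value for you.

```python
import math

def pearson_r(x, y):
    """Pearson correlation coefficient between two equal-length lists."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)

def t_statistic(r, n):
    """t statistic for H0: correlation = 0, with n - 2 degrees of freedom.
    Not defined when r is exactly 1 or -1."""
    return r * math.sqrt((n - 2) / (1 - r * r))

# Made-up toy values for illustration only (NOT from poverty.dat).
male = [60.0, 65.0, 70.0, 72.0, 55.0]
female = [64.0, 70.0, 77.0, 78.0, 57.0]
# New variable: female minus male life expectancy.
diff = [f - m for f, m in zip(female, male)]

variables = {"male": male, "female": female, "diff": diff}
names = list(variables)

# Correlation matrix stored as a nested dictionary.
corr = {a: {b: pearson_r(variables[a], variables[b]) for b in names}
        for a in names}

# Print the matrix, one row per variable.
print(" " * 8 + " ".join(f"{b:>7}" for b in names))
for a in names:
    print(a.ljust(8) + " ".join(f"{corr[a][b]:7.3f}" for b in names))

# Test statistic for one off-diagonal entry.
n = len(male)
t_male_diff = t_statistic(corr["male"]["diff"], n)
print(f"t for male vs diff: {t_male_diff:.3f} on {n - 2} df")
```

The matrix is symmetric with ones on the diagonal, so only the entries on one side of the diagonal need interpreting.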

Do a one-way ANOVA with country group as the independent variable and difference in life expectancy as the dependent variable. Get the group means as well as an overall significance test. We are not doing follow-up tests yet, but what trends do you seem to see in the data?
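
As a rough illustration of what the one-way ANOVA output contains, the Python sketch below computes the group means and the overall F statistic by hand. Again, this is not the course software. The group labels (group_a, group_b, group_c) and the values are invented for illustration and do not come from poverty.dat. F is the between-group mean square divided by the within-group mean square, with (k-1) and (n-k) degrees of freedom for k groups and n observations.

```python
def one_way_anova(groups):
    """One-way ANOVA by hand.

    groups: dict mapping a group label to a list of observations.
    Returns (group means, F statistic, df between, df within).
    """
    all_values = [x for values in groups.values() for x in values]
    n_total = len(all_values)
    grand_mean = sum(all_values) / n_total

    means = {g: sum(values) / len(values) for g, values in groups.items()}

    # Between-group sum of squares: group means around the grand mean.
    ss_between = sum(len(values) * (means[g] - grand_mean) ** 2
                     for g, values in groups.items())
    # Within-group sum of squares: observations around their own group mean.
    ss_within = sum((x - means[g]) ** 2
                    for g, values in groups.items() for x in values)

    df_between = len(groups) - 1
    df_within = n_total - len(groups)
    f_stat = (ss_between / df_between) / (ss_within / df_within)
    return means, f_stat, df_between, df_within

# Hypothetical groups with made-up differences in life expectancy
# (female minus male). NOT the real country groups or data.
toy_groups = {
    "group_a": [1.0, 2.0, 3.0],
    "group_b": [2.0, 3.0, 4.0],
    "group_c": [5.0, 6.0, 7.0],
}

means, f_stat, df_between, df_within = one_way_anova(toy_groups)
for label, mean in means.items():
    print(f"{label}: mean = {mean:.2f}")
print(f"F = {f_stat:.2f} on ({df_between}, {df_within}) df")
```

Looking at the group means next to the F statistic is what lets you describe trends informally before any follow-up tests are done.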

This is supposed to be quick and easy. Please bring your log and list file to the quiz.