2.3 Case C→C

Réussis tes devoirs et examens dès maintenant avec Quizwiz!

In our example: The Total row gives the summary of the categorical variable body image:

(These are the same counts we got earlier in the course when we looked at the single categorical variable body image, and did not consider gender.)

-

Another way to visualize the conditional percents, instead of a table, is the double bar chart. This display is quite common in newspapers.

Case C→C: Two Categorical Variables

Earlier in the course, (when we discussed the distribution of a single categorical variable) we examined the data obtained when a random sample of 1,200 U.S. college students were asked about their body image (underweight, overweight, or about right). We are now returning to this example, to address the following question: If we had separated our sample of 1,200 U.S. college students by gender and looked at males and females separately, would we have found a similar distribution across body-image categories? More specifically, are men and women just as likely to think their weight is about right? Among those students who do not think their weight is about right, is there a difference between the genders in feelings about body image? Answering these questions requires us to examine the relationship between two categorical variables, gender and body image. Because the question of interest is whether there is a gender effect on body image, - the explanatory variable is gender, and - the response variable is body image.

Comments

In our example, we chose to organize the data with the explanatory variable gender in rows and the response variable body image in columns, and thus our conditional percents were row percents, calculated within each row separately. Similarly, if the explanatory variable happens to sit in columns and the response variable in rows, our conditional percents will be column percents, calculated within each column separately. For an example, see the "Did I Get This?" exercises below.

Introduction

Recall the role-type classification table for framing our discussion about the relationship between two variables: (picture) We are done with case C→Q, and will now move on to case C→C, where we examine the relationship between two categorical variables.

Did I Get This?

Suppose a study were done to answer the question: "Is the smoking of students related to their parents' smoking habits?" in which data were collected from 5,375 students and organized in the following two-way table:

Once again the raw data is a long list of 1,200 genders and responses, and thus not very useful in that form. To start our exploration of how body image is related to gender, we need an informative display that summarizes the data. In order to summarize the relationship between two categorical variables, we create a display called a two-way table. Here is the two-way table for our example:

The table has the possible genders in the rows, and the possible responses regarding body image in the columns. At each intersection between row and column, we put the counts for how many times that combination of gender and body image occurred in the data. We sum across the rows to fill in the Total column, and we sum down the columns to fill in the Total row. So for example,

In our example:

We look at each gender separately and convert the counts to percents within that gender. Let's start with females: (Note that each count is converted to percents by dividing by the total number of females, 760. These numerical summaries are called conditional percents, since we find them by "conditioning" on one of the genders.)

Note that

it doesn't make sense to compare raw counts, because there are more females than males overall. So for example, it is not very informative to say "there are 560 females who responded 'about right' compared to only 295 males," since the 560 females are out of a total of 760, and the 295 males are out of a total of only 440). We need to supplement our display, the two-way table, with some numerical summaries that will allow us to compare the distributions. These numerical summaries are found by simply converting the counts to percents within (or restricted to) each value of the explanatory variable separately.

Remember, though,

that our primary goal is to explore how body image is related to gender. Exploring the relationship between two categorical variables (in this case body image and gender) amounts to comparing the distributions of the response variable (in this case body image) across the different values of the explanatory variable (in this case males and females):


Ensembles d'études connexes

Resp 3: Implement Health Education

View Set

Human Resources MGMT Chapter 8: Compensation and Benefits

View Set

UCO HLTH 1112 Exam 2 Study guide

View Set