Question

Status: Closed    Asked: Mar 20, 2017 - 10:45 PM

Getting race and gender percentages by detailed occupation by year

I'm trying to recreate this 2016 table, except with more detailed breakdowns of race and gender: https://www.bls.gov/cps/cpsaat11.htm


For example, instead of separate categories for Women and Race, I'd like to create percentages for "White women," "Black women," "Asian women," and "Latina women," too.


The trouble I'm having is with weighting the data. As far as I know, in order to get the percentage of a group in weighted survey data, I need to sum the WTSUPP of the group I'm looking at and divide by the sum of WTSUPP for the entire sample.
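A minimal sketch of that calculation in Python, for illustration only. The rows, the lowercase field names (`sex`, `race`, `wtsupp`), and the helper `weighted_shares` are all hypothetical stand-ins, not real IPUMS data or an IPUMS API; the sketch also assumes the rows have already been restricted to the occupation and year of interest.

```python
from collections import defaultdict

# Hypothetical rows standing in for an extract; the values are made up
# for illustration and are not real survey data.
rows = [
    {"sex": "female", "race": "white", "wtsupp": 1500.0},
    {"sex": "female", "race": "black", "wtsupp": 500.0},
    {"sex": "male", "race": "white", "wtsupp": 1000.0},
    {"sex": "male", "race": "asian", "wtsupp": 1000.0},
]


def weighted_shares(rows, key_fields, weight_field="wtsupp"):
    """Return each group's share of the total weight, as a percentage."""
    totals = defaultdict(float)
    for row in rows:
        # A group is identified by the tuple of its values for key_fields.
        group = tuple(row[field] for field in key_fields)
        totals[group] += row[weight_field]
    grand_total = sum(totals.values())
    return {group: 100.0 * weight / grand_total for group, weight in totals.items()}


# Share of women overall, then the more detailed sex-by-race breakdown.
by_sex = weighted_shares(rows, ["sex"])
by_sex_race = weighted_shares(rows, ["sex", "race"])
```

Passing more than one field in `key_fields` gives the combined categories (for example women by race) with the same sum-of-weights-over-total-weights logic, so the detailed shares add up to the overall share for each sex.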


So, to make sure I was on the right track, I decided to compare the female percentage in the IPUMS data set with the BLS figure for "Healthcare support occupations" (occupation codes 3600-3655). I summed the WTSUPP of just the female rows and divided by the sum of WTSUPP for all the rows (female and male). But my result appears to be off from the BLS figure by about 0.3 percentage points (BLS says 87.7% women, my result was 88%). Is there any reason why this would be happening? This is on year 2016 data, so I'm not sure if it's affected by updates to weights. I'd also love to know if I'm doing this all wrong!

 
 

Staff Answer


Jeff Bloem

Staff

It sounds like you are weighting the data correctly, and your estimate is not far off the official BLS estimate. In general, we don't expect to exactly replicate official statistics. Your estimate looks to be well within the margin of error of the BLS estimate, so I wouldn't worry too much about your methodology.

 

Mar 21, 2017 - 10:28 AM

