Status: Closed    Asked: Apr 23, 2014 - 12:26 PM

When dealing with historical data, what should we be aware of? Did coding change a lot in 1980 and after?

We are doing a project on Race, Gender and Employment. We want to track race, gender, occupation, wages, and benefits over time (1930-2012). I have heard that coding in the census changed dramatically in 1980 and after. Is this true, and are there other items to be aware of when dealing with historical data?


Staff Answer




It is true that coding structures have changed drastically over the time period you are interested in. However, the RACE variable is an excellent example of how IPUMS-USA integrates multiple coding structures to create a harmonized variable that is comparable across time. The primary issue to be aware of when comparing across long spans of time is that a question's universe (the population that was asked the question) can change from sample to sample, which may make a variable incomparable over time. For instance, the age limits on the universe for the OCC1950 variable changed several times over the years. This means that to produce a true comparison over time, you would have to restrict all of the data to the most restrictive universe. You will also want to study the sample design of each sample you are using to make sure the samples themselves are comparable. For example, the 2000-2005 ACS samples do not include persons in group quarters.
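As a rough illustration of restricting every sample to the most restrictive universe, here is a minimal Python sketch. The minimum ages in `HYPOTHETICAL_MIN_AGE`, the record layout, and both function names are assumptions made up for this example; they are not the actual OCC1950 universes, which you should look up for each sample in the IPUMS-USA variable documentation.

```python
# Illustrative placeholder values only: these are NOT the real OCC1950
# universes. Check the variable documentation for each sample you use.
HYPOTHETICAL_MIN_AGE = {1940: 14, 1980: 16, 2010: 16}


def most_restrictive_min_age(min_age_by_year):
    """Return the highest minimum age, i.e. the narrowest age universe."""
    return max(min_age_by_year.values())


def restrict_to_common_universe(records, min_age_by_year):
    """Keep records from the listed years that meet the narrowest age cutoff."""
    cutoff = most_restrictive_min_age(min_age_by_year)
    return [
        r for r in records
        if r["year"] in min_age_by_year and r["age"] >= cutoff
    ]


# Toy person records, invented for the example.
people = [
    {"year": 1940, "age": 14, "occ1950": 100},
    {"year": 1940, "age": 30, "occ1950": 200},
    {"year": 1980, "age": 16, "occ1950": 300},
    {"year": 2010, "age": 15, "occ1950": 400},
]

comparable = restrict_to_common_universe(people, HYPOTHETICAL_MIN_AGE)
for person in comparable:
    print(person["year"], person["age"], person["occ1950"])
```

With these placeholder values the common cutoff is the largest of the per-year minimum ages, so a 14-year-old who was in universe in one year is dropped because later samples would not have asked the question of someone that age.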

I hope this helps.


Apr 23, 2014 - 01:24 PM


