Status:Closed    Asked:Apr 23, 2014 - 12:26 PM

when dealing with historical data what should we be aware of; did coding change a lot 1980 and after?

We are doing a project on Race, Gender and Employment. We wanted to track race, gender, occupation, wages, benefits over time (1930-2012). I have heard that coding in the census changed dramatically during 1980 and after. Is this true and are there other items to be aware of when dealing with historical data?

Do you have the same question? Follow this Question

Staff Answer




It is true that coding structures have changed drastically over the time period you are interested in. However, the RACE variable is an excellent example of how IPUMS-USA integrates multiple coding structures to create a harmonized variable that is comparable across time. The primary issues that you should be aware of when comparing across large spans of time are the changes in question Universe that may make variables incomparable over time. For instance, the OCC1950 variable underwent multiple age limitation changes over the years. This means, that to produce a true comparison over time, you would have to restrict all of the data to the most restrictive universe. You will also want to study the sample designs of each sample you are using to make sure the samples themselves are comparable. For example, the 2000-2005 ACS samples do not include persons in Group Quarters.

I hope this helps.


Apr 23, 2014 - 01:24 PM

Report it


I am looking for the Kessler 6 variable in IPUMS-MEPS
Reason for missing MIGSTA1 values in ASEC files for 1985 (and 1995)
India Area-level data (geo2) inconsistent for total population, urban share, ...
Does anyone have a method for handling missing data in CPS "EDUC" variable? 2...
How do I find the data file:Ipumsi_00001.dat for the 1974
Variables from Full Year Consolidated Files currently not included in IPUMS-M...
Login   |   Register

Recently Active Members

View More »

Share |