Status:Closed    Asked:Apr 14, 2014 - 07:47 PM

How do I balance my panel?

I'm interested in studying the personal income of veterans and non veterans from 2006 to 2012. Additional variables include the period of the veteran's service, whether the person is self or wage-employed, and demographic variables such as race and sex.

Everytime I submit for my extract, however, I receive a panel filled with mostly veterans or vice-versa. How can I get a data set that has an even balance (50%) of my two key binary variables; class of worker and veteran status? I've tried tweaking the sample sizes to no avail.

Do you have the same question? Follow this Question

Staff Answer




IPUMS-USA does not currently support the kind of case selection you are describing. The ACS data you are downloading represents a sample of the entire U.S. population from each year. Within each sample, every person is given a weight (PERWT) calculated by the Census Bureau to account for sampling error and the like, meant to multiply that person by the amount of people they represent within the total U.S. population for the year they were sampled. Because creating a 50% veteran, 50% non-veteran subsample could introduce new sampling error, the weights may no longer be correct. You could use the IPUMS-USA extract of all persons from 2006-2012, and generate your own random sample of veterans and non-veterans, resulting in your desired sample size.

I hope this helps.


Apr 16, 2014 - 08:35 AM

Report it


I am looking for the Kessler 6 variable in IPUMS-MEPS
Reason for missing MIGSTA1 values in ASEC files for 1985 (and 1995)
India Area-level data (geo2) inconsistent for total population, urban share, ...
Does anyone have a method for handling missing data in CPS "EDUC" variable? 2...
How do I find the data file:Ipumsi_00001.dat for the 1974
Variables from Full Year Consolidated Files currently not included in IPUMS-M...
Login   |   Register

Recently Active Members

View More »

Share |