Status:Closed    Asked:Sep 12, 2015 - 06:40 PM

Using sample weights for regression analysis of Mexico's data?

I'm running a difference in difference regression using Mexico's 2010 census data. I want to compare earnings of individuals in municipalities that received some program vs. earnings in those that didn't. I am trying to understand how to use the sample weights for the census and what are the relevant strata.

Am I correct in thinking that I need to specify:

svyset serial [pweight=wthh]

before running the regression? (eg svy: reg x y)

Any references on this would be extremely appreciated.

Thank you very much!

Do you have the same question? Follow this Question

Staff Answer




The Mexico 2010 sample requires weighting and is clustered by both municipality (strata) and household. As a result, researchers should adjust for the household-level clustering, as well as use weights to account for the complex survey design. Your Stata code appears consistent with this approach. Using the weight variable should account for stratification, although you might consider explicitly identifying municipality as the strata in your svyset command.

For a more complete discussion of sampling error and variance estimation, please refer to this IPUMS-International User's Note.

Hope this helps.


Sep 14, 2015 - 03:41 PM

Report it


I am looking for the Kessler 6 variable in IPUMS-MEPS
Reason for missing MIGSTA1 values in ASEC files for 1985 (and 1995)
India Area-level data (geo2) inconsistent for total population, urban share, ...
Does anyone have a method for handling missing data in CPS "EDUC" variable? 2...
How do I find the data file:Ipumsi_00001.dat for the 1974
Variables from Full Year Consolidated Files currently not included in IPUMS-M...
Login   |   Register

Recently Active Members

View More »

Share |