Status:Closed    Asked:Sep 12, 2015 - 06:40 PM

Using sample weights for regression analysis of Mexico's data?

I'm running a difference in difference regression using Mexico's 2010 census data. I want to compare earnings of individuals in municipalities that received some program vs. earnings in those that didn't. I am trying to understand how to use the sample weights for the census and what are the relevant strata.

Am I correct in thinking that I need to specify:

svyset serial [pweight=wthh]

before running the regression? (eg svy: reg x y)

Any references on this would be extremely appreciated.

Thank you very much!

Do you have the same question? Follow this Question

Staff Answer




The Mexico 2010 sample requires weighting and is clustered by both municipality (strata) and household. As a result, researchers should adjust for the household-level clustering, as well as use weights to account for the complex survey design. Your Stata code appears consistent with this approach. Using the weight variable should account for stratification, although you might consider explicitly identifying municipality as the strata in your svyset command.

For a more complete discussion of sampling error and variance estimation, please refer to this IPUMS-International User's Note.

Hope this helps.


Sep 14, 2015 - 03:41 PM

Report it


Calculating regression standard errors
Login   |   Register

Recently Active Members

View More »

Share |