Please click here if you are unable to view this page.
TOPIC:
OPTIMAL ESTIMATION OF HETEROGENEOUS PARAMETERS UNDER UNKNOWN HETEROSKEDASTICITY
ABSTRACT
This paper studies the large-scale estimation of heterogeneous parameters under limited information where the heterogeneity is on a unit-level, e.g., teachers or neighborhoods. We employ a normal sampling model with unknown heteroskedasticity and provide generalized Tweedie’s formula for the posterior means of the heterogeneous parameters. We then use these to characterize the compound optimal estimators (the oracles) of the unit-specific mean and quantile parameters in terms of the density of certain sufficient statistics. Feasible versions are proposed for which we provide asymptotic compound optimal guarantees, where their compound risk is shown to be asymptotically equivalent to that of the infeasible oracles. Numerical experiments show that the proposed estimators are generally within 1–3% of the oracles for an extensive range of data generating processes, including ones calibrated to our empirical application. The estimators are employed in an empirical study of teachers’ effects on students’ test outcomes where we find that the teacher rankings can be highly sensitive to how one defines teacher quality, whether in terms of the mean or lower percentile students.