- 1 North Carolina data coverage: grades 4-5 from 1996-97 through 2010-11 and grade 3 from 1996-97 through 2009-10. The difference in sample sizes comparing columns (1) and (2) arises because we drop 1.37 million student-year observations that cannot be matched to their classroom teacher (see Appendix C for more detail).
Paper not yet in RePEc: Add citation now
- 2 LAUSD data coverage: grades 4-5 from 2003-04 through 2012-13 and 2015-16 through 2016-17 school years and third grade from 2003-04 through 2012-13.
Paper not yet in RePEc: Add citation now
Abadie, Alberto and Maximilian Kasy (2019), âChoosing among regularized estimators in empirical economics: The risk of machine learning.â Review of Economics and Statistics, 101, 743â762.
- Angrist, Joshua D., Peter D. Hull, Parag A. Pathak, and Christopher R. Walters (2017), âLeveraging lotteries for school value-added: Testing and estimation.â Quarterly Journal of Economics, 132, 871â919.
Paper not yet in RePEc: Add citation now
- Bacher-Hicks, Andrew, Thomas J. Kane, and Douglas O. Staiger (2014), âValidating teacher effect estimates using changes in teacher assignments in Los Angeles.â Working Paper 20657, National Bureau of Economic Research, URL http://www.nber.org/papers/w20657.
Paper not yet in RePEc: Add citation now
Bitler, Marianne, Sean Corcoran, Thurston Domina, and Emily Penner (2019), âTeacher effects on student achievement and height: A cautionary tale.â Working Paper 26480, National Bureau of Economic Research, URL http://www.nber.org/papers/w26480.
- Bonhomme, SteÌphane and Martin Weidner (2019), âPosterior average effects.â Working Paper CWP43/19, Centre for Microdata Methods and Practice, URL https://www.cemmap.ac.uk/ publication/id/14366.
Paper not yet in RePEc: Add citation now
- Brown, Lawrence D. (2008), âIn-season prediction of batting averages: A field test of empirical Bayes and Bayes methodologies.â Annals of Applied Statistics, 2, 113â152.
Paper not yet in RePEc: Add citation now
- Brown, Lawrence D. and Eitan Greenshtein (2009), âNonparametric empirical Bayes and compound decision approaches to estimation of a high-dimensional vector of normal means.â Annals of Statistics, 37, 1685â1704.
Paper not yet in RePEc: Add citation now
- Bruhn, Jesse (2020), âThe consequences of sorting for understanding school quality.â URL http: //www.jessebruhn.com/research. Unpublished.
Paper not yet in RePEc: Add citation now
- C Construction of the Teacher Value-Added Sample This appendix describes the construction of the final sample of students and teachers used for teacher VA estimation in both of our administrative datasets. Sample selection follows prior work (for instance, Chetty et al. (2014a,b)), the main requirements for inclusion in the sample being that the student has a valid score in a given subject both in the current and prior period, and can be matched to a teacher in that subject.
Paper not yet in RePEc: Add citation now
Chandra, Amitabh, Amy Finkelstein, Adam Sacarny, and Chad Syverson (2016), âHealth care exceptionalism? Performance and allocation in the US health care sector.â American Economic Review, 106, 2110â44.
- Chetty, Raj and Nathaniel Hendren (2018), âThe impacts of neighborhoods on intergenerational mobility II: County-level estimates.â Quarterly Journal of Economics, 133, 1163â1228.
Paper not yet in RePEc: Add citation now
- Chetty, Raj, John N. Friedman, and Jonah E. Rockoff (2014a), âMeasuring the impacts of teachers I: Evaluating bias in teacher value-added estimates.â American Economic Review, 104, 2593â 2632.
Paper not yet in RePEc: Add citation now
- Chetty, Raj, John N. Friedman, and Jonah E. Rockoff (2014b), âMeasuring the impacts of teachers II: Teacher value-added and student outcomes in adulthood.â American Economic Review, 104, 2633â79.
Paper not yet in RePEc: Add citation now
- Chetty, Raj, John N. Friedman, and Jonah E. Rockoff (2017), âMeasuring the impacts of teachers: Reply.â American Economic Review, 107, 1685â1717.
Paper not yet in RePEc: Add citation now
- Clotfelter, Charles T., Helen F. Ladd, and Jacob L. Vigdor (2006), âTeacher-student matching and the assessment of teacher effectiveness.â Journal of Human Resources, 41, 778â820.
Paper not yet in RePEc: Add citation now
- D Linking Long-run Outcomes to Teacher VA In Section 7.2, we referenced a method to link long-run outcomes with teacher VA proposed by Chetty et al. (2014b). In this appendix, for completeness, we describe the steps involved.
Paper not yet in RePEc: Add citation now
- Data on dropouts and graduations are available for school years 2011-12 through 2016-17. These data indicate whether students in the twelfth grade cohort of that year graduated or dropped out and so we restrict our data to cohorts in twelfth grade during this time period.61 We have a dropout or graduation record for 129,456 students â fifty percent of students from eligible cohorts.
Paper not yet in RePEc: Add citation now
Dicker, Lee H. and Sihai D. Zhao (2016), âHigh-dimensional classification via nonparametric empirical Bayes and maximum likelihood inference.â Biometrika, 103, 21â34.
- Efron, Bradley (2003), âRobbins, empirical Bayes and microarrays.â Annals of Statistics, 31, 366â 378.
Paper not yet in RePEc: Add citation now
- Efron, Bradley (2010), Large-scale Inference: Empirical Bayes methods for estimation, testing, and prediction. Cambridge University Press, Cambridge, UK.
Paper not yet in RePEc: Add citation now
- Efron, Bradley (2011), âTweedieâs formula and selection bias.â Journal of American Statistical Association, 106, 1602â1614.
Paper not yet in RePEc: Add citation now
- Evdokimov, Kirill and Halbert White (2012), âSome extensions of a lemma of Kotlarski.â Econometric Theory, 28, 925â932.
Paper not yet in RePEc: Add citation now
- Fan, Jianqing (1991), âOn the optimal rates of convergence for nonparametric deconvolution problems.
Paper not yet in RePEc: Add citation now
- Fletcher, Jason M., Leora I. Horwitz, and Elizabeth Bradley (2014), âEstimating the value added of attending physicians on patient outcomes.â Working Paper 20534, National Bureau of Economic Research, URL http://www.nber.org/papers/w20534.
Paper not yet in RePEc: Add citation now
- Goldhaber, Dan and Richard Startz (2017), âOn the distribution of worker productivity: The case of teacher effectiveness and student achievement.â Statistics and Public Policy, 4, 1â12.
Paper not yet in RePEc: Add citation now
- Goncalves, Felipe and Steven Mello (2018), âA few bad apples? Racial bias in policing.â URL https://static1.squarespace.com/static/58d9a8d71e5b6c72dc2a90f1/t/ 5cfe39c1db1f980001595d4d/1560164805693/GoncalvesMello.pdf. Unpublished.
Paper not yet in RePEc: Add citation now
- Gourieroux, Christian and Joann Jasiak (2020), âDynamic deconvolution of (sub)independent autoregressive sources.â URL http://www.jjstats.com/papers/dynamdec.pdf. Unpublished.
Paper not yet in RePEc: Add citation now
- Gu, Jiaying and Roger Koenker (2017a), âEmpirical Bayesball remixed: Empirical Bayes methods for longitudinal data.â Journal of Applied Econometrics, 32, 575â599.
Paper not yet in RePEc: Add citation now
- Gu, Jiaying and Roger Koenker (2017b), âUnobserved heterogeneity in income dynamics: An empirical Bayes perspective.â Journal of Business & Economic Statistics, 35, 1â16.
Paper not yet in RePEc: Add citation now
- Gu, Jiaying and Shu Shen (2017), âOracle and adaptive false discovery rate controlling methods for one-sided testing: Theory and application in treatment effect evaluation.â Econometrics Journal, 21, 11â35.
Paper not yet in RePEc: Add citation now
- Gu, Jiaying, Roger Koenker, and Stanislav Volgushev (2018), âTesting for homogeneity in mixture models.â Econometric Theory, 34, 850 â 895.
Paper not yet in RePEc: Add citation now
Guarino, Cassandra M., Michelle Maxfield, Mark D. Reckase, Paul N. Thompson, and Jeffrey M. Wooldridge (2015), âAn evaluation of empirical Bayesâs estimation of value-added teacher performance measures.â Journal of Educational and Behavioral Statistics, 40, 190â222.
- Hanushek, Eric A. (2009), âTeacher deselection.â In Creating a New Teaching Profession (Dan Goldhaber and Jane Hannaway, eds.), 165â180, Urban Institute Press, Washington, DC.
Paper not yet in RePEc: Add citation now
- Hanushek, Eric A. (2011), âThe economic value of higher teacher quality.â Economics of Education Review, 30, 466â479.
Paper not yet in RePEc: Add citation now
Heckman, James and Burton Singer (1984), âA method for minimizing the impact of distributional assumptions in econometric models for duration data.â Econometrica, 52, 271â320.
- Hull, Peter (2020), âEstimating hospital quality with quasi-experimental data.â URL https: //www.google.com/url?q=https%3A%2F%2Fwww.dropbox.com%2Fs%2Fhb54rrz3vte8gij% 2FRAM_012020.pdf%3Fraw%3D1&sa=D&sntz=1&usg=AFQjCNE-ap7RlsV8PsFpJ64ekwD2HmqIjQ. Unpublished.
Paper not yet in RePEc: Add citation now
- i yijt. The estimator commonly used in the literature is a method-ofmoments estimator proposed by Kane and Staiger (2008) under the additional assumption that αj â¼ N(0, Ï2 α). Specifically, they propose the following estimators for the variance parameters: ÏÌ2 α = c cov(yjt, yjtâ1) ÏÌ2 = b V (yijt) â ÏÌ2 α.
Paper not yet in RePEc: Add citation now
- In terms of demographics, we have information about parental education (six education groups, 1996-97 through 2005-06 only), economically disadvantaged status (1998-99 through 2010-11 only), ethnicity (six ethnic groups), gender, limited English status, disability status, academically gifted status and grade repetition. Besides missing data in some years for parental education and economically disadvantaged status, our demographic data cover over 99 percent of all student-year observations. When demographic information is missing, we create a missing indicator for that variable.
Paper not yet in RePEc: Add citation now
- Jackson, C. Kirabo (2018), âWhat do test scores miss? The importance of teacher effects on non-test score outcomes.â Journal of Political Economy, 126, 2072â2107.
Paper not yet in RePEc: Add citation now
Jacob, Brian A. and Lars Lefgren (2008), âCan principals identify effective teachers? Evidence on subjective performance evaluation in education.â Journal of Labor Economics, 26, 101â136.
- James, W. and Charles Stein (1961), âEstimation with quadratic loss.â In Proceedings of the Fourth Berkeley Symposium on Mathematical Statistics and Probability, Volume 1: Contributions to the Theory of Statistics, 361â379, University of California Press, Berkeley, Calif.
Paper not yet in RePEc: Add citation now
- Jiang, Wenhua (2020), âOn general maximum likelihood empirical Bayes estimation of heteroscedastic IID normal means.â Electronic Journal of Statistics, 14, 2272â2297.
Paper not yet in RePEc: Add citation now
- Jiang, Wenhua and Cun-Hui Zhang (2009), âGeneral maximum likelihood empirical Bayes estimation of normal means.â Annals of Statistics, 37, 1647â1684.
Paper not yet in RePEc: Add citation now
Kane, Thomas J. and Douglas O. Staiger (2008), âEstimating teacher impacts on student achievement: An experimental evaluation.â Working Paper 14607, National Bureau of Economic Research, URL http://www.nber.org/papers/w14607.
Kane, Thomas J., Jonah E. Rockoff, and Douglas O. Staiger (2008), âWhat does certification tell us about teacher effectiveness? Evidence from New York City.â Economics of Education Review, 27, 615â631.
- Kiefer, Jack and Jacob Wolfowitz (1956), âConsistency of the maximum likelihood estimator in the presence of infinitely many incidental parameters.â Annals of Mathematical Statistics, 27, 887â906.
Paper not yet in RePEc: Add citation now
Koedel, Cory, Kata Mihaly, and Jonah E. Rockoff (2015), âValue-added modeling: A review.â Economics of Education Review, 47, 180â195.
Koenker, Roger and Ivan Mizera (2014), âConvex optimization, shape constraints, compound decisions, and empirical Bayes rules.â Journal of the American Statistical Association, 109, 674â685.
- Koenker, Roger and Jiaying Gu (2017), âRebayes: An R package for empirical Bayes mixture methods.â Journal of Statistical Software, 82, 1â26.
Paper not yet in RePEc: Add citation now
- Kotlarski, Ignacy (1967), âOn characterizing the gamma and the normal distribution.â Pacific Journal of Mathematics, 20, 69â76.
Paper not yet in RePEc: Add citation now
- Laird, Nan (1978), âNonparametric maximum likelihood estimation of a mixing distribution.â Journal of the American Statistical Association, 73, 805â811.
Paper not yet in RePEc: Add citation now
- Laird, Nan M. and Thomas A. Louis (1987), âEmpirical Bayes confidence intervals based on bootstrap samples.â Journal of American Statistical Association, 82, 805â811.
Paper not yet in RePEc: Add citation now
- LAUSD: California High School Exit Examination (CAHSEE) data cover 2003-04 through 201415. As the CAHSEE is normally first taken in tenth grade, we keep cohorts who were in tenth grade during this time period.60 If a student took the CAHSEE multiple times, we take the score from the studentâs first sitting of the CAHSEE. We report the sum of CAHSEE scores from the mathematics and English sections and so CAHSEE scores range from 550 to 900. We have CAHSEE records for 184,128 students, covering sixty-seven percent of students from eligible cohorts.
Paper not yet in RePEc: Add citation now
- Li, Tong and Quang Vuong (1998), âNonparametric estimation of the measurement error model using multiple indicators.â Journal of Multivariate Analysis, 65, 139â165.
Paper not yet in RePEc: Add citation now
- Lindsay, Bruce G. (1995), Mixture Models: Theory, Geometry, and Applications. Conference Board of the Mathematical Sciences: NSF-CBMS regional conference series in probability and statistics, Institute of Mathematical Statistics.
Paper not yet in RePEc: Add citation now
- McLachlan, G.J. (1987), âOn bootstrapping likelihood ratio test statistics for the number of components in a normal mixture.â Journal of the Royal Statistical Society, Series C, 36, 318â324.
Paper not yet in RePEc: Add citation now
- North Carolina: Dropout data are available for school years 2003-04 through 2016-17. Given the majority of students drop out in tenth through twelfth grade, we ensure that our dropout data coverage starts in at least tenth grade for a given cohort65 and covers up to twelfth grade.66 Any student that has either not dropped out or moved out-of-state is coded as not being a high school dropout. We have dropout outcomes for 1,097,381 students.
Paper not yet in RePEc: Add citation now
- Other tests for normality are also possible. For instance, if α indeed follows a normal distribution N(0, Ï2 α), then the logarithm of its characteristic function takes the form log Ïα(t) = ât2 /Ï2 α , which implies that the first-order derivative with respect to t is of the form ât/Ï2 α, which is a linear function of t. Since the distribution of α is identified (as established in Theorem 1), we can construct a consistent estimator for Ïα(t) and inspect linearity of the derivative of its logarithm transformation. Another specification test has been proposed in Bonhomme and Weidner (2019). We leave to future research a power comparison involving these and other specification tests. H Computation Appendix In this brief appendix, we discuss the estimation of F using NPMLE.
Paper not yet in RePEc: Add citation now
- Petek, Nathan and Nolan Pope (2018), âThe multidimensional impact of teachers on students.â URL http://www.econweb.umd.edu/~pope/Nolan_Pope_JMP.pdf. Unpublished.
Paper not yet in RePEc: Add citation now
- Polyanskiy, Yury and Yihong Wu (2020), âSelf-regularizing property of nonparametric maximum likelihood estimator in mixture models.â arXiv preprint arXiv:2008.08244.
Paper not yet in RePEc: Add citation now
- Proof. Using equation (2.64) in Rao (1992), we have log Ïα(t) = iE[α]t + Z t 0 â âu log Ï(u, v) Ï(u, 0)Ï(0, v) u=0 dv. where i is the imaginary root. Using the fact that â âu log Ï(u, v) Ï(u, 0)Ï(0, v) u=0 = âÏ(0, v)/âu Ï(0, v) â âÏ(0, 0)/âu Ï(0, 0) and that âÏ(0,0)/âu Ï(0,0) = iE(Y1), we have log Ïα(t) = iE[α]t + Z t 0 âÏ(0, v)/âu Ï(0, v) dv â iE(Y1)t = Z t 0 âÏ(0, v)/âu Ï(0, v) dv, where the second equality holds because 1 has mean zero under Assumption 1. Additionally, under Assumptions 1- 3, we have Ï(u, v) = Ïα(u + v)Ï(u)Ï(v). Let u = 0, then Ï(v) = Ï(0, v)/Ïα(v); and letting v = 0, then Ï(v) = Ï(u, 0)/Ïα(u).
Paper not yet in RePEc: Add citation now
PSAT data cover school years 2008-09 through 2016-17. As the PSAT is normally taken in tenth grade, we keep cohorts who were in tenth grade during this time period.62 We convert PSAT scores from the 2015-16 and 2016-17 administrations using the concordance tables provided by the College Board so that all PSAT scores are reported on a 600-2400 scale.63 If a student is recorded as receiving multiple administrations of the PSAT, we take the score from the first PSAT test taken by the student. We have PSAT records for 209,675 students, covering fifty-one percent of students from eligible cohorts.
- PSAT data cover school years 2012-13 through 2016-17. As the PSAT is normally taken in tenth grade, we keep cohorts in tenth grade from 2012-13 through 2016-17.67 We convert PSAT scores from the 2015-16 and 2016-17 administrations using the concordance tables provided by the College Board, so that all PSAT scores are reported on a 600-2400 scale. If a student is recorded as receiving multiple administrations of the PSAT, we take the score from the first time the student took the PSAT test. We have PSAT records for 159,028 students, covering forty-four percent of students from eligible cohorts.
Paper not yet in RePEc: Add citation now
- Rao, B.L.S.P. (1992), Identifiability in Stochastic Models: Charaterization of Probability Distributions. Academic Press, United Kindom.
Paper not yet in RePEc: Add citation now
- Rivkin, Steven G., Eric A. Hanushek, and John F. Kain (2005), âTeachers, schools, and academic achievement.â Econometrica, 73, 417â458.
Paper not yet in RePEc: Add citation now
- Robbins, Herbert (1950), âA generalization of the method of maximum likelihood: Estimating a mixing distribution.â Annals of Mathematical Statistics, 21, 314â315.
Paper not yet in RePEc: Add citation now
- Robbins, Herbert (1956), âAn empirical Bayes approach to statistics.â In Proceedings of the Third Berkeley Symposium on Mathematical Statistics and Probability, volume I, University of California Press, Berkeley.
Paper not yet in RePEc: Add citation now
Rockoff, Jonah E. (2004), âThe impact of individual teachers on student achievement: Evidence from panel data.â American Economic Review, 94, 247â252.
- Rothstein, Jesse (2017), âMeasuring the impacts of teachers: Comment.â American Economic Review, 107, 1656â84.
Paper not yet in RePEc: Add citation now
- Saha, Sujayam and Adityanand Guntuboyina (2020), âOn the nonparametric maximum likelihood estimator for gaussian location mixture densities with application to gaussian denoising.â Annals of Statistics, 48, 738â762.
Paper not yet in RePEc: Add citation now
- SAT data cover school years 2006-07 through 2016-17. As the SAT is usually taken in the eleventh or twelfth grade, we keep cohorts who attended both grades during the time period covered 60 We therefore drop fifth grade after 2009-10, fourth grade after 2008-09, and third grade after 2007-08. 61 We therefore drop fifth grade after 2008-09, fourth grade after 2007-08, and third grade after 2006-07. 62 We therefore drop fifth grade after 2011-12, fourth grade after 2010-11, and third grade after 2009-10. 63 Available at https://collegereadiness.collegeboard.org/pdf/2015-psat-nmsqt-concordance-tables.pdf.
Paper not yet in RePEc: Add citation now
- Suspension data are available from 2000-01 through 2016-17, although data from 2004-05 are missing. Here, we keep all data, since the suspension data cover a majority of high school years for all cohorts. We find the total number of days of out-of-school suspensions for each student occurring in middle or high school (grades 6-12). We top-code the number of days suspended in a school year at ten days. If we do not find the student in the suspension files, we assume that the student has never been suspended. Thus we have suspension outcomes for our full value-added sample of 1,386,555 students.
Paper not yet in RePEc: Add citation now
- Suspension data are available from 2003-04 through 2016-17. We restrict data to 2003-04 through 2009-10 to allow for sufficient time to elapse for students to receive suspensions after a student is taught by a given teacher. We calculate the total number of days of out-of-school suspensions for each student occurring in middle or high school (grades 6-12). If we do not find the student in the suspension files, we assume the student has never been suspended. Of note, the LAUSD embarked on an ambitious policy to eliminate âwilful defianceâ suspensions, causing a large drop (almost seventy-five percent) in suspension rates in the 2010s, creating a low rate of suspension in the LAUSD data. We have suspension outcomes for our full value-added sample of 426,074 students from 2003-04 through 2009-10.
Paper not yet in RePEc: Add citation now
- van de Geer, Sara (1993), âHellinger-consistency of certain nonparametric maximum likelihood estimators.â Annals of Statistics, 14â44.
Paper not yet in RePEc: Add citation now
- via the NPMLE of F performs similarly to the infeasible NPB estimator δNPB j defined in (2.6). It is surprising that the proposed estimator achieves this close approximation to the infeasible estimator, given the well-known fact that the NPMLE of F has a slow convergence rate (Fan, 1991). The key reason is that the nonparametric Bayes rule is a smooth functional of F, which can be estimated at a much better rate than the distribution F itself.
Paper not yet in RePEc: Add citation now
- We follow Chetty et al. (2014a,b) and parametrize the control function for lagged test scores f1g(yi,tâ1) with a cubic polynomial in prior-year scores in mathematics and English and interact these cubics with the studentâs grade level. When prior test scores in the other subject are missing, we set the other subject prior score to zero and include an indicator for missing data in the other subject interacted with the controls for prior own-subject test scores. We parametrize the control function for teacher experience f2(ej(i,g,t)) using dummies for years of experience from 0 to 5, the omitted group being teachers with 6 or more years of experience. The student-level control vector Xigt consists of the respective demographic variables in each dataset.
Paper not yet in RePEc: Add citation now
- We report the size and power performance of the proposed parametric bootstrap test in Table E.1 below, based on the following data generating process: Fix the sample size at n = 1000, and for a grid values of h â {0, 0.4, 0.6, 0.8, 1}, sample individual αjâs from the following three-component normal distribution: 0.025N(âh, θh) + 0.95N(0, θh) + 0.025N(h, θh) with θh = 0.1â0.05h2. The design of θh is such that the variance of α is always 0.1; this is roughly the variance of the teacher effects in the LAUSD data. When h = 0, the latent effect αj follows a normal distribution, and the bootstrap test should reject with probability equal to nominal size.
Paper not yet in RePEc: Add citation now
- We then make several sample restrictions. First, we drop 100,000 student-year observations that cannot be matched to a teacher. Second, we drop 180,000 observations where we lack data on teacher experience; the data we drop here are over-represented in early years since we only have teacher experience data from 2007-08 onwards.58 Third, we only include classes with more than seven but fewer than forty students with valid current and lagged test scores in that subject, losing 11,000 observations. Fourth, we exclude 70,000 observations that lack a valid current or lagged test score in that subject.59 Our final sample is roughly 1.3 million student-year observations, covering roughly 660,000 million students and 11,000 teachers. Constructing Value-Added: With both samples in hand, we construct VA estimates for each teacher by running the following regression: yigt = f1g(yi,tâ1) + f2(ej(i,g,t)) + Ï1Xigt + Ï2XÌc(i,g,t) + vj + igt .
Paper not yet in RePEc: Add citation now