Regresión discontinua para análisis de causalidad con diseños de muestreo complejos
| dc.contributor.advisor | Trujillo Oyola, Leonardo | spa |
| dc.contributor.author | Arteaga Gutiérrez, Natalia Ximena | spa |
| dc.date.accessioned | 2020-03-06T13:52:44Z | spa |
| dc.date.available | 2020-03-06T13:52:44Z | spa |
| dc.date.issued | 2019-07-19 | spa |
| dc.description.abstract | En este trabajo titulado “Regresión discontinua para análisis de causalidad con diseños de muestreo complejos”, se hace la propuesta de un nuevo estimador teniendo en cuenta los pesos de muestreo para regresión discontinua, el cual es un diseño cuasi-experimental usado en evaluación de impacto para la estimación del ATE Average Treatment Effect, que permite contestar a la pregunta de ¿cuál habría sido el resultado de los participantes en ausencia del programa/intervención/política?. Para la metodología de estimación se asume que se tienen dos fuentes de variabilidad: i) la del modelo poblacional y ii) la distribución muestral inducida por la selección aleatoria de la muestra, como lo exponen las metodologías disponibles en la literatura. La regresión discontinua requiere que en el proceso de selección de los beneficiarios o focalización de la intervención, programa o política se haya utilizado una variable continua Z, la cual permite organizar a la población de acuerdo con los resultados del indicador. Un ejemplo de esta variable en el caso colombiano es el puntaje Sisbén III, el cual organiza a la población de la menos vulnerable (100 puntos) a la más vulnerable (0 puntos). Al definir un punto de corte para este indicador se define la focalización de los programas sociales del gobierno. De acuerdo con el comportamiento del grupo tratamiento y control con respecto a la variable Z, se determinar´a el tipo de regresión discontinua: i) regresión discontinua nítida, donde la variable Z determina completamente la participación en la intervención y ii) la regresión discontinua difusa, donde la variable Z determina la probabilidad de participar en la intervención. Esta metodología cuenta con dos enfoques para su estimación: paramétrico y no paramétrico, que son estudiados para la inclusión de los pesos de muestreo en la regresión discontinua. En este trabajo se presentan las expresiones matemáticas de los estimadores con pesos muestrales, la implementación y desarrollo del paquete en R llamado rddsw con las funciones para la aplicación de los estimadores encontrados, las simulaciones donde se muestra el comportamiento de los estimadores con y sin pesos de muestreo y finalmente una aplicación con los datos de la evaluación de impacto del programa Ser Pilo Paga de Colombia. | spa |
| dc.description.abstract | In this work titled “Regression discontinuity for causal analysis with complex survey designs”, the proposal of a new estimator is made for the design of regression discontinuity, which is part of the methodologies used in impact evaluation for the estimation of the ATE (Average Treatment Effect) that allows to answer the question of “What would have been the result of the participants in the absence of program / intervention/ politics?”, taking into account the sampling weights. For the estimation methodology it is assumed that there are two sources of variability: i) population model ii) and the sampling distribution induced by the random selection of the sample, as stated methods available in the literature. The regression discontinuity requires that in the beneficiary selection process or in the targeting of the intervention, program or policy, a continuous variable Z has been used, which allows the population to be organized according to the results of the indicator. An example of this variable in the Colombian case is the Sisben III, which organizes the population from the least vulnerable (100 points) to the most vulnerable (0 points). Defining a cut-off point for this indicator, it defines the focus of the government’s social programs. According to the behavior of the treatment and control group with respect to the variable Z, the type of regression discontinuity will be determined: i) sharp regression discontinuity, where variable Z completely determines the participation in the intervention and ii) diffuse regression discontinuity, where the variable Z determines the probability of participating in the intervention. This methodology has two approaches to its estimation: parametric and non-parametric, which are studied for the inclusion of sampling weights in regression discontinuity. This paper presents the mathematical expressions of the estimators with sample weights, the implementation and development of the package in R called rddsw with the functions for the application of the estimators found, simulations showing the behavior of the estimators with and without sampling weights and finally an application with the data of the impact evaluation of the Ser Pilo Paga program in Colombia. | spa |
| dc.description.additional | Magíster en Ciencias - Estadística | spa |
| dc.description.degreelevel | Maestría | spa |
| dc.format.extent | 166 | spa |
| dc.format.mimetype | application/pdf | spa |
| dc.identifier.citation | Arteaga-Gutiérrez, N. (2019), Regresión discontinua para análisis de causalidad con diseños de muestreo complejos. | spa |
| dc.identifier.uri | https://repositorio.unal.edu.co/handle/unal/75917 | |
| dc.language.iso | spa | spa |
| dc.publisher.branch | Universidad Nacional de Colombia - Sede Bogotá | spa |
| dc.publisher.department | Departamento de Estadística | spa |
| dc.relation.references | Aftab, N. & Chand, S. (2016), ‘A new heteroskedastic consistent covariance matrix estimator using deviance measure’, Pakistan Journal of Stastics and Operation Research 12(2), 283. | spa |
| dc.relation.references | Avery, M. (2010), ‘Literature review for local polynomial regression’, Semantic Scholar | spa |
| dc.relation.references | Barnow, B. S. (1972), ‘Conditions for the presence or absence of a bias. in treatment effect: Some statistical models for head start evaluation, discussion paper no. 129’, University of Wisconsin-Madison . | spa |
| dc.relation.references | Bellhouse, D. R., Goia, C. M. & Stafford, J. E. (2003), Analysis of Survey Data, Wiley | spa |
| dc.relation.references | Bellhouse, D. & Stafford, J. (2001), ‘Local polynomial regression in complex surveys’, Statistics Canada 27, 197–203. | spa |
| dc.relation.references | Bernal, R. & Peña, X. (2011), Guía Práctica para la Evaluación de Impacto, Universidad de los Andes, Facultad de Economía, Centro de Estudios sobre Desarrollo Económico. | spa |
| dc.relation.references | Bethlehem, J. (2009), ‘The rise of survey sampling’, Sttistics Netherlands . | spa |
| dc.relation.references | Binder, D. (1983), ‘On the variances of asymptotically normal estimators from complex surveys’, International Statistical Review 51, 279–292. | spa |
| dc.relation.references | Binder, D. A. (2011), ‘Estimating model parameters from a complex survey under a model-design randomization framework’, Pakistan Journal of Statistics 27(4), 371– 390. | spa |
| dc.relation.references | Binder, D. A. & Roberts, G. (2009), ‘Design and model based inference for model parameters’, Sample Surveys: Inference and Analysis 29B. | spa |
| dc.relation.references | Breidt, F. & Opsomer, J. D. (2000), ‘Local polynomial regression estimators in survey sampling’, The Annals of Statistics 28, 1026–1053. | spa |
| dc.relation.references | Calonico, S., Cattaneo, M. D. & Titiunik, R. (2014), ‘Robust data-driven inference in the regression-discontinuity design’, Stata Journal 14(4) pp. 1–36. | spa |
| dc.relation.references | Chambers, R. (2003), Analysis of Survey Data, Wiley | spa |
| dc.relation.references | Chambers, R. L. & Skinner, C. J., eds (2003), Analysis of Survey Data, Wiley. | spa |
| dc.relation.references | Davidson, R. & MacKinnon, J. (1998), ‘Graphical methods for investigating the size and power of hypothesis tests’, The Manchester School . | spa |
| dc.relation.references | Departamento Nacional de Planeación (2019), ‘Sisbén 3’. https://www.Sisben.gov.co/Sisben/Paginas/Que-es.aspx. | spa |
| dc.relation.references | Fan, J. & Gijbels, I. (1996), Local Polynomial Modeling and Its Applications, Chapman and Hall | spa |
| dc.relation.references | Goldberger, A. S. (1972a), ‘Selection bias in evaluating treatment effect: Some formal illustrations, discussion paper no. 123’, University of Wisconsin-Madison . | spa |
| dc.relation.references | Goldberger, A. S. (1972b), ‘Selection bias in evaluating treatment effect: Some formal illustrations, discussion paper no. 129’, University of Wisconsin-Madison . | spa |
| dc.relation.references | Groves, R. M., Fowler, F. J., Couper, M. P., Lepkowski, J. M., Singer, E. & Tourangeau, R. (2004), Survey Methodology, Wiley. | spa |
| dc.relation.references | Gutiérrez, H. A. (2009), Estrategias de Muestreo: Diseño de Encuestas y Estimación de Parámetros, Editorial de la Universidad Santo Tomás. | spa |
| dc.relation.references | Hafen, R. (2010), ‘Local regression models: Advancements, applications, and new methods’. | spa |
| dc.relation.references | Hahn, J., P. T. & Van de Klaauv, W. (1999), ‘Evaluating the effect of an antidiscrimination law using a regression-discontinuity design’, Technical report, National Bureauos Economic Research . | spa |
| dc.relation.references | Hinkley, D. (1977), ‘Jackknifing in unbalanced situations’, Technometrics 19(3), 285– 292. | spa |
| dc.relation.references | Horn, S., Horn, R. & Duncan, D. (1975), ‘Estimating heteroskedastic variances in linear models’, Journal of the American Statistical Association 65(329), 161–172 | spa |
| dc.relation.references | Imbens, G. & Kalyanaraman, K. (2009), ‘Optimal bandwidth choice for the regression discontinuity estimator’, NBER Working Paper Series . | spa |
| dc.relation.references | Imbens, G. & Lemieux, T. (2007), ‘Regression discontinuity designs: a guide to practice’, NBER Working Paper Series . | spa |
| dc.relation.references | Jacob, R. & Zhu, P. (2012), ‘A practical guide to regression discontinuity’, Technical report, mdrc . | spa |
| dc.relation.references | Laplace, P. (1812), ‘Th´eorie analytique des probabilit´es. oevres completes’ | spa |
| dc.relation.references | Lee, D. (2008), ‘Randomized experiments from nom-random selection in u.s. house elections’, Journal Econometrics 142(2), 675–697 | spa |
| dc.relation.references | Lee, D. & Lemieux, T. (2010), ‘Regression discontinuity designs in economics’, Journal of economics literature 48, 281–355 | spa |
| dc.relation.references | Lehtonen, R. & Pahkinen, E. (2003), Practical Methods for Design and Analysis of Complex Surveys, 2 edn, New York: Wiley. | spa |
| dc.relation.references | Lumley, T. (2010), Complex Surveys: A Guide to Analysis using R, Wiley. | spa |
| dc.relation.references | Lumley, T. & Scott, A. (2016), ‘Fitting regression models to survey data’, Statistical Sciences . | spa |
| dc.relation.references | Molina, E., Smith, T. & Sugden, R. (2001), ‘Modelling overdispersion for complex survey data’, Int. Statist. Rev. 69, 373–384. | spa |
| dc.relation.references | Prosperidad Social (2017), ‘Programa jóvenes en acción’. http://www.dps.gov.co/que/jov/Paginas/Requisitos.aspx. | spa |
| dc.relation.references | Rubin, D. B. (1977), ‘Assignment to treatment group on the basis of a covariate’, Journal of Educational and Behavioral statistics 2(1), 1–26. | spa |
| dc.relation.references | Rubin, D. B. (1978), ‘Bayesian inference for causal effects: The role of randomization’, The Annals of Statistics pp. 34–58. | spa |
| dc.relation.references | Rubin, D. B. (2005), ‘Causal inference using potencial outcomes: Design, modeling, decisions’, Journal of the American Statistical Association 100(469), 322–331. | spa |
| dc.relation.references | Shalabh (2015), ‘Lecture notes polynomial regression models’. | spa |
| dc.relation.references | Sudgen, R. A. & Smith, T. M. F. (1984), ‘Ignorable and informative designs in survey sampling inference’, Biometrika 71, 495–506. | spa |
| dc.relation.references | Särndal, C., Swensson, B. & Wretman, J. (1992), Model Assisted Survey Sampling, Springer, New York. | spa |
| dc.relation.references | Trochim, W. & Spiegelman, C. (1980), ‘The relative assignment variable approach to selection bias in pretest-posttest group designs’, Proceedings of the Survey Research Section pp. 376–380. | spa |
| dc.relation.references | UT Centro Nacional de Consultoría - Universidad de los Andes (2016), ‘Evaluación ser pilo paga’. https://colaboracion.dnp.gov.co/CDT/Sinergia/Documentos/ Ser Pilo Paga Informe Final CM.pdf. | spa |
| dc.relation.references | UT Econometría-SEI (2017), ‘Evaluación jóvenes en acción’. http://www.dps. gov.co/inf/doc/Jvenes%20en%20Accin/Informe%20fina%20de%20la%20la% 20Evaluacio%CC%81n%20de%20Impacto%20del%20Programa%20Jo%CC%81venes% 20en%20Accio%CC%81n-2017.pdf. | spa |
| dc.relation.references | Westergaard, H. (1932), ‘Contributions to the history of statistics’, 42(168), 645–647. | spa |
| dc.relation.references | White, H. (1980), ‘A heteroskedasticity-consistent covariance matrix estimator and a direct test for heteroskedasticity’, Econometrica: Journal of the Econometric Society . | spa |
| dc.relation.references | White, H. & Sabarwal, S. (2014), ‘Dise˜no y m´etodos cuasiexperimentales’, Technical report 8. | spa |
| dc.relation.references | Wooldridge, J. (2001), Econometric Analysis of Cross Section and Panel Data, The MIT Press. | spa |
| dc.relation.references | Zhang, H. & Gutiérrez, A. (2010), Teoría estadística: Aplicaciones y Métodos, Editorial de la Universidad Santo Tomás | spa |
| dc.relation.references | Zhelonkin, M., Genton, M. & Ronchtti, E. (2011), ‘On the robustness of two-stage estimators’, Statistics and probabilitty letters . | spa |
| dc.rights | Derechos reservados - Universidad Nacional de Colombia | spa |
| dc.rights.accessrights | info:eu-repo/semantics/openAccess | spa |
| dc.rights.license | Atribución-NoComercial-SinDerivadas 4.0 Internacional | spa |
| dc.rights.spa | Acceso abierto | spa |
| dc.rights.uri | http://creativecommons.org/licenses/by-nc-nd/4.0/ | spa |
| dc.subject.ddc | Colecciones de estadística general | spa |
| dc.subject.proposal | Discontinuity regression | eng |
| dc.subject.proposal | Regresión discontinua | spa |
| dc.subject.proposal | Pseudolikelihood | eng |
| dc.subject.proposal | Pseudoverosimilitud | spa |
| dc.subject.proposal | ATE | eng |
| dc.subject.proposal | ATE | spa |
| dc.subject.proposal | LATE | eng |
| dc.subject.proposal | LATE | spa |
| dc.subject.proposal | Complex survey data | eng |
| dc.subject.proposal | Diseños de muestreo complejos | spa |
| dc.title | Regresión discontinua para análisis de causalidad con diseños de muestreo complejos | spa |
| dc.title.alternative | Regression discontinuity for causal analysis with complex survey designs | spa |
| dc.type | Trabajo de grado - Maestría | spa |
| dc.type.coar | http://purl.org/coar/resource_type/c_bdcc | spa |
| dc.type.coarversion | http://purl.org/coar/version/c_ab4af688f83e57aa | spa |
| dc.type.content | Text | spa |
| dc.type.driver | info:eu-repo/semantics/masterThesis | spa |
| dc.type.version | info:eu-repo/semantics/acceptedVersion | spa |
| oaire.accessrights | http://purl.org/coar/access_right/c_abf2 | spa |

