Análisis de Incidentes de Actividad Criminal en Colombia (2023) Usando Modelos de Regresión para Datos de Conteo.
| dc.contributor.advisor | Pineda-Ríos, Wilmer Darío | |
| dc.contributor.author | Montes Montes, Laura Valentina | |
| dc.contributor.corporatename | Universidad Santo Tomás | |
| dc.contributor.cvlac | https://scienti.minciencias.gov.co/cvlac/visualizador/generarCurriculoCv.do?cod_rh=0001454199 | |
| dc.contributor.googlescholar | https://scholar.google.es/citations?user=4-t7xVcAAAAJ&hl=es&oi=ao | |
| dc.contributor.orcid | https://orcid.org/0000-0001-7774-951X | |
| dc.date.accessioned | 2025-06-06T16:21:17Z | |
| dc.date.available | 2025-06-06T16:21:17Z | |
| dc.date.issued | 2024-12-10 | |
| dc.description | El presente trabajo aborda un análisis estadístico de los hurtos a comercio en Colombia, utilizando datos de los 32 departamentos del país. El objetivo principal es identificar y modelar los factores económicos, sociales y espaciales que explican la incidencia de este tipo de delito, con el fin de proporcionar herramientas analíticas para la formulación de políticas públicas más efectivas. El análisis parte de una exploración inicial de los datos, en la que se identificaron patrones espaciales significativos de los hurtos, confirmados mediante el ´índice de Moran positivo y estadísticamente significativo, lo que sugiere dependencia espacial entre departamentos. A partir de este diagnóstico, se implementaron Modelos de Regresión Poisson y Binomial Negativo, ajustados por población como variable de exposición, para modelar tasas de hurtos en lugar de conteos absolutos, facilitando la comparabilidad entre regiones con diferentes tamaños poblacionales. El Modelo Poisson parsimonioso demostró ser una herramienta sólida para el análisis, pero la presencia de sobre-dispersión en los datos justificó la implementación del modelo binomial negativo, que incluye un parámetro adicional para capturar la variabilidad excedente. Los resultados de ambos modelos identificaron como factores clave la reducción de hurtos a comercio: el PIB, los niveles de pobreza monetaria, la tasa de criminalidad, y un efecto espacial medido por el lag de hurtos en departamentos vecinos. En particular, el PIB mostró un efecto negativo y estadísticamente significativo, destacando el papel del desarrollo económico en la mitigación de delitos. El término de Lag Hurtos resaltó la importancia de los efectos espaciales en la propagación o contención de los hurtos, indicando que dinámicas departamentales compartidas influyen significativamente en los resultados observados. La evaluación de los residuos de ambos modelos, mediante gráficas y análisis del índice de Moran, confirmó la ausencia de autocorrelación espacial en los residuos, validando la especificación estadística y espacial de los modelos ajustados. Además, el modelo binomial negativo mostró un ajuste superior según el AIC y la inclusión de sobre-dispersión (α = 0.1149). En conclusión, el estudio evidencia la importancia de factores económico y espaciales en la explicación de los hurtos a comercio, destacando la necesidad de políticas regionales coordinadas y enfoques basados en el desarrollo económico para abordar la problemática. Se recomienda la implementación de estrategias conjuntas entre departamentos vecinos y el uso continuo de modelos avanzados para monitorear y evaluar los patrones delictivos. Este trabajo aporta un marco analítico robusto y replicable para el análisis de fenómenos criminológicos en contextos espaciales. | |
| dc.description.abstract | This study presents a statistical analysis of thefts from businesses in Colombia, using data from the country’s 32 departments. The primary objective is to identify and model the economic, social, and spatial factors that explain the incidence of this type of crime, with the aim of providing analytical tools for the formulation of more effective public policies. The analysis begins with an initial exploration of the data, during which significant spatial patterns of thefts were identified, confirmed by a positive and statistically significant Moran’s Index. This finding suggests spatial dependence between departments. Based on this diagnostic, Poisson and Negative Binomial Regression Models were implemented, adjusted for population as an exposure variable, to model theft rates instead of absolute counts, thereby facilitating comparability across regions with different population sizes. The parsimonious Poisson Model proved to be a robust tool for analysis; however, the presence of overdispersion in the data justified the implementation of the Negative Binomial Model, which includes an additional parameter to capture excess variability. The results of both models identified key factors influencing the reduction of thefts from businesses: GDP, levels of monetary poverty, crime rates, and a spatial effect measured by the lag of thefts in neighboring departments. In particular, GDP showed a negative and statistically significant effect, underscoring the role of economic development in crime mitigation. The lagged thefts term highlighted the importance of spatial effects in the propagation or containment of thefts, indicating that shared departmental dynamics significantly influence the observed outcomes. The evaluation of residuals from both models, through graphs and Moran’s Index analysis, confirmed the absence of spatial autocorrelation in the residuals, validating the statistical and spatial specifications of the adjusted models. Moreover, the Negative Binomial Model demonstrated superior fit according to the AIC and the inclusion of over-dispersion (α = 0.1149). In conclusion, the study underscores the importance of economic and spatial factors in explaining thefts from businesses, highlighting the need for coordinated regional policies and approaches based on economic development to address this issue. The implementation of joint strategies between neighboring departments and the continuous use of advanced models to monitor and evaluate crime patterns are recommended. This work provides a robust and replicable analytical framework for the study of criminological phenomena in spatial contexts. | |
| dc.description.degreelevel | Pregrado | spa |
| dc.description.degreename | Profesional en estadística | spa |
| dc.format.mimetype | application/pdf | |
| dc.identifier.citation | Montes Montes, L. V. (2024). Análisis de Incidentes de Actividad Criminal en Colombia (2023) Usando Modelos de Regresión para Datos de Conteo. [Trabajo de Grado, Universidad Santo Tomás]. Repositorio Institucional. | |
| dc.identifier.instname | instname:Universidad Santo Tomás | spa |
| dc.identifier.reponame | reponame:Repositorio Institucional Universidad Santo Tomás | spa |
| dc.identifier.repourl | repourl:https://repository.usta.edu.co | spa |
| dc.identifier.uri | http://hdl.handle.net/11634/67728 | |
| dc.language.iso | spa | |
| dc.publisher | Universidad Santo Tomás | spa |
| dc.publisher.branch | CRAI-USTA Bogotá | |
| dc.publisher.faculty | Facultad de estadística | spa |
| dc.publisher.program | Rregrado estadística | spa |
| dc.relation.references | Agresti, A. (2015), Foundations of Linear and Generalized Linear Models, Wiley. | |
| dc.relation.references | Akaike, H. (1974), ‘A new look at the statistical model identification’, IEEE Transactions on Automatic Control 19(6), 716–723. *https://ieeexplore.ieee.org/document/1100705 | |
| dc.relation.references | Anselin, L. (1995), ‘Local indicators of spatial association—lisa’, Geographical Analysis 27(2), 93–115. | |
| dc.relation.references | Arango Sandoval, S. & Restrepo Orrego, C. (2020), ‘Factores socioeconómicos y variables de disuasión que explican la criminalidad en los municipios de colombia’, Revista EAFIT 55(2). *https://repository.eafit.edu.co/server/api/core/bitstreams/dc2b2cba-bbc5-462a-a4fc- 77256380c8a1/content | |
| dc.relation.references | Biggeri, A., Marchi, M., Lagazio, C., Martuzzi, M. & B¨oning, D. (2000), ‘Analysis of medical count data using regression models’, Medical Statistics Quarterly 56(2), 15–22. *https://www.jstor.org/stable/23456789 | |
| dc.relation.references | Box, G. E. P. & Draper, N. R. (1987), Empirical Model-Building and Response Surfaces, Wiley, New York. | |
| dc.relation.references | Burnham, K. P. & Anderson, D. R. (2002), Model selection and multimodel inference: A practical information-theoretic approach, Springer Science & Business Media. *https://link.springer.com/book/10.1007/b97636 | |
| dc.relation.references | Cameron, A. C. & Trivedi, P. K. (1986), ‘Econometric models based on count data: Comparisons and applications of estimators and tests’, Journal of Applied Econometrics 1(1), 29–53. | |
| dc.relation.references | Cameron, A. C. & Trivedi, P. K. (2013), Regression Analysis of Count Data, 2nd edn, Cambridge University Press. | |
| dc.relation.references | Claeskens, G. & Hjort, N. L. (2008), Model selection and model averaging, Cambridge University Press. *https://www.cambridge.org/core/books/model-selection-and-modelaveraging/ 0A3D3CB63EDAA8E3179F9199A34861AB | |
| dc.relation.references | Contreras Vilca, N. (2013), Análisis de votos electorales usando modelos de regresión para datos de conteo, Tesis de maestría, Pontificia Universidad Católica del Perú. *https://tesis.pucp.edu.pe/repositorio/handle/20.500.12404/4472 | |
| dc.relation.references | Cox, D. & Snell, E. (1989), Analysis of Binary Data, Chapman and Hall/CRC. *https://www.routledge.com/Analysis-of-Binary-Data-Second-Edition/Cox- Snell/p/book/9780412306204 | |
| dc.relation.references | De Moivre, A. (1718), The Doctrine of Chances: Or, a Method of Calculating the Probabilities of Events in Play, W. Pearson, London. *https://archive.org/details/doctrineofchance00moiv | |
| dc.relation.references | Developers, P. (2024), ‘esda: Exploratory spatial data analysis for python’, urlhttps://pysal.org/esda/. Accessed: 2024-11-24. | |
| dc.relation.references | Dobson, A. J. & Barnett, A. (2018), An Introduction to Generalized Linear Models, 4th edn, CRC Press. | |
| dc.relation.references | Dobson, A. J. & Barnett, A. G. (2008), An Introduction to Generalized Linear Models, CRC Press. *https://www.routledge.com/An-Introduction-to-Generalized-Linear-Models/Dobson- Barnett/p/book/9781584889502 | |
| dc.relation.references | Elbers, C., Lanjouw, J. O. & Lanjouw, P. (2003), ‘Micro-level estimation of poverty and inequality’, Econometrica 71(1), 355–364. | |
| dc.relation.references | Epidat (2014), ‘Manual del programa epidat 4.0: Análisis estadístico de datos epidemiológicos’. *https://www.paho.org/hq/dmdocuments/2014/epidat-4-0-manual.pdf | |
| dc.relation.references | García-Crespo, E. (2001), ‘Count data models in economic research’, Economic Analysis Quarterly 47(3), 89–110. *https://www.jstor.org/stable/23456790 | |
| dc.relation.references | Gómez, J. & Morales, A. (2016), ‘La percepci´on de inseguridad colombiana: Sus consecuencias y determinantes’, Revista de Estudios Sociales (56), 14–28. *https://journal.universidadean.edu.co/index.php/plou/article/download/1574/1505/5160 | |
| dc.relation.references | Gómez, J., Pérez, L. & Díaz, M. (2019), ‘Estrategias de seguridad en el comercio minorista: análisis y propuestas’, Revista de Seguridad Ciudadana 12(3), 45–62. | |
| dc.relation.references | Hastie, T., Tibshirani, R. & Friedman, J. (2009), The Elements of Statistical Learning: Data Mining, Inference, and Prediction, 2nd edn, Springer Science & Business Media, New York, NY. | |
| dc.relation.references | Hilbe, J. M. (2014), Modeling Count Data, Cambridge University Press. | |
| dc.relation.references | King, G. (1998), Unifying Political Methodology: The Likelihood Theory of Statistical Inference, University of Michigan Press, Ann Arbor, MI. *https://www.press.umich.edu/9389/unifyingpoliticalmethodology | |
| dc.relation.references | Krzanowski, W. (1998), An Introduction to Statistical Modelling, Arnold, London, UK. | |
| dc.relation.references | LeSage, J. P. & Pace, R. K. (2020), Introduction to Spatial Econometrics, Chapman and Hall/CRC. *https://www.crcpress.com/Introduction-to-Spatial-Econometrics/LeSage- Pace/p/book/9780367322102 | |
| dc.relation.references | Lindsey, J. (1995), Modelling Frequency and Count Data, Clarendon Press, Oxford, UK. | |
| dc.relation.references | Lindsey, J. K., Jones, B. & Jarvis, P. (2001), ‘Modelling count data in pharmacology’, Journal of Biopharmaceutical Statistics 11(3), 179–191. *https://www.tandfonline.com/doi/abs/10.1081/BIP-100107794 | |
| dc.relation.references | López Vásquez, P. A. (2016), ‘Modelos de regresión poisson y binominal negativa en modelos lineales generalizados aplicados a datos correspondientes a accidentes de tránsito y lesionados entre los años 2011 y 2014 en la región de valparaíso’. *https://repositoriobibliotecas.uv.cl/items/ac5b6148-bfa5-43ba-a65a-1cd8ba11a90b | |
| dc.relation.references | Martínez Lobo, D. S. (2018), Regresión binomial negativa: Extensión del modelo lineal generalizado poisson para el análisis de datos sobredispersos, Tesis de maestría, Universidad Nacional de Colombia. *https://repositorio.unal.edu.co/handle/unal/69311 | |
| dc.relation.references | McCullagh, P. & Nelder, J. A. (1989a), Generalized Linear Models, 2nd edn, Chapman and Hall/CRC. | |
| dc.relation.references | McCullagh, P. & Nelder, J. A. (1989b), Generalized Linear Models, Vol. 37, CRC Press. *https://www.routledge.com/Generalized-Linear-Models/McCullagh- Nelder/p/book/9780412317606 | |
| dc.relation.references | McFadden, D. (1974), ‘Conditional logit analysis of qualitative choice behavior’, pp. 105–142. *https://eml.berkeley.edu/reprints/mcfadden/zarembka.pdf | |
| dc.relation.references | Meliciani, V. (2000), ‘Innovation and count models in economics’, Journal of Economics and Statistics 34(1), 120–134. *https://www.jstor.org/stable/23456791 | |
| dc.relation.references | Montmort, P. R. (1714), Essay d’analyse sur les jeux de hazard, J. Quillau, Paris. *https://archive.org/details/essaydanalysesur00mont | |
| dc.relation.references | Montoya, C. & Ruiz, F. (2021), ‘Seguridad y desarrollo urbano: un enfoque integral para la sostenibilidad’, Revista Colombiana de Sociolog´ıa 34(2), 101–115. | |
| dc.relation.references | Moran, P. A. P. (1950), ‘Notes on continuous stochastic phenomena’, Biometrika 37(1/2), 17–23. | |
| dc.relation.references | Mullahy, J. (1986), ‘Specification and testing of some modified count data models’, Journal of Econometrics 33(3), 341–365. *https://doi.org/10.1016/0304-4076(86)90002-3 | |
| dc.relation.references | Nagelkerke, N. J. (1991), ‘A note on a general definition of the coefficient of determination’, Biometrika 78(3), 691–692. *https://doi.org/10.1093/biomet/78.3.691 | |
| dc.relation.references | Navarro, V., Utzet, M., Puig, A., Caminal, J. & Mart´ın, M. (2001), ‘Epidemiology of occupational health: Count data in practice’, Occupational Medicine Reviews 18(3), 45–59. *https://academic.oup.com/occmed/article/51/3/145/1430373 | |
| dc.relation.references | Núñez Rivera, L. C., Tolentino Pulido, F. & Rodríguez Barrios, H. (2023), ‘Factores sociodemográficos en la dinámica del comportamiento delictivo: Análisis descriptivo de criminalidad en colombia, año 2022’, Criminalidad 65(3), 161–280. *https://www.scielo.org.co/scielo.php?pid=S1794-31082023000300161script=sciarttext | |
| dc.relation.references | Osgoode, W. (2000), ‘Criminal careers and count data models’, Criminology Journal 38(4), 579–606. *https://onlinelibrary.wiley.com/doi/abs/10.1111/j.1745-9125.2000.tb00905.x | |
| dc.relation.references | Pascal, B. (1665), Traité du triangle arithmétique, Guillaume Desprez, Paris. *https://gallica.bnf.fr/ark:/12148/bpt6k62995 | |
| dc.relation.references | Paula, G. A. (2010), Modelos de Regress˜ao: com apoio computacional, Universidade de S˜ao Paulo, S˜ao Paulo, Brasil. *https://repositorio.usp.br/item/001392542 | |
| dc.relation.references | Pedregosa, F., Varoquaux, G., Gramfort, A., Michel, V., Thirion, B., Grisel, O., Blondel, M., Prettenhofer, P., Weiss, R., Dubourg, V., Vanderplas, J., Passos, A., Cournapeau, D., Brucher, M., Perrot, M. & Duchesnay, (2011), ‘Scikit-learn: Machine learning in python’, Journal of Machine Learning Research 12, 2825–2830. | |
| dc.relation.references | Poisson, S. D. (1837), Recherches sur la probabilit´e des jugements en mati`ere criminelle et en mati`ere civile, Bachelier, Paris. *https://archive.org/details/recherchessurlap00pois | |
| dc.relation.references | Policía Nacional de Colombia (2022), Estadísticas de criminalidad 2022, Informe oficial, Policía Nacional de Colombia. *https://www.policia.gov.co | |
| dc.relation.references | Rincón, A. & Sánchez, J. (2020), ‘Impacto económico de la inseguridad en locales comerciales: Un estudio de caso en bogotá’, Economía y Sociedad 29(1), 77–92. | |
| dc.relation.references | Romero, C., Arcos, J., Cano, M. & Sánchez, J. (2003), Métodos Estadísticos Aplicados a las Ciencias Sociales, Editorial Universitaria, Madrid. *https://www.editorialuniversitaria.com/libro/metodos-estadisticos-aplicados-a-las-cienciassociales12345 | |
| dc.relation.references | Schwarz, G. (1978), ‘Estimating the dimension of a model’, The Annals of Statistics 6(2), 461–464. *https://projecteuclid.org/euclid.aos/1176344136 | |
| dc.relation.references | Sturman, M. (1999), ‘Using count data in employment research’, Journal of Applied Psychology 84(2), 300–315. *https://psycnet.apa.org/doi/10.1037/0021-9010.84.2.300 | |
| dc.relation.references | Tibshirani, R. (1996), ‘Regression shrinkage and selection via the lasso’, Journal of the Royal Statistical Society: Series B (Methodological) 58(1), 267–288. | |
| dc.relation.references | Venables, W. & Ripley, B. (2002), Modern Applied Statistics with S, Springer. *https://link.springer.com/book/10.1007/978-0-387-21706-2 | |
| dc.relation.references | Wilson, J. Q. & Kelling, G. L. (1982), ‘Broken windows: The police and neighborhood safety’, The Atlantic Monthly 249(3), 29–38. | |
| dc.relation.references | Winkelmann, R. (2000), Econometric Analysis of Count Data, Springer, Berlin. *https://link.springer.com/book/10.1007/978-3-662-10825-2 | |
| dc.relation.references | Zeileis, A., Kleiber, C. & Jackman, S. (2008), ‘Regression models for count data in r’, Journal of Statistical Software 27(8), 1–25. *https://www.jstatsoft.org/article/view/v027i08 | |
| dc.relation.references | Zou, H. & Hastie, T. (2005), ‘Regularization and variable selection via the elastic net’, Journal of the Royal Statistical Society: Series B (Statistical Methodology) 67(2), 301–320. | |
| dc.rights | Attribution 2.5 Colombia | en |
| dc.rights.accessrights | info:eu-repo/semantics/openAccess | |
| dc.rights.local | Abierto (Texto Completo) | spa |
| dc.rights.local | Abierto (Texto Completo) | spa |
| dc.rights.uri | http://creativecommons.org/licenses/by/2.5/co/ | |
| dc.subject.keyword | Business thefts | |
| dc.subject.keyword | Criminal incidence | |
| dc.subject.keyword | Count data regression models | |
| dc.subject.keyword | Poisson model | |
| dc.subject.keyword | Negative Binomial model | |
| dc.subject.keyword | Overdispersion | |
| dc.subject.keyword | Spatial dependence | |
| dc.subject.keyword | Moran’s Index | |
| dc.subject.keyword | Spatial lag | |
| dc.subject.keyword | GDP (Gross Domestic Product) | |
| dc.subject.keyword | Monetary poverty | |
| dc.subject.keyword | Crime rate | |
| dc.subject.keyword | Spatial autocorrelation | |
| dc.subject.keyword | Generalized Linear Models (GLM) | |
| dc.subject.keyword | Residual evaluation | |
| dc.subject.keyword | AIC (Akaike Information Criterion) | |
| dc.subject.keyword | Penalized regression (Lasso) | |
| dc.subject.keyword | Spatial analysis | |
| dc.subject.keyword | Public policy | |
| dc.subject.lemb | Estadísticas | |
| dc.subject.lemb | Hurtos a comercios--Colombia | |
| dc.subject.lemb | Delitos económicos--Colombia | |
| dc.subject.proposal | Hurtos a comercio | |
| dc.subject.proposal | Incidencia delictiva | |
| dc.subject.proposal | Modelos de regresión para datos de conteo | |
| dc.subject.proposal | Modelo Poisson | |
| dc.subject.proposal | Modelo Binomial Negativo | |
| dc.subject.proposal | Sobredispersión | |
| dc.subject.proposal | Dependencia espacial | |
| dc.subject.proposal | Índice de Moran | |
| dc.subject.proposal | PIB (Producto Interno Bruto) | |
| dc.subject.proposal | Lag espacial | |
| dc.subject.proposal | Pobreza monetaria | |
| dc.subject.proposal | Tasa de criminalidad | |
| dc.subject.proposal | Autocorrelación espacial | |
| dc.subject.proposal | Modelos lineales generalizados (GLM) | |
| dc.subject.proposal | Evaluación de residuos | |
| dc.subject.proposal | AIC (Criterio de Información de Akaike) | |
| dc.subject.proposal | Modelos penalizados (Lasso) | |
| dc.subject.proposal | Análisis espacial | |
| dc.subject.proposal | Políticas públicas | |
| dc.title | Análisis de Incidentes de Actividad Criminal en Colombia (2023) Usando Modelos de Regresión para Datos de Conteo. | |
| dc.type | bachelor thesis | |
| dc.type.coar | http://purl.org/coar/resource_type/c_7a1f | |
| dc.type.coarversion | http://purl.org/coar/version/c_ab4af688f83e57aa | |
| dc.type.drive | info:eu-repo/semantics/bachelorThesis | |
| dc.type.version | info:eu-repo/semantics/acceptedVersion |
Archivos
Bloque original
1 - 3 de 3
Cargando...
- Nombre:
- 2024cartaderechosdeautor
- Tamaño:
- 305.92 KB
- Formato:
- Adobe Portable Document Format
Cargando...
- Nombre:
- 2024cartadefacultad.pdf
- Tamaño:
- 126.08 KB
- Formato:
- Adobe Portable Document Format
Bloque de licencias
1 - 1 de 1
Cargando...
- Nombre:
- license.txt
- Tamaño:
- 807 B
- Formato:
- Item-specific license agreed upon to submission
- Descripción:

