Modelling underreported Spatio-temporal Crime Events

Authors: Álvaro J. Riascos Villegas, Jose Sebastian Ñungo, Lucas Gómez Tobón, Mateo Dulce Rubio, Francisco Gómez
Language: English
Year of publication: 2023
DOI: 10.5281/zenodo.7868622
Description: The code needed to replicate our work is available in our GitHub repository.

Description of the files

distance_1000.csv: a data frame with 5000 rows and 3 columns. Each row is a time step of the algorithms and reports the Euclidean distance between the vector of real crime rates in each cell and the estimate produced by the algorithm. The exercise was performed with 1,000 arms and at most 100 super arms. This file is created by the times.py script of our repository.

distance_10000.csv: a data frame with 5000 rows and 3 columns. Each row is a time step of the algorithms and reports the Euclidean distance between the vector of real crime rates in each cell and the estimate produced by the algorithm. The exercise was performed with 10,000 arms and at most 1,000 super arms. This file is created by the times.py script of our repository.

distance_50000.csv: a data frame with 5000 rows and 3 columns. Each row is a time step of the algorithms and reports the Euclidean distance between the vector of real crime rates in each cell and the estimate produced by the algorithm. The exercise was performed with 50,000 arms and at most 5,000 super arms. This file is created by the times.py script of our repository.

grilla_bogota.csv: a data frame with 1638 rows and 5 columns in which each row describes one cell of the Bogotá grid. Unlike grilla_bogota2.csv, this file is used to plot Figure 9, which includes the rural area of the city. That area is removed from our analysis because of its low crime density. This file is created by the 3_create_grid.ipynb notebook of our repository.

grilla_bogota2.csv: a data frame with 1008 rows and 10 columns in which each row describes one cell of the Bogotá grid. This file is more complete than grilla_bogota.csv because it includes the name of the Localidad to which the cell's centroid belongs and its Rep. Rate. However, it does not contain the rural area of the city. This file is created by the 3_create_grid.ipynb notebook of our repository.

localidades.zip: a zipped folder containing the shapefiles needed to draw the map of Bogotá with its administrative boundaries. This information is public and can also be found on the government's open data page.

matriz_eventos_real.csv: a matrix of 498 rows and 368 columns in which each row represents one cell of Bogotá's grid and each column gives the number of real crimes on one date. Recall that we assume the total number of crimes is the combination of NUSE and SIEDCO crimes after duplicates are removed. This file is created by the 3_create_grid.ipynb notebook of our repository.

matriz_eventos_subreporte.csv: a matrix of 498 rows and 368 columns in which each row represents one cell of Bogotá's grid and each column gives the number of underreported crimes on one date. Recall that we assume the number of underreported crimes is the number of crimes reported in NUSE. This file is created by the 3_create_grid.ipynb notebook of our repository.

subreporte_ccb.csv: a data frame of 498 rows and 4 columns that describes the Rep. Rate and lambda for each cell of Bogotá's grid. This file is created by the 3_create_grid.ipynb notebook of our repository.

upla.zip: a zipped folder containing additional shapefiles for drawing the map of Bogotá with its administrative boundaries. This information is public and can also be found on the government's open data page.

victimización.xlsx: an Excel file with 20 rows and 4 columns that contains the Vict. Rate and the Rep. Rate for each Localidad of Bogotá. This information comes from the survey-based victimization rates and victim crime-reporting rates published by the Bogotá Chamber of Commerce (2014).
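As a rough illustration of the quantities these files describe, the following Python sketch computes a Euclidean distance between a vector of true per-cell rates and an estimate (the kind of value stored in the distance_*.csv files), and a per-cell ratio of reported to real events of the kind that could be derived from the two event matrices. The function names, the toy arrays, and the ratio itself are assumptions made for illustration: the record does not state the CSV headers or index layout, and the Rep. Rate column in the published files comes from survey data, so it need not equal this empirical ratio.

```python
import numpy as np


def euclidean_distance(true_rates, estimated_rates):
    """Euclidean (L2) distance between two per-cell rate vectors."""
    true_rates = np.asarray(true_rates, dtype=float)
    estimated_rates = np.asarray(estimated_rates, dtype=float)
    return float(np.linalg.norm(true_rates - estimated_rates))


def empirical_reporting_ratio(real_events, reported_events):
    """Per-cell share of real events that were reported, summed over dates.

    Rows are grid cells and columns are dates, mirroring the layout described
    for the two event matrices. Cells with no real events get NaN.
    """
    real_totals = np.asarray(real_events, dtype=float).sum(axis=1)
    reported_totals = np.asarray(reported_events, dtype=float).sum(axis=1)
    ratio = np.full(real_totals.shape, np.nan)
    np.divide(reported_totals, real_totals, out=ratio, where=real_totals > 0)
    return ratio


# Toy stand-ins (3 cells x 3 dates) for the real and reported event matrices;
# these are NOT values from the dataset.
real = np.array([[2, 0, 4], [1, 1, 0], [0, 0, 0]])
reported = np.array([[1, 0, 2], [1, 0, 0], [0, 0, 0]])

ratio = empirical_reporting_ratio(real, reported)
distance = euclidean_distance([3.0, 4.0], [0.0, 0.0])

print(ratio)
print(distance)
```

To apply the same functions to the published matrices, the files would first need to be loaded into arrays with rows as cells and columns as dates; how to do that depends on the actual file layout, which should be checked before use.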
Database: OpenAIRE