San Diego Crime Incidents with Demographic Descriptions
sandiegodata.org-crime_victims-1.1.3
Last Update: 2020-11-22T23:16:50
Crime incidents in San Diego, from 2016 through July 2020 inclusive, with UCR codes for the crime and the age, race and sex of the victim and suspect.
This dataset describes crime incidents from 2016 to 2020, with demographic
information for both the victims and suspects. The file has multiple rows per
incident, one for each suspect or victim. The primary key pk links records
together into a single crime incident. The dataset is derived from data acquired
through a Public Records Act (PRA) request and is processed to standardize
geographic identifiers and racial categories. Refer to the source dataset for
the original data and the PRA request used to acquire it.
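Because each incident spans several rows, most analyses start by grouping rows on pk. The following is a minimal sketch of that grouping using only the standard library. The sample rows and the person_role and age column names are hypothetical; only the pk column name comes from the description above.

```python
from collections import defaultdict

# Hypothetical sample rows; only the "pk" column name comes from the dataset
# description. "person_role" and "age" are illustrative names.
rows = [
    {"pk": 1, "person_role": "VICTIM", "age": 34},
    {"pk": 1, "person_role": "SUSPECT", "age": 29},
    {"pk": 2, "person_role": "VICTIM", "age": 51},
]

def group_by_incident(rows):
    """Collect all person rows that share a pk into one incident."""
    incidents = defaultdict(list)
    for row in rows:
        incidents[row["pk"]].append(row)
    return dict(incidents)

incidents = group_by_incident(rows)
print(len(incidents))     # number of distinct incidents
print(len(incidents[1]))  # number of people recorded for incident 1
```

Counting rows directly would count people, not incidents, so incident-level statistics should be computed on the grouped result.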
Processing
The data presented here are a processed version of the file received from ARJIS
through a Public Records Act request. The processing includes:
- Converting the tract identifier to a formal ACS format tract geoid
- Converting the block identifier to a formal ACS format block geoid
- Adding the position of the centroid of the tracts, in WKT format
- Adding the Census internal point location for the block, in WGS 84 latitude and longitude
- Recoding the race field to the Census race / ethnicity scheme
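As an illustration of the geographic steps above, the sketch below builds an ACS-style tract geoid and a WKT point. It assumes the input is a numeric state, county and tract code, which may not match the layout of the raw ARJIS file; the function names are hypothetical and this is not the processing code actually used for the dataset.

```python
def tract_geoid(state_fips, county_fips, tract_code):
    """Build an ACS-style tract geoid.

    The ACS form is the summary level (140 for tracts) followed by '00US',
    then the 2-digit state, 3-digit county and 6-digit tract codes.
    """
    state = str(state_fips).zfill(2)
    county = str(county_fips).zfill(3)
    tract = str(tract_code).zfill(6)
    return "14000US" + state + county + tract

def point_wkt(lon, lat):
    """Format a point as WKT; WKT puts x (longitude) before y (latitude)."""
    return f"POINT ({lon} {lat})"

# San Diego County is state 06, county 073. The tract code and the
# coordinates below are illustrative values only.
print(tract_geoid(6, 73, 100))
print(point_wkt(-117.1611, 32.7157))
```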
Additional processing that was performed on the upstream data, which came directly
from ARJIS, includes:
- Created "year" field
- Deleted MACRStatus from years 2017-2020
- Combined years into 1 file
- Deleted partial August 2020 cases so that the data contain only complete months
- Deleted 2 ARJIS and 1 DA as AGENCY records
- Deleted incident type (all were crime case), highcharge (all were 1) and role (all were incident)
- ALLYRS_NOSUSP includes only victims, victim/witnesses and blank (property?) in the person role
- UNIQUECASE includes unique case numbers (no matter how many victims)
Race recode
The race field of the original data includes many names of regions,
countries or ethnicities. The census_race_eth field is a recode of the
race field to use the race/ethnicity scheme used by the Census. The codes
used are:
- nhwhite: Non Hispanic White
- hispanic: Hispanic, of any race
- black: Black or African-American
- asian: Asian
- nhopi: Native Hawaiian or Pacific Islander
This file does not include any records that would be classified as the
remaining census race codes, such as American Indian or Alaskan Native. These
are the translations from the values in the race field to those of the
census_race field:
- OTHER: other
- none: unknown
- WHITE: nhwhite
- HISPANIC: hisp
- BLACK: black
- MIDDLE EASTERN: white
- PACIFIC ISLANDER: nhopi
- CHINESE: asian
- JAPANESE: asian
- OTHER ASIAN: asian
- FILIPINO: asian
- ASIAN INDIAN: asian
- GUAMANIAN: nhopi
- VIETNAMESE: asian
- HAWAIIAN: nhopi
- INDIAN: asian
- CAMBODIAN: asian
- KOREAN: asian
- SAMOAN: nhopi
- LAOTIAN: asian
- EAST AFRICAN: black
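The translation table above can be expressed as a simple lookup. The sketch below copies the listed values verbatim, including "hisp" and "white" exactly as they appear in the list. The function name, the treatment of a missing value as the "none" entry, and the choice to raise an error for unlisted values are assumptions made for illustration.

```python
# Mapping copied from the translation list above; values are reproduced
# exactly as listed.
RACE_TO_CENSUS = {
    "OTHER": "other",
    "WHITE": "nhwhite",
    "HISPANIC": "hisp",
    "BLACK": "black",
    "MIDDLE EASTERN": "white",
    "PACIFIC ISLANDER": "nhopi",
    "CHINESE": "asian",
    "JAPANESE": "asian",
    "OTHER ASIAN": "asian",
    "FILIPINO": "asian",
    "ASIAN INDIAN": "asian",
    "GUAMANIAN": "nhopi",
    "VIETNAMESE": "asian",
    "HAWAIIAN": "nhopi",
    "INDIAN": "asian",
    "CAMBODIAN": "asian",
    "KOREAN": "asian",
    "SAMOAN": "nhopi",
    "LAOTIAN": "asian",
    "EAST AFRICAN": "black",
}

def recode_race(value):
    """Translate a raw race value to its census code.

    Assumption: a missing value (None) corresponds to the "none" entry in
    the list and maps to "unknown". Unlisted values raise KeyError so that
    unexpected categories are not silently recoded.
    """
    if value is None:
        return "unknown"
    return RACE_TO_CENSUS[value.strip().upper()]

print(recode_race("FILIPINO"))
print(recode_race(None))
```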
For the 2020 census, Filipinos may be classified as Pacific Islanders, rather
than Asian, as they had been in previous years. Because this data was collected
before this transition, Filipinos are classified as Asians.
Contacts
Resources
- sdcrime_16_20. San Diego crime suspects and victims, 2016 to 2020
- ucrcodes. UCR codes and detailed descriptions
References
- census_blocks, data/census_blocks.csv. Census 2010 blocks, converted to ACS geoids, with centroid position
- op_sd_crime_xls. Response from PRA request
- op_sd_crime_csv. Conversion of main tab of response data to CSV