Our Methodology

To estimate the number and share of people eligible for but not receiving relief (expungements, set-asides, etc.) from criminal convictions in each state, we proceeded as follows, implementing the approach developed in Colleen V. Chien, America's Paper Prisons: The Second Chance Gap (2020).

First, we ascertain the relevant records relief laws and develop rules logic, using legal research to develop lists of ineligible and eligible charges. Next, we obtain and clean the data sample and collect information on the state’s criminal population. Where possible, we also obtain administrative data on the number of expungements granted historically. Next, we develop flow logic to model the laws. Next, we apply the flow logic to the data sample to estimate eligibility shares in the sample. Finally, we extrapolate from the population in the sample to the total criminal population in the state overall to calculate number and share of individuals in the “current gap” (people with currently records eligible for relief) as well as the “uptake gap” (share of people eligible for expungement over time that have not received them). The descriptions below disclose several shortcomings in our approach, including our inability to account for outstanding fines or out of state charges which could potentially disqualify some individuals for relief, failure to model criteria from whom eligibility is unascertainable from the available record, the existence of missing data for which we assume a lack of eligibility, and our inability to be sure that our sample is representative of all with criminal records in the state.

The Process


Ascertaining the Law and Developing Rules Logic

Based on court guidelines, statutes, and guides from non-profits, we discern each state’s law and determine its internal logic, with respect to the charge grade (e.g. misdemeanor or felony), offense type (e.g non-violent or domestic violence charge), time (e.g 3-year waiting period), disposition type (e.g. nolo contendere) and person conditions (e.g. a lifetime limit of 2 convictions) that define eligibility. The assumptions we rely on are disclosed in every report.

From these rules, we created lists of eligible and ineligible offenses. To do so, we reviewed the relief rules for disqualified classes of charges and then searched the criminal code for the corresponding statute name or number corresponding with each class of charges. We then used these statutes to identify the characteristics of each potentially eligible offense: their charge type (e.g. felony, misdemeanor), degree, and the maximum possible duration of incarceration/amount to be fine for each offense. Once we had assembled the characteristics of each potentially ineligible offense, we cross referenced each offense and its characteristics against the eligibility statute. If a specific statute section was outside the prescribed characteristics of any category of eligibility (e.g., class of offense, degree, maximum duration of incarceration/amount to be fined, etc.), the offense was deemed ineligible for expungement. The offenses that were within each of the eligibility requirements after this process were deemed eligible for expungement. We did not consider the eligibility of offenses that fulfilled the unmodeled criteria referenced above, making our estimate under-inclusive and over-inclusive.


Obtaining the Data Sample and Collecting Data on the State Population of Individuals with Criminal Records and the Number of Expungements Granted

From a state data vendor, we obtain court records on which to do our analysis. We generally seek data from the state directly but where not available, we use the sample of background checks described in Chien 2020. Where not already available, we use Name+DOB to create unique person IDs and create state-specific criminal histories for each person. Profile information on the analyzed population is provided below in every report in Appendix B.

We approximate the number of people in the state with criminal charges using a few methods. If state criminal population information is available directly from the state, we rely on it. When it isn’t available, we consider two sources. First, we consult public records provided by SEARCH (2022), a listing of criminal subject counts provided by the repositories of each state. We then adjust for growth in the number of people with records using a 3% CAGR average based on 10 years of historical data. As a sanity check, we compare this number with the estimated number of people with criminal records derived based on taking the population of people in the state from the Census and then multiplying the “national average” share of ~25% of Americans having a criminal record (derived from 331M individuals and 80M people with criminal records). When the difference is large (i.e. more than ~25%), we use the population-derived number. The raw numbers derived from SEARCH records and from the state include multi-state offenders, people who do not live in the state at the time of the crime, and also, people that may have since their disposition left the state. Regardless of the source, the raw numbers do not account for deported or deceased people. As described in the report, where possible we make adjustments to take into account these factors, but it should be reiterated that from these reasons, the population number provided are estimates.

We further account for people with uncharged arrests as described in Chien (2020) based on an analysis prepared by Professor Robert Apel of Rutgers University based on the NLSY97, an ongoing U.S. Bureau of Labor Statistics survey tracking 7,335 randomly selected people starting in their 20’s by removing them from our eligibility analysis, which is based on court records.

In addition to researching the number of individuals with criminal histories, we seek from state sources administrative data on the number of expungements granted historically. When public reports are not available, we file records requests or consult other sources of information. We use this data to calculate the “uptake rate” and number of years it would take to clear the backlog.


Applying the Law to the Sample Data to Obtain an Eligibility Share

To apply the law to data, we use the methods described in Chien (2020) to first prepare the data by cleaning and labeling dispositions and charges data. We report the share of charges missing dispositions or charge types in Appendix B of each report. We then apply the logic to the sample to obtain a share of people eligible for records relief in the sample. When relevant data is missing, we assume, conservatively, that the charge or incident is ineligible for relief.

To approximate “sentence completion” we use recorded sentences where available, assuming that the sentence has been carried out, and where not available, an assumption that the sentence is completed 2.5 years after the disposition date for misdemeanor charges, and 3.5 years after the disposition date for felony charges where sentence completion was not readily available. Importantly, we do not account for outstanding fines or out of state charges which can potentially disqualify some individuals for relief per the summary of the rules.

When the eligibility of frequently occurring charges isn’t addressed directly by the “top down” methodology described above, of researching eligibility or ineligibility based on the rules, we use a “bottom up” approach of researching these charges and ascertaining their eligibility one by one.


Applying the Eligibility Share to the Criminal Population and State History of Relief to Estimate the Number of People in the Second Chance Gap

To develop a total state eligibility estimate based on the shares derived in the steps above we make the assumption that the sample is representative enough of the criminal population that we can use its eligibility shares as the basis for a rough state estimate. Our figures should be caution We then apply these shares to the estimated number of people with court criminal records in the state, developed using the approach described above. This yields our estimation of the number and share of individuals in the “current gap” (people with currently records eligible for relief) as well as, in combination with the expungement actuals mentioned above, the “uptake gap” (share of people eligible for expungement over time that have not received them).


The Paper Prisons Initiative is made possible through the support of our collaborators and partners

Contact Us

If you wish to contact us, please fill out the entire form below and press the "Send Message" button. If you wish to sign up for email updates from us, please only fill out your name and email in the form below and press the "Sign Up for Updates" button.


© 2024 Paper Prisons Initiative

Privacy Policy | Terms and Conditions | Cookie Policy