Henry Willis
Work

Where I've spent my time.

Professional experience spanning data science, research, and engineering roles.

Jun 2025 — Current

Data Scientist

Open Justice Lab (Contractor)·New York, NY

  • Implementing causal inference methods to identify variables driving deaths in custody and recidivism.
  • Processing and cleaning data produced by Boston University Spark! teams.
  • Running self-hosted vLLMs to convert FOIA-gathered PDFs into usable tabular data via OCR.
  • Submitting and managing FOIA requests for state prison capacities, prison building materials, deaths in custody, and gun deaths by county.
  • Merging BJS census snapshots that lack shared facility IDs using probabilistic record matching.
  • Using Bayesian sampling and modeling to estimate per-document error rates for OCR-converted records.
  • Building websites and visualizations so litigators, researchers, and lay readers can explore the data directly.
Python · Causal inference · Bayesian modeling · vLLM · OCR · FOIA · Data viz
Jul 2024 — Current

LSAT Tutor

7Sage (Contractor)·Remote

  • Scored in the 99th percentile on the LSAT.
  • Conduct weekly tutoring sessions with students all over the world to prepare them for the LSAT.
  • Provide personalized study plans and test-taking strategies.
LSAT prep · Teaching · Remote tutoring
Jun 2024 — May 2025

Research Assistant

Open Justice Lab·Boston, MA

  • Automated conversion of thousands of PDFs from FOIA requests to tabular format.
  • Aggregated prison overcrowding data from fifteen states into a central database.
  • Performed exploratory data analysis using NLP methods on thousands of pages of Extraordinary Occurrence Reports from Pennsylvania state prisons.
Python · NLP · PDF processing · Data analysis
Jun 2023 — Sep 2023

Data Science Intern

Achillea Peer Tutoring·Boston, MA

  • Migrated multiple disconnected data sources into a PostgreSQL database.
  • Used Pandas dataframes to produce analytics for various subgroups of students.
  • Visualized analytics as part of client-facing progress reports.
  • Managed relationships with partner organizations' information technology teams.
PostgreSQL · Pandas · Data viz · Analytics
Jun 2022 — Jan 2023

Field Applications Engineering Intern

AEye, Inc.·Dublin, CA

  • Managed relationships with three perception software vendors.
  • Set up LiDAR processing units (LPUs) on-site and captured data.
  • Wrote scripts to convert point cloud data into JSON lists of vehicles and their attributes.
  • Implemented remote access to LPUs for customers and wrote JIRA documentation.
Python · LiDAR · JSON · JIRA · Point clouds
Jan 2022 — Jan 2023

Tutor

BU Educational Resource Center·Boston, MA

  • Tutored students weekly in verified courses, primarily Economics.
Economics · Teaching