Jonathan Muhire Jonathan Muhire
Jonathan Muhire

I debug and ship systems that have to work.

Software engineer focused on robotics simulation, CI reliability, ML evaluation, and data infrastructure. Recent work: an openpilot MetaDrive CI PR with 20 consecutive full workflow passes.

Autonomous driving simulator road connected to CI test cards and log traces
Simulator CI: observed failure modes, targeted patches, repeatable Linux proof.

Systems, evaluation, and robotics work.

openpilot MetaDrive CI Reliability

Submitted an upstream openpilot PR for bounty #30693: Linux GitHub Actions simulator coverage for MetaDrive, CI-scoped reliability fixes, log artifacts, and 20 consecutive full workflow passes on a single submitted commit.

Autonomous-driving simulator connected to CI test artifacts
Robotics Data Infrastructure

Built a ROS2/C++ teleoperation and data-capture prototype, with MinIO and LakeFS backing roughly 500GB of RGB-D and geometry logs for reproducible dataset snapshots and rollback.

Robot data-capture rig connected to versioned storage and dataset snapshots
Agent Evaluation — Memory Ablations

ReAct agent with memory-mode ablations and needle-in-haystack benchmarks. Eval-driven development: define the failure mode, write the test, then improve the system until it passes.

Agent memory modules connected to retrieval tests and evaluation heatmaps
Vesuvius Challenge — 3D Segmentation

3D volumetric deep learning for detecting ancient papyrus surfaces in CT scans of carbonized Herculaneum scrolls. Focused on data handling, model evaluation, and segmentation quality.

Translucent CT volume with segmented papyrus surface layers
RenAIssance — GSoC 2025

Open-source document understanding pipeline for Renaissance manuscripts using LayoutLMv3, OCR, structured logging, validation, and batch-processing failure handling.

Manuscript document moving through OCR, layout detection, and validation stages

Notes and breakdowns.

Open to systems, robotics, and ML infrastructure roles.

Looking for teams that value debugging, reproducibility, and software that survives real execution.