Jonathan Muhire Jonathan Muhire
Jonathan Muhire

I build intelligent systems that ship.

ML systems, data infrastructure, and applied AI. 19 open-source repositories with real benchmarks and production code. GitHub has everything.

CS & AI · GSoC 2025 · Prev co-founder @ Neotix

Research and systems I've shipped.

DocIntel document intelligence pipeline DocIntel — Document Intelligence API

End-to-end document intelligence pipeline. Upload PDFs or images → layout detection, OCR, entity extraction → structured JSON. FastAPI + React + 65 tests.

FastAPI Document AI React
RenAIssance document AI pipeline RenAIssance — GSoC 2025

End-to-end Document AI for Renaissance manuscripts. LayoutLMv3 layout detection, U-Net text recognition, Tesseract OCR, and post-correction. 179 files.

Document AI LayoutLMv3 PyTorch
Vesuvius Challenge 3D segmentation Vesuvius Challenge

3D deep learning for detecting ancient papyrus in X-ray CT scans. Four architectures (3D U-Net, Attention U-Net, V-Net, Double U-Net) with ensemble + TTA.

3D Vision MONAI Kaggle
AgriFinance credit scoring platform AgriFinance

Full-stack agricultural finance platform. ML credit scoring, satellite NDVI crop monitoring, climate-adjusted loans, and offline-first PWA for low-connectivity regions.

Flask TF Lite PWA
Swarm robotics simulation analytics Swarm Robotics Simulator

Full-stack web app for multi-robot simulation. React + Express + PostgreSQL with live visualization, configurable presets, and performance analytics.

TypeScript React PostgreSQL
Flint language compilation pipeline Flint Programming Language

Custom language with three backends: tree-walk interpreter, C transpiler, and LLVM compiler. Lexer, parser, AST, and code generation from scratch.

Compilers LLVM PL Design
CampusBuddy mobile app screens CampusBuddy

Flutter mobile app for campus life — events, dining, resources. 412 files across iOS, Android, and desktop with real data integrations.

Flutter Dart Mobile

All repositories, by domain.

🧠
ML & AI Research · 7
Agentic Long-Runner

ReAct agent with 4 memory modes. Chunked retrieval recovers needle-in-haystack from 0% to 100%.

ReAct LangChain
RenAIssance — GSoC 2025

Document AI for Renaissance manuscripts. LayoutLMv3 detection, Tesseract OCR, and post-correction pipeline.

LayoutLMv3 PyTorch
CFN Biomedical Eval

Leakage-safe evaluation of Compositional Function Networks for biomedical tabular prediction. AUROC/AUPRC benchmarks.

NLP Eval
ArtExtract

CNN-RNN art classification. Style, artist, and genre prediction with confusion matrix validation.

PyTorch CV
ISSR — Crisis Detection

Behavioral analysis for suicide prevention and mental health crisis detection with geospatial trend mapping.

NLP Geospatial
Vesuvius Challenge

Deep learning for detecting ancient papyrus surfaces from 3D X-ray CT scans.

3D Vision PyTorch
SFN scRNA Study

Single-cell RNA sequencing analysis pipeline for neuroscience research.

Python Genomics
⚙️
Infrastructure & Systems · 5
DocIntel — Document Intelligence API

End-to-end document intelligence: layout detection, OCR, and entity extraction via FastAPI. React dashboard, 65 tests, Docker deployment.

FastAPI React Docker
MinIO + LakeFS Infrastructure

3-node distributed object store with erasure coding, LakeFS versioning, and Prometheus + Grafana monitoring.

MinIO Docker
ParaView Scientific Viz

Multi-domain visualization: volumetric scalar fields, CFD flow simulation, and 3D LiDAR point clouds.

ParaView VTK
MediSync

Healthcare IoT pipeline connecting glucose monitors and sensors to BigQuery with Vertex AI analytics.

GCP BigQuery
Robot Data Validator

Multi-sensor robot training data validation webapp with embedded Rerun.io visualization.

Python Rerun.io
🤖
Robotics & Simulation · 4
Custom PyMyCobot

Bimanual teleoperation platform for MyArm M&C arms with GUI visualization and real-time streaming.

Python Hardware
LeRobot Modal

Real-time robot vision with Modal-hosted SmolVLA inference and local camera input.

Modal Vision
Swarm Robotics Simulator

Browser-based multi-robot sim with flocking, foraging, and formation presets plus analytics dashboard.

TypeScript Multi-Agent
G1 Humanoid Spin Kick

RL motion imitation for Unitree G1 humanoid dynamic locomotion in MuJoCo.

RL MuJoCo
📱
Full-Stack & Mobile · 5
AgriFinance

Financial services for smallholder farmers. Climate-integrated credit scoring and risk assessment.

Python ML
ConnectFarm

React Native app for farmer credit access with ML climate predictions.

React Mobile
CampusBuddy

Flutter campus app for events, dining, and student resources.

Flutter Dart
XPR366

Infrastructure health monitoring platform using robot-collected sensor data for decision support.

JavaScript Sensors
Doctor Who Wiki DB

ERD, ORM, and GUI system for a Doctor Who knowledge base. Database systems final project.

SQL JavaScript
📜
Languages & Compilers · 2
Flint Lang

Custom interpreted language with interpreter, C transpiler, and LLVM compiler backend.

Python LLVM
Pacman AI

AI-driven game with autonomous agents, ghost AI, pathfinding algorithms, and visual enhancements.

Python Search

View all repositories on GitHub

Notes and breakdowns.

Open to research and systems roles.

Looking for teams building ML systems, data infrastructure, or applied AI.