Jonathan Muhire Jonathan Muhire

About

I build ML systems and data infrastructure. 19 open-source repositories spanning agents, document AI, distributed storage, NLP evaluation frameworks, and full-stack applications. I write research code that works at production scale.

GSoC 2025 contributor. Previously co-founded Neotix. Studying Computer Science at Oklahoma Christian University.

Current Focus

  • Memory-augmented agents — ReAct with 4 memory modes, needle-in-haystack evals showing 0% to 100% retrieval recovery.
  • Document AI — GSoC 2025. Layout detection, OCR, and post-correction on Renaissance manuscripts.
  • Data infrastructure — Distributed MinIO + LakeFS for reproducible ML experiments.
  • Biomedical NLP — Token-level and span-level clinical concept extraction benchmarks.

Other Work

Stack

Python, C++, Java, Dart, JavaScript. PyTorch, Transformers, Docker, MinIO, LakeFS, Flutter, React, Firebase.

Contact

muhirejonathan123@gmail.com
GitHub · LinkedIn · Book a call