Jonathan Muhire

Jonathan Muhire

Founder @ Neotix · Robotics Data Infrastructure
Dual Degree: BS Computer Science & MS Artificial Intelligence

Enabling Embodied ai Agents to learn from the physical world as naturally as we perceive, learn and act through our complex and diverse world
Currently exploring: Multimodal fusion for robust embodied AI through curriculum learning and synthetic data augmentation
Projects
Developed remote teleoperation system for bimanual dexterous manipulation and data collection using PyMyCobot. Implemented real-time control protocols and collected demonstration data for imitation learning tasks like pick and place, enabling precise manipulation through intuitive human-robot interfaces.
Python ROS MyCobot Teleoperation
Engineered comprehensive data extraction pipeline for UMI, implementing SLAM-ORB3 for trajectory extraction from GoPro footage and gripper state estimation. Developed custom Rerun visualizations for real-time 7-DOF trajectory analysis (3D position + 4D quaternion), synchronizing visual SLAM outputs with gripper sensor data. Pipeline processes raw demonstrations into training-ready datasets with precise temporal alignment between camera poses and manipulation actions.
SLAM-ORB3 Rerun Python OpenCV 7-DOF Tracking
Developed annotation tools for robotic manipulation datasets using embodied Chain-of-Thought methods. Created interfaces for labeling gripper poses, contact points, and task-relevant keyframes in demonstration data.
Python Computer Vision Annotation Tools PyTorch
Architected scalable data storage and versioning system for robotics datasets. Deployed MinIO for S3-compatible object storage with LakeFS for Git-like version control of large sensor data, enabling reproducible ML experiments and efficient data management across distributed teams.
MinIO LakeFS S3 API Docker Data Versioning
Built end-to-end robotic learning platform using LeRobot framework. Implemented data collection pipelines, teleoperation interfaces, and policy training infrastructure for the ISO-101 robot, enabling rapid prototyping of manipulation behaviors from human demonstrations.
LeRobot PyTorch ROS2 Imitation Learning
Research Threads
Embodied AI & Robotics
Technical architecture for scalable robotics data collection infrastructure
On challenges and approaches in large-scale robotics data acquisition
Benchmarking framework for embodied AI systems
Dexterous control systems for robotic manipulation
Dexterous manipulation benchmarks and datasets
Interactive simulacra of human behavior
Language Models & NLP
Morphology-aware Kinyarwanda language model
Benchmarking cross-lingual text classification for Kinyarwanda
Kinyarwanda NLP models and speech systems
Machine translation systems for Rwandan languages
Novel algorithms for underrepresented languages
Extreme quantization in language models
Model Architecture & Theory
Mathematical reasoning and problem-solving models
Demystifying neural network operations
Adaptive computation during model inference
Optimization techniques for large model deployment
Comprehensive overview of transformer applications
Frontier Research
LK-99 and ambient-pressure superconductivity claims
Reading
On Building & Research
Paul Graham on startup growth strategies
Ben Kuhn on developing deep conviction in your work
Ben Kuhn on choosing impactful work
On effective work vs suffering for its own sake
Tyler Cowen on the art of inquiry
Self-directed CS curriculum guide
Career & Life Philosophy
On independent thinking and value formation
Sam Altman's career advice for young people
On avoiding career stagnation through comfort
Marina Keegan's essay on connection and possibility
CS Lewis on social exclusion and ambition
David Deutsch on explanations that transform the world
Adventures of a curious character
Albert Camus on existentialism and absurdity
On shared experiences and diverging paths
Rethinking our perception of time
Technical Deep Dives
Leopold Aschenbrenner on AGI timelines and implications
Stephen Wolfram's technical deep dive into LLMs
GPS
Bartosz Ciechanowski's interactive explainer
Analysis of Chinese AI model architectures
Profile of the Anduril founder and defense innovation
America's position in the new labor economy
Asimov's short story on entropy and existence
Epsilon Theory on societal trust erosion
Resources
Courses & Tutorials
Michael Nielsen's interactive book on neural networks
Stanford's course on transformer architectures
Building and deploying ML applications
Papers & Implementations
Implementation guides for key ML papers
Karpathy on the importance of fundamentals
Tim Dettmers on LLM quantization
Tools & Frameworks
Meta's foundation model for image segmentation
AI agent for autonomous code generation
Career Resources
Crowdsourced list of internship opportunities