back
map-pin

London, UK

Reinforcement Learning (RL) Engineer, Manipulation

Our Mission

At Humanoid we strive to create the world’s leading, commercially scalable, safe, and advanced humanoid robots that seamlessly integrate into daily life and amplify human capacity.

What You’ll Do:

  • Train language-vision conditioned manipulation policies via reinforcement learning (RL) in simulation and in the real world.
  • Construct challenging and diverse suites of manipulation tasks in simulation.
  • Partner with teleoperations to collect trajectories in simulation for behavior cloning.
  • Partner with testing and operations to establish real-world RL training pipelines.
  • Experiment with various ways of bringing policies trained in simulation to the real world..

We’re Looking For:

  • 3+ years building deep‑learning systems (industry or research) with shipped models or published artifacts to show for it.
  • Hands‑on with at least one of: LLMs, VLMs, or image/video generative models — architecture, training, and inference.
  • Experience solving real problems using reinforcement learning with deep neural networks in any domain.
  • Strong Python + PyTorch/JAX; you can profile, debug numerics, and write maintainable research code.
  • You are self-driven, pro-active, communicate efficiently, document experiments clearly and communicate trade‑offs crisply.

Nice to have

  • Experience with simulators for robotics (Isaac Sim, MuJoCo etc.)
  • Experience in RL for robotics.
  • Experience building infrastructure for large-scale RL (e.g. using ray).
  • Publications at ICLR/ICML/NeurIPS or equivalent open‑source contributions.
  • Familiarity with OpenVLA, Physical Intelligence (π) models, or similar open VLA frameworks.

What We Offer:

  • Meaningful time off to rest and recharge: 23 days of annual leave (accrued), 15 days of paid sick leave, and paid company holidays.
  • Fully funded private healthcare for UK employees, with broad provider access, virtual and in‑person care, and strong mental health and serious illness support.
  • Equity included–we believe builders should share in what they build.
  • Pension scheme with a total 8% contribution (5% employee, 3% employer) on full earnings.
  • Free daily breakfast, catered lunch, and snacks in‑office.
  • Collaboration with top‑tier engineers, researchers, and product experts in AI and robotics.
  • Freedom to influence the product and own key initiatives.

How to Apply

Does this role sound like the perfect fit for you?
Fill in the form and include links or files that showcase the best of what you’ve built and achieved.

Apply now

*indicates a required field

Thanks for the request! we have already received your details and will contact you soon

Contact us

Have another role in mind? Let us know what you could bring to the team.