back
map-pin

London, UK

Lead, Vision‑Language‑Action (VLA) / Behaviour Learning

Humanoid is the first AI and robotics company in the UK, creating the world’s most advanced, reliable, commercially scalable, and safe humanoid robots. Our first humanoid robot HMND 01 is a next-gen labour automation unit, providing highly efficient services across various use cases, starting with industrial applications.


Our Mission

At Humanoid we strive to create the world’s leading, commercially scalable, safe, and advanced humanoid robots that seamlessly integrate into daily life and amplify human capacity.

Vision

In a world where artificial intelligence opens up new horizons, our faith in its potential unveils a new outlook where, together, humans and machines build a new future filled with knowledge, inspiration, and incredible discoveries. The development of a functional humanoid robot underpins an era of abundance and well-being where poverty will disappear, and people will be able to choose what they want to do. We believe that providing a universal basic income will eventually be a true evolution of our civilization.

Solution

As the demands on our built environment rise, labour shortages loom. With the world’s workforce increasingly moving away from undesirable tasks, the manufacturing, construction, and logistics industries critical to our daily lives are left exposed. By deploying our general-purpose humanoid robots in environments deemed hazardous or monotonous, we envision a future where human well-being is safeguarded while closing the gaps in critical global labour needs.

Responsibilities

  • Set and drive the strategy for representation learning, behaviour cloning and reinforcement learning (RL).
  • Lead large-scale post-training of multi-modal LLM / VLM / VLA systems; continuously integrate new sensor modalities (vision, audio, proprioception, LiDAR, point cloud, …).
  • Build always-on pipelines that collect sim + tele-op logs, store them in a versioned lake, transform / label streams with weak supervision, curate balanced datasets and run an evaluation loop that feeds fresh failure cases back into training.
  • Partner with MLOps and Data Platform teams to scale distributed training and optimise models for real-time edge deployment.
  • Hire, mentor and unblock a small, elite team of research scientists and engineers.

Expertise

  • 6+ years building deep-learning systems, 2+ years technical team leadership.
  • Hands-on experience with LLM / VLM architecture design, billion-parameter training and fine-tuning.
  • Proven RL expertise (behaviour cloning, actor–critic, offline RL) applied to robotics or autonomous driving.
  • Demonstrated record of shipping to real robots or vehicles and iterating via data-flywheel loops.
  • Excellent written and verbal communication.
  • Demonstrated record of shipping to real robots or vehicles and iterating via data-flywheel loops.
  • Excellent written and verbal communication.

Nice to have

  • Deployment on humanoid or legged robots.
  • Experience in autonomous vehicle control and planning.
  • Research or open-source work in multi-modal transformers, diffusion control, world models.
  • Familiarity with OpenVLA, Physical Intelligence (π) models or other open-source VLA frameworks.

Benefits

  • High competitive salary.
  • 23 calendar days of vacation per year.
  • Flexible working hours.
  • Opportunity to work on the latest technologies in AI/ML, Robotics and others.
  • Startup model, offering a dynamic and innovative work environment.

How to Apply

Does this role sound like the perfect fit for you?
Fill in the form and include links or files that showcase the best of what you’ve built and achieved.

Apply now

*indicates a required field

Thanks for the request! we have already received your details and will contact you soon

Contact us

Have another role in mind? Let us know what you could bring to the team.