back
map-pin

London, UK

Senior Perception Engineer – Spatial Understanding & Navigation

Humanoid is the first AI and robotics company in the UK, creating the world’s most advanced, reliable, commercially scalable, and safe humanoid robots. Our first humanoid robot HMND 01 is a next-gen labour automation unit, providing highly efficient services across various use cases, starting with industrial applications.


Our Mission

At Humanoid we strive to create the world’s leading, commercially scalable, safe, and advanced humanoid robots that seamlessly integrate into daily life and amplify human capacity.

Vision

In a world where artificial intelligence opens up new horizons, our faith in its potential unveils a new outlook where, together, humans and machines build a new future filled with knowledge, inspiration, and incredible discoveries. The development of a functional humanoid robot underpins an era of abundance and well-being where poverty will disappear, and people will be able to choose what they want to do. We believe that providing a universal basic income will eventually be a true evolution of our civilization.

Solution

As the demands on our built environment rise, labour shortages loom. With the world’s workforce increasingly moving away from undesirable tasks, the manufacturing, construction, and logistics industries critical to our daily lives are left exposed. By deploying our general-purpose humanoid robots in environments deemed hazardous or monotonous, we envision a future where human well-being is safeguarded while closing the gaps in critical global labour needs.

What You’ll Do

  • Develop next-generation spatial understanding systems for robot locomotion and manipulation, integrating perception and high-level reasoning.
  • Work on open-ended navigation powered by Vision-Language-Action (VLA) models — enabling robots to understand context, predict intent, and act in complex, dynamic environments.
  • Design and scale auto-labeling and large-scale data pipelines to train and evaluate multimodal models for navigation and interaction.
  • Develop and implement scene understanding and 3D reconstruction methods that give robots persistent spatial memory and geometric awareness.
  • Collaborate with cross-functional research and engineering teams to bring large vision-language models into real-world robotic systems.
  • Stay ahead of the field — rapidly evaluate new model architectures, benchmarks, and datasets to guide our embodied AI roadmap

We’re Looking For

  • Deep experience in machine learning for vision or embodied AI, ideally with large models (VLMs, VLAs, transformers, diffusion, or multi-modal architectures).
  • Strong background in scene understanding, spatial reasoning, or 3D reconstruction from visual data.
  • Proficiency in PyTorch and hands-on experience building, fine-tuning, and deploying large-scale ML systems.
  • Strong experimental and research skills — capable of taking projects from concept to model training, evaluation, and robot integration.
  • Comfortable working in a fast-moving, research-driven environment with evolving models, data, and tools.

What We Offer

  • Competitive salary plus participation in our Stock Option Plan
  • Paid vacation with adjustments based on your location to comply with local labor laws, and additional paid sick leave days
  • Travel opportunities to our Vancouver and Boston offices
  • Office perks: free breakfasts, lunches, snacks, and regular team events
  • Freedom to influence the product and own key initiatives
  • Collaboration with top‑tier engineers, researchers, and product experts in AI and robotics
  • Startup culture prioritising speed, transparency, and minimal bureaucracy

How to Apply

Does this role sound like the perfect fit for you?
Fill in the form and include links or files that showcase the best of what you’ve built and achieved.

Apply now

*indicates a required field

Thanks for the request! we have already received your details and will contact you soon

Contact us

Have another role in mind? Let us know what you could bring to the team.