Nebius AI

About Nebius AI

Empowering AI with robust infrastructure solutions

🏢 Tech👥 51-250📅 Founded 2022📍 Amsterdam, North Holland, Netherlands

Key Highlights

  • Publicly traded on Nasdaq, expanding AI infrastructure market
  • Headquartered in Amsterdam with hubs in the US, Europe, and Israel
  • Team of around 400 skilled engineers focused on AI/ML
  • Specializes in large-scale GPU clusters and cloud platforms

Nebius is a Nasdaq-listed company headquartered in Amsterdam, specializing in AI infrastructure solutions. With a team of around 400 engineers, Nebius provides large-scale GPU clusters and cloud platforms designed to support the rapid growth of the AI industry. The company has established R&D and co...

🎁 Benefits

Nebius offers competitive equity packages, a flexible PTO policy, and opportunities for remote work. Employees also benefit from a learning budget to ...

🌟 Culture

Nebius fosters a culture centered around engineering excellence and innovation in AI infrastructure. The company values collaboration across its globa...

Skills & Technologies

Overview

Nebius AI is seeking a Senior HPC Cluster Engineer to enhance and optimize their cutting-edge hyperscaler platform. You'll work with GPU computing and InfiniBand networks, focusing on performance tuning and automation. This role requires expertise in high-performance computing environments.

Job Description

Who you are

You have a strong background in high-performance computing (HPC) and experience with GPU clusters and InfiniBand networks. Your expertise in KVM and QEMU technologies allows you to ensure high performance and security in multi-GPU environments. You are skilled in analyzing and troubleshooting complex systems, and you have a passion for optimizing infrastructure to support new hardware. You thrive in collaborative environments and are eager to contribute to innovative AI cloud solutions.

You possess a deep understanding of hardware virtualization and device emulation technologies, which enables you to fine-tune system performance effectively. Your experience in automating fault detection and resolution in HPC environments showcases your proactive approach to problem-solving. You are committed to continuous learning and professional growth, and you are excited about the challenges presented by the evolving AI landscape.

Desirable

Experience with cloud computing platforms and familiarity with AI/ML technologies would be advantageous. A background in software engineering or related fields can enhance your contributions to the team. You are open to exploring new technologies and methodologies that can improve system performance and efficiency.

What you'll do

As a Senior HPC Cluster Engineer at Nebius AI, you will play a key role in the development of our hyperscaler platform. You will work closely with the GPU & InfiniBand team to enhance and optimize core components of our cloud platform. Your responsibilities will include tuning the performance of GPU clusters and InfiniBand networks, ensuring that our infrastructure meets the demands of our customers.

You will analyze and troubleshoot existing systems, identifying areas for improvement and implementing solutions that enhance performance and reliability. Your role will involve collaborating with cross-functional teams to support the integration of new hardware and technologies into our platform. You will also be responsible for automating processes related to fault detection and resolution, contributing to the overall efficiency of our HPC environments.

In addition to your technical responsibilities, you will have the opportunity to mentor junior engineers and share your knowledge with the team. You will participate in design reviews and contribute to the architectural decisions that shape our cloud infrastructure. Your insights will help drive innovation and ensure that we remain at the forefront of AI cloud solutions.

What we offer

At Nebius AI, we provide a competitive salary and a comprehensive benefits package that supports your professional growth. We value flexibility in the workplace, offering remote working arrangements that allow you to balance your personal and professional life. Our dynamic and collaborative work environment encourages initiative and innovation, making it an exciting place to grow your career.

As part of our team, you will have the chance to work alongside some of the most experienced and innovative leaders in the field of AI cloud infrastructure. We are committed to expanding our products and services, and we encourage you to apply even if your experience doesn't match every requirement. Join us in shaping the future of cloud computing for the global AI economy.

Interested in this role?

Apply now or save it for later. Get alerts for similar jobs at Nebius AI.

Similar Jobs You Might Like

Based on your interests and this role

Nebius AI

Hpc Cluster Engineer

Nebius AI📍 Prague - Remote

Nebius AI is seeking a Senior HPC Cluster Engineer to enhance and optimize their cloud platform focused on GPU computing and InfiniBand networks. You'll work with technologies like KVM and QEMU to ensure high performance in multi-GPU environments. This role requires expertise in Linux and virtualization technologies.

🏠 RemoteSenior
16h ago
Nebius AI

Systems Engineer

Nebius AI📍 Amsterdam - On-Site

Nebius AI is seeking a Systems Engineer to support benchmarking of GPU platforms for machine learning and AI workloads. You'll work closely with hardware and development teams to evaluate GPU performance using technologies like CUDA. This position requires expertise in AI and deep learning frameworks.

🏛️ On-SiteMid-Level
16h ago
Nebius AI

Hypervisor Engineer

Nebius AI📍 Amsterdam - Remote

Nebius AI is seeking a Senior Hypervisor Engineer to contribute to the development of their hyperscaler platform. You'll work with KVM and QEMU to optimize cloud infrastructure for AI applications. This role requires expertise in hardware virtualization and device emulation.

🏠 RemoteSenior
16h ago
Nebius AI

Systems Engineer

Nebius AI📍 Amsterdam - Remote

Nebius AI is hiring a Senior Systems Engineer for their Virtual Private Cloud Team to design and operate core networking layers of their cloud platform. You'll work with AWS and networking technologies in Amsterdam or remotely across Europe.

🏠 RemoteSenior
16h ago
Canonical

Hpc Software Engineer

Canonical📍 Americas - Remote

Canonical is hiring an HPC Software Engineer to deliver an outstanding HPC experience as part of the broader Ubuntu platform. You'll focus on Python software development for automation in the HPC sphere. This role requires strong mathematical and scientific skills.

🏠 RemoteMid-Level
1 month ago