Engineering Manager – Bare Metal & Clusters

Permanent employee, Full-time · Maria01 (Helsinki)

About:

Imagine a future where everyone has instant, low-cost access to intelligence. We’re building a fully featured European AI cloud - with everything one needs to train, experiment with, and deploy AI models. In addition, our GPUs run on 100% renewable energy.

We’re ambitious, curious, and gutsy doers. We practice a low hierarchy across the company and high morale in our teams. We’ve already achieved a lot, yet we’re only getting started. Now it’s your chance to join the ride. We offer more than just the job - we offer a career-defining opportunity to be part of building something big!

Responsibilities:
  • Lead and coordinate the development of bare-metal and virtualized GPU cluster offerings.

  • Work closely with SRE, hardware, and cluster teams to deliver robust infrastructure.

  • Drive the infrastructure and cluster roadmaps, aligning priorities across teams and ensuring clear delivery goals.

  • Oversee tracking of server infrastructure, ensuring visibility and accountability for hardware usage and deployments.

  • Align team efforts with company objectives, resolving priorities across multiple streams of work.

  • Implement and improve processes for roadmapping, prioritization, and cross-team collaboration.

  • Mentor and support engineers, fostering a culture of collaboration, delivery, and innovation.

  • Contribute to long-term scalability by identifying and addressing systemic infrastructure challenges.

Qualifications:
  • Proven experience as an Engineering Manager or Senior Technical Lead, ideally in a start-up or scale-up environment.

  • Strong background in bare metal server management and distributed computing/HPC.

  • Experience with virtualization in large-scale environments.

  • Strong leadership, organizational, and cross-functional communication skills.

  • Excellent communication skills, both technical and non-technical.

Nice-to-haves

  • Experience with MaaS and infrastructure automation.

  • Experience with the latest generation GPU systems.

  • Experience with high-performance networking, including RDMA-based network fabrics (e.g. InfiniBand, RoCE).

  • Experience with HPC workload orchestration using Slurm and/or Kubernetes.

  • Experience with observability and monitoring stack (Grafana, Prometheus, ELK).

  • Exposure to hardware lifecycle management and data center operations.

What we offer:
  • Company equity - a true stake in our journey.

  • Competitive salary and benefits, including health insurance, lunch benefit, and an annual personal budget (for sport, transport, wellness, or culture).

  • Flexible working environment.

  • Opportunity to work with cutting-edge AI technologies.

  • Career growth within a mission-driven company.

Assessment Process:
  1. Introductory chat (45 mins) - Meet with our Talent Partner to learn more about DataCrunch and share your career goals.

  2. Technical/leadership interview (45 mins) - With our CTO, focusing on your experience in team leadership, project coordination, and building teams and systems.

  3. Technical conversation - A discussion with our engineers, with a deeper dive into your technical approach and ways of working.

  4. Final interview (60 mins) - Meet with our CEO and the wider team.

We are looking forward to hearing from you!
Thank you for your interest in DataCrunch. Please fill out the following short form. Should you have difficulties with the upload of your data, please send an email to lena@datacrunch.io
Uploading document. Please wait.
Please add all mandatory information with a * to send your application.