Job Description
Job Summary
Mirantis is looking for a commercially driven, deeply technical Product Manager to own AI inference and model serving for k0rdent AI, our control plane for GPU infrastructure and distributed AI workloads. This role sits at the intersection of AI inference, cloud-native infrastructure, distributed systems, and performance engineering. You will define how NeoClouds and Enterprise customers deploy, scale, and operate production inference services while extracting maximum performance from the underlying GPU, network, and storage infrastructure.
This role owns product strategy and solution development for inference products across on-premises, cloud, and edge environments. The scope includes serverless inference, dedicated endpoints, workload placement, autoscaling, routing, lifecycle management, observability, and full-stack performance optimization. This person will define how customers run production model-serving workloads at scale while improving latency, throughput, utilization, reliability, cost, and operational control.
The ideal candidate has experience with high-performance infrastructure products and understands how production systems behave under real-world load. They should be comfortable reasoning across the full stack, identifying performance bottlenecks, evaluating system design trade-offs, and translating technical insight into clear product requirements, architecture direction, and customer-facing solutions.
Responsibilities
Qualifications
Why you’ll love Mirantis
Additional Information
What does Mirantis offer you?
It is understood that Mirantis, Inc. may use automated decision-making technology (ADMT) for specific employment-related decisions. Opting out of ADMT use is requested for decisions about evaluation and review connected with the specific employment decision for the position applied for. You also have the right to appeal any decisions made by ADMT by sending your request to isamoylova@mirantis.com.
By submitting your resume, you consent to the processing and storage of your personal data in accordance with applicable data protection laws, for the purposes of considering your application for current and future job opportunities.
#remote
We are a Leader for Container Management in G2 (#2 after AWS)!