DevOps & Cloud Infrastructure

Travoom

DevOps & Cloud Infrastructure

Austin, TX
Full Time
Paid
  • Responsibilities

    Job Description

    Role: Senior DevOps Engineer / Platform Reliability Lead

    OleOle is looking for a senior DevOps leader to own the reliability, scalability, and operational integrity of a global, real-time football platform.

    This s a hands-on leadership role. The core architecture, technologies, and product direction are already defined. The focus now is execution—building infrastructure that scales cleanly, fails gracefully, and supports millions of users during the world’s biggest sporting moments.

    You will be responsible for ensuring that a complex, multi-system platform operates as one reliable, observable, and secure system.

    What you will own

    • End-to-end ownership of cloud infrastructure and platform reliability
    • Design and operation of high-availability, fault-tolerant systems
    • Kubernetes-based environments supporting real-time social, messaging, AI, and financial services
    • CI/CD pipelines that are safe, repeatable, and trusted by engineers
    • Monitoring, logging, alerting, and incident response across the entire platform
    • Security, access control, secrets management, and operational best practices
    • Production readiness for traffic spikes tied to live matches and global tournaments

    This role exists to prevent problems before users ever see them and to restore systems quickly and calmly when issues occur.

    What you’ll work on

    • Operating and scaling real-time systems for live scores, messaging, and in-match activity
    • Supporting AI translation workloads without impacting core platform performance
    • Ensuring wallet, rewards, and financial infrastructure remain secure, auditable, and always available
    • Managing production-grade MediaWiki infrastructure used for large-scale football history content
    • Designing failover strategies so no single system can take down the platform
    • Creating clear separation between development, staging, and production environments

    What we’re looking for

    Required

    • 7+ years of experience in DevOps, SRE, or platform engineering roles
    • Deep experience with AWS and cloud-native architectures
    • Strong Kubernetes and container orchestration experience
    • Proven track record running high-traffic, real-time production systems
    • Infrastructure-as-Code experience (Terraform preferred)
    • Strong understanding of Linux, networking, and system debugging
    • Experience designing systems for reliability, not just deployment

    Strong plus

    • Experience supporting crypto platforms, wallets, or exchanges
    • Experience with Rust or high-performance backend systems
    • Experience with live data feeds, sports, trading, or messaging platforms
    • Prior ownership of incident response and on-call operations

    How you work

    • You think in systems, not tickets
    • You anticipate failure modes instead of reacting to them
    • You communicate clearly and directly when something is unsafe or broken
    • You are comfortable making decisions and taking ownership
    • You focus on stability, clarity, and long-term maintainability

    This is not a role for someone who wants to debate architecture endlessly. The decisions are made. This role is about making them work in the real world.

  • Qualifications

    Qualifications

    Required Experience

    • 7–10+ years of experience in DevOps, SRE, or Platform Engineering
    • Prior experience at a startup or high-growth technology company, ideally from early or mid-stage through scale
    • Proven ownership of production infrastructure for high-traffic, real-time platforms
    • Deep hands-on experience with cloud-native architecture (AWS preferred)
    • Strong experience operating Kubernetes in production environments
    • Infrastructure-as-Code experience (Terraform or equivalent)
    • Demonstrated ability to design systems for reliability, fault tolerance, and scalability
    • Experience leading or owning incident response and production operations

    Strongly Preferred

    • Working knowledge of Rust or experience supporting high-performance backend systems
    • Experience with blockchain, crypto wallets, or exchange infrastructure
    • Background in fintech, payments, trading, or financial systems
    • Experience securing systems that handle transactions, keys, and sensitive data
    • Familiarity with real-time data pipelines, messaging systems, or event-driven architectures

    What distinguishes a great candidate

    • You have seen platforms break under real-world pressure — and fixed them
    • You understand how early technical decisions affect long-term scale
    • You know when to move fast and when stability matters more
    • You think in terms of risk, failure modes, and blast radius
    • You are comfortable taking ownership without needing constant direction

    What we are not looking for

    • Junior or mid-level DevOps engineers
    • Candidates without real production ownership
    • Purely theoretical or certification-only backgrounds
    • Engineers who want to debate decisions instead of executing them

    Additional Information

    Solutions not problems .

    • Creative problem solver who can courageously propose and support new ideas to our organization. Not interested in best practices, lets build something better!
    • Ability to adapt. An ideal candidate will welcome the opportunity to solve a broad range of problems using a wide array of technologies.
    • Comfortable with ambiguity, shifting priorities and general growing pains of an early-stage technology company
    • An exceptional entrepreneurial judgment that fosters independence over micro-management
    • Understanding of football and international sports a huge plus

    Ole Ole is located in beautiful Austin Texas, however, this role requires some travel we are privately held and rapidly growing!