Site Reliability Engineer

NERD UNITED DAO LLC

Lehi, UT
Full Time
Paid
  • Responsibilities

    We are a bunch of Nerds making waves in the Web 3.0 space. We are well funded and in a state of hyper-growth. We count on our Site Reliability Engineers (SREs) to give our users a rich feature set, high availability, and stellar performance. As we expand our customer deployments, we are seeking an experienced SRE to deliver insights from massive-scale data in real time. Specifically, we are searching for someone who brings fresh ideas, demonstrates a unique and informed viewpoint, and enjoys collaborating with a cross-functional team to develop real-world solutions and positive user experiences at every interaction.

    OBJECTIVES OF THIS ROLE

    • Maintain the production environment by monitoring availability and taking a holistic view of system health
    • Deploy software and systems to manage platform infrastructure and applications
    • Improve reliability, quality, and time-to-market of our suite of software solutions
    • Measure and optimize system performance, pushing our capabilities forward, getting ahead of customer needs, and innovating to continually improve
    • Provide primary operational support and engineering for software applications

    DAILY AND MONTHLY RESPONSIBILITIES

    • Gather and analyze metrics from both operating systems and applications to assist in performance tuning and fault finding
    • Partner with development teams to improve services through rigorous testing and release procedures
    • Participate in system design consulting, platform management, and capacity planning
    • Create sustainable systems and services through automation
    • Balance feature development speed and reliability with well-defined service level objectives

    REQUIRED SKILLS AND QUALIFICATIONS

    • Ability to program (structured and OO) in one or more high-level languages, such as Python, Java, C/C++, Ruby, or JavaScript
    • Experience with distributed storage technologies such as NFS, HDFS, Ceph, and S3, as well as dynamic resource management frameworks (Mesos, Kubernetes, YARN)
    • A proactive approach to spotting problems, areas for improvement, and performance bottlenecks

    PREFERRED QUALIFICATIONS

    • Previous success in technical engineering
    • Coding experience