Senior Systems Engineer (Linux / HPC Environment)

Innovative Computer Solutions Group, Inc

Washington, DC
Full Time
Paid
  • Responsibilities

    Benefits:

    Health insurance

    Paid time off

    IN-PERSON (ON-SITE)

    Location: Washington, DC, near Union Station

    Position Overview

    We are seeking a highly skilled Senior Systems Engineer to support and enhance a Linux-based high-performance computing (HPC) environment. This platform is critical for statistical analysis and economic research across multiple business lines within the organization.

    The ideal candidate will bring deep expertise in Linux system administration, automation, and HPC technologies, along with the ability to collaborate with data scientists, economists, and technical teams to deliver reliable, secure, and high-performing infrastructure.

    Key Responsibilities

    System Administration

    Manage, maintain, and optimize Linux-based high-performance computing servers

    Perform system updates, patching, and security hardening

    Monitor system performance and ensure high availability and efficient resource utilization

    Platform Support

    Provide Tier 3 support for the analytics platform, troubleshooting complex technical issues

    Ensure minimal downtime and rapid resolution of production incidents

    Translate business and analytical requirements into scalable technical solutions

    Collaboration & Communication

    Partner with data scientists, economists, and stakeholders to support analytical workloads

    Document system configurations, processes, and troubleshooting procedures

    Contribute to knowledge sharing and continuous improvement initiatives

    Security & Compliance

    Implement and maintain security controls to protect sensitive data

    Conduct vulnerability assessments and security audits

    Ensure compliance with organizational and regulatory standards

    Projects & Engineering

    Participate in platform upgrades, enhancements, and performance tuning initiatives

    Contribute to system design, architecture, and scalability planning

    Support implementation of new tools and technologies

    On-Call Support

    Participate in an on-call rotation to support critical systems and ensure continuous operations

    Required Qualifications

    Strong experience with Linux system administration (Red Hat, Ubuntu, or similar)

    Proficiency in shell scripting and system automation tools

    Hands-on experience with Ansible and Ansible Automation Platform

    Experience supporting high-performance computing (HPC) environments

    Preferred Technical Experience

    Familiarity with HPC technologies such as SLURM and Open OnDemand

    Experience with statistical and analytical tools such as R, Python, MATLAB, Stata, and SAS

    Core Competencies

    Strong problem-solving and troubleshooting skills

    Customer-focused mindset with a sense of ownership

    Excellent communication and collaboration abilities

    Ability to work effectively in cross-functional environments