Sorry, this listing is no longer accepting applications. Don’t worry, we have more awesome opportunities and internships for you.

Site Reliability Engineers

The Midtown Group

Site Reliability Engineers

Manassas, VA
Full Time
Paid
  • Responsibilities

    Job Description

    Our client, a leading and reputable company dedicated to providing secure financial messaging services with cryptography emphasis to government and private sector clients is seeking SITE RELIABILITY ENGINEERS. SITE RELIABILITY ENGINEER JOB DESCRIPTION SUMMARY:

    In line with Division objectives and under guidance of a manager, the SITE RELIABILITY ENGINEER develops the methods and measures of analysis based on customer and contractual obligations. Analyzes the reliability in design of company products and services. Co-ordinates technical support/administration for moderate to highly complex systems/databases/applications for internal or external customers ensuring reliability requirements ranging from high to mission critical. Prepares reports, charts and diagrams to disclose results and highlight areas for further investigation. Identify toil and leverage automation to help eliminate it.

    SITE RELIABILITY ENGINEER RESPONSIBILITIES:

    • Exert technical influence to improve the reliability of our production products and systems.

    • Resolution of highly complex problem management issues through investigation and solution development for effective mitigation and prevention of future recurrence by means of process, procedure, or tools improvements.

    • Execution of production installations including configuration setups, error message handling, and service verification and review of operational procedures in accordance with the established process

    • Design, develop, test and maintain automation tools for infrastructure and problem management analysis

    • Provide effective and detailed systems analysis that can contribute to definition of throughput requirements, information and application data flows, hardware and software requirements, and alternative approaches.

    • Actively lead and participate in design review meetings for medium to large size/complexity/risk projects.

    • Participate in system/network projects/enhancements by representing the department and providing technical advice/ solutions ensuring adherence to documented processes and procedures and risk mitigation effort

    • Provide expert on-call support

    • Regular work on the weekends, mainly on Saturday, in support of production deployments

    • Interact with network services, software systems engineering and applications development in order to restore availability of services and identify root cause of complex problems

    • Provide technical guidance, mentorship, and coaching to less senior team members

    • Remain engaged in industry trends and best practices and share with others on the team and management

    • Steward reliability as a feature across the organization through concepts such as SLOs and service maturity.

    SITE RELIABILITY ENGINEER QUALIFICATIONS:

    • University degree in IT / Engineering or equivalent Experience
    • At least 6 years of experience in a similar position in a technical support environment including software development / debugging and problem analysis in support of mission critical applications and services.

    PROFESSIONAL KNOWLEDGE AND SKILLS:

    • Strong problem solving orientation and skills
    • Excellent communication skills, both verbally and in writing
    • Experience with distributed systems with high availability requirements and balancing the service reliability, sustainability, and technical debt for services running at scale
    • Demonstrated leverage of a methodical and analytical mindset during problem investigations and management of incidents
    • Ability to work under pressure
    • Comfort with RHEL and HP-UX
    • Familiarity with configuration and deployment management software such as BitBucket, Jenkins and Ansible.
    • Analytics software such as Elastic and Kibana
    • DB : Proficient working knowledge of Oracle DB
    • Middleware: Tuxedo, MQSeries
    • Public Key Infrastructure technologies
    • Programming languages: Scripting languages, including ksh and Perl
    • Exposure to languages such as C/C++ and Java.
    • Network technologies: TCP/IP, DNS, Firewall, ADC, VPN

     


    For 31 years, the Midtown Group has been connecting talented professionals with incredible employment opportunities.

    We are a small, woman-owned business certified by the Women’s Business Enterprise National Council (WBENC). Operating from our headquarters in Washington, DC, we provide trusted staffing services nationwide. Our clients include thousands of the most prestigious Fortune 500 companies, law firms, financial organizations, tech innovators, non-profits, and lobbying firms, as well as federal, state and local government agencies.

    Whether you’re looking for a temporary role or direct hire position, Midtown wants you to Love What You Do.

    To get you there, we can navigate the best options in your industry, connect you directly to hiring managers, and give you the insider tips you need to nail your interview.

    You’ve got the skills. You’ve got the experience. Let Midtown get you the response your resume deserves.

    Check us out! WWW.THEMIDTOWNGROUP.COM

    Finding the perfect fit should always be this easy.