Sorry, this listing is no longer accepting applications. Don’t worry, we have more awesome opportunities and internships for you.

High Performance Computing Development Lead

Digital Infuzion

High Performance Computing Development Lead

Gaithersburg, MD
Full Time
Paid
  • Responsibilities

    Job Description

     

    HIGH PERFORMANCE COMPUTING DEVELOPMENT LEAD

     

    Welcome! At Digital Infuzion, we believe people can lead better, healthier lives. To do so, researchers need insights faster, and providers need on-demand data and tailored software solutions. Which is why we are passionate about developing innovative solutions for the healthcare industry so researchers and providers can better serve their patients. We go beyond ordinary health IT services and solutions because we see the advancement of technology and bioinformatics as opportunities to make meaningful impacts in patient’s lives. If you feel drawn to doing what you love in a creative, open, and growth-oriented environment all while helping people live healthier lives, then keep scrolling - we may have just the opportunity for you.

     

    JOB DESCRIPTION

    We are looking for a System Administrator with experience in both GPU and HPC scaling of computational analysis to support our work at NIH. The position will be based in Bethesda/Rockville, MD. This position would entail both designing and implementing hardware and software for HPC systems. This could include implementation of OpenMPI, OpenMP, CUDA, OpenCL, Vulkan, SLURM, UnivaGrid Engine, Singularity, Infniband, etc. as well as designing optimal hardware platforms to implement these software frameworks on. In addition to these tasks, we are looking for an experienced System administrator who can design, develop and implement builds using Ansible, Puppet, or similar tools to support automated deployment and continuous delivery and continuous integration solutions. Similarly, implementing, monitoring, assessing, and predicting computational and network capacity as well as engaging in performance planning, running software upgrades after hours on production platforms, data-center build-outs, and change management will be required. Administration of multiple databases and providing appropriate sharding, replication, and partitioning as well as storage systems for high redundancy and availability. The position will also entail management of IT infrastructure, policies, and governance principles. 

     

    The successful applicant will also be involved with managing scientific software including Matlab, Simulink, Python, R, SAS, Spark, Imaris, IMOD, VTK/ITK, Columbus, etc. Management includes installation, updates, patching, integration, and scaling of these solutions. Finally, candidates will be expected to create and use standard approaches and best practices to create and maintain automated test plans, test scripts, test suites, etc.as well as stress testing for availability and redundancy of databases and file storage systems. Finally, an ideal candidate will also have helped architect and implement hybrid-cloud infrastructure for large organizations. 

     

    RESPONSIBILITIES

    • Works with technical management to architect NIH networks to effectively reflect business needs, service-level agreements and high availability requirements.
    • Capacity and performance planning, running software upgrades after hours on production platforms, data-center build-outs, and change management
    • Understanding of microservices architecture, containers (Docker), and Kubernetes
    • Execute routine changes, provision services, and deployments.
    • Fault isolation, service recovery, and incident analysis
    • Hands-on experience with major HPC services related to compute, network, storage, content delivery, administration and security, deployment and management, and automation technologies.
    • Hands-on experience with major Cloud foundation services related to compute, network, storage, content delivery, administration and security, deployment and management, and automation technologies.
    • Experience with enterprise hardware monitoring applications
    • Experience with Enterprise HPC and Cloud Storage
    • Ability to configure and troubleshoot VMWare
    • Disaster Recovery concepts and architectures
    • Experience working on Linux and Windows systems.
    • Continually updates understanding of business and technology status and objectives and responds to strategic design requests as the business evolves
    • Excellent verbal and written communication skills, with an emphasis on preparing clear and concise communication and giving oral presentations
    • Ability to handle multiple tasks
    • Critical thinking and reasoning ability is an essential part of this position
    • Ability to understand systems and system concepts; ability to break down complex user requirements into easy-to implement solutions is an important part of this position
    • Will be architecting, designing and implementing HPC hardware and software systems
    • Will be architecting, designing and implementing testing and validation methods. 
    • Focus on designing, developing and implementing automation to support continuous delivery and continuous integration solutions 
    • Create a standard approach and best practice to create and maintain automated test plans, test scripts, test suites, etc.
    • Create conceptual, logical and physical design for on-prem systems 
    • Maintain established service agreements to manage customer expectations and quality standards
    • Present ideas to both technical and non-technical users and staff to further the adoption of DevOps

    REQUIREMENTS

    • 5+ years of HPC system administration experience
    • Bachelor’s degree in Computer Sciences or related field
    • Experience with Network management at NIH – preferred
    • Experience with hybrid computational infrastructure - preferred
    • See above for more specific details

     

    DIGITAL INFUZION, INC. IS AN EQUAL OPPORTUNITY EMPLOYER. EOE/AA/M/F/D/V

    It is the policy of Digital Infuzion, Inc. to provide equal employment opportunities without regard to race, color, religion, sex, gender identity, sexual orientation, national origin, age, disability, marital status, veteran status, genetic information or any other protected characteristic under applicable law.

     

    Company Description

    Do you want to be part of something bigger? A place where your insights can transform the data-driven work of healthcare researchers and professionals. Does your heart beat faster when you’re actively... Advancing the understanding of data behind a disease? Solving the problem of medical research statistics that don’t add up? Taking a stand for a healthcare cause that you believe in, to promote awareness through data analysis? At Digital Infuzion, you will find a group of colleagues who share your passion for this mission. Our insights accelerate the work of physicians, researchers, and other healthcare providers. Join our team. You can learn more about Digital Infuzion here: https://www.digitalinfuzion.com/ https://www.linkedin.com/company/digital-infuzion/

  • Industry
    Hospital and Health Care