Cloud Engineer SME

Application Research Center LLC

Cloud Engineer SME

Chantilly, VA
Full Time
Paid
  • Responsibilities

    Benefits:

    401(k) matching

    Bonus based on performance

    Competitive salary

    Dental insurance

    Flexible schedule

    Health insurance

    Paid time off

    Training & development

    Vision insurance

    Job Title: Cloud Engineer SME

    Company: Application Research Center LLC (ARC)

    About ARC:

    Application Research Center LLC (ARC) is a technology-forward organization that provides research-driven, scalable solutions in cloud computing, artificial intelligence, and secure infrastructure. We serve both public and private sector clients with a commitment to innovation, performance, and mission success.

    Position Overview:

    We are looking for an experienced Cloud Engineer SME to join our advanced technology team. The ideal candidate is a hands-on cloud technologist with strong software development, networking, AI integration, and Linux system experience. You will be responsible for designing, implementing, and supporting resilient cloud solutions, while troubleshooting complex systems and mentoring others.

    Key Responsibilities:

    · Architect and implement scalable and secure cloud-native infrastructure using AWS, Azure, or GCP.

    · Develop automation tools and infrastructure as code using Python, Terraform, CloudFormation, or Ansible.

    · Design and troubleshoot VPCs, subnets, DNS, routing, VPN, NAT gateways, and hybrid cloud networking setups.

    · Integrate and optimize AI/ML pipelines in cloud environments (e.g., SageMaker, Azure ML, Vertex AI).

    · Develop and debug microservices, serverless applications, APIs, and data processing workflows.

    · Administer and secure Linux-based systems, monitor performance, and resolve OS-level issues.

    · Build, maintain, and optimize CI/CD pipelines for application deployment and infrastructure provisioning.

    · Write clean, reusable, and testable code with attention to software engineering best practices.

    · Perform root cause analysis of incidents and troubleshoot complex application, network, or infrastructure issues across distributed systems.

    · Set up monitoring, logging, and alerting frameworks using tools like Prometheus, Grafana, CloudWatch, or ELK Stack.

    · Collaborate with DevOps, AI/ML, and Security teams to improve system reliability and performance.

    · Provide expert-level technical guidance and mentorship to team members and stakeholders.

    Core Technical Skills Required:

    o Deep experience with Python, including scripting, automation, and API development.

    o Proficient in software engineering principles and familiar with modern development workflows (Git, testing, CI/CD).

    o Strong troubleshooting skills in cloud infrastructure, distributed systems, and application performance.

    o Networking expertise: TCP/IP, DNS, SSL/TLS, HTTP(S), firewalls, load balancers, and SDN concepts.

    o Cloud-native services and orchestration: Docker, Kubernetes, ECS/EKS/AKS, serverless (Lambda/Functions).

    o Hands-on experience with Linux system administration, troubleshooting kernel, memory, and I/O issues.

    o Familiarity with databases: SQL, NoSQL (PostgreSQL, DynamoDB, Redis, etc.).

    Preferred Qualifications:

    o Cloud certifications (e.g., AWS Certified Solutions Architect, Azure Solutions Architect Expert).

    o Familiarity with AI models

    o Experience with secrets management and identity tools (Vault, IAM, SSO).

    o Knowledge of regulatory and security frameworks (e.g., FedRAMP, NIST, SOC 2, HIPAA).

    o Prior experience supporting mission-critical or government systems.

    Soft Skills:

    · Excellent analytical and problem-solving abilities.

    · Strong verbal and written communication skills, especially in technical documentation.

    · Collaborative mindset with the ability to work in a cross-functional team.

    · Self-motivated and proactive in identifying problems and proposing solutions.

    What We Offer:

    · Competitive salary and performance incentives

    · Remote/hybrid work flexibility

    · Access to innovative AI/cloud infrastructure projects

    · Training and professional development resources

    · Mission-driven culture with real-world impact

    Flexible work from home options available.