Job Description
Role: AWS DevOps Engineer - Observability
Location: Erie, PA
Duration: Long Term
Job Description:
- Design, develop, and maintain an observability strategy that covers application performance, user experience, and system health on AWS and on-premises, using tools such as Splunk Observability Cloud, CloudWatch, and OpenTelemetry.
- Design, configure, and maintain AppDynamics dashboards tailored to business needs, giving clear visibility into key metrics and KPIs.
- Implement OpenTelemetry SDKs and APIs across applications to collect traces, metrics, and logs. Ensure consistent instrumentation in .NET, Java, COTS, and other applications in the environment.
- Integrate and configure Splunk Observability Cloud to capture detailed trace data, enabling a comprehensive view across distributed applications.
- Ensure dashboards highlight trends, bottlenecks, and critical alerts for quick incident response.
- Define alerts and thresholds in both AppDynamics and Splunk to detect anomalies early.
- Extensive hands-on experience with installing necessary agents on servers, virtual machines, and AWS-supported services, and forwarding logs to a centralized location with configured aggregators.
- Deep understanding of logs, metrics, and tracing services and their capabilities.
- Conduct assessments related to AWS platform and observability, providing detailed comparative analysis, cost metrics, advantages of various monitoring tools, and setting benchmarks.
- Develop and maintain documentation processes by creating templates for observability, including assessment checklists, questionnaires, presentations, and proof of concepts with a hands-on approach.
- Participate in sessions and workshops for clients and internal team members on observability, delivering high-quality presentations.
- Participate in solution design and proposal development activities.
- Proven experience in IT monitoring and observability with a focus on cloud environments.
Requirements:
- Bachelor’s degree in Computer Science, Information Technology, or a related field.
- At least 10 years of IT experience, with a minimum of 6 years focused on AWS Cloud with an emphasis on observability.
- Extensive experience implementing end-to-end unified observability using OpenTelemetry and Splunk Observability Cloud in multi-cloud environments.
- Experience designing, configuring, and maintaining AppDynamics dashboards tailored to business needs.
- Extensive experience with AWS services and a thorough understanding of compute, storage, networking, security, and database services in the cloud such as EC2, S3, VPCs, Network Flow Logs, RDS, etc.
- Expertise in AWS monitoring solutions such as Amazon CloudWatch and AWS CloudTrail.
- Proficiency in AWS-supported monitoring solutions such as Dynatrace, AppDynamics, Datadog, and Sumo Logic.
- Experience in monitoring infrastructure, APIs, microservices, JVMs, and RUM, with the ability to create necessary dashboards for visualization.
- Experience integrating with notification systems and incident management systems.
- Understanding of self-healing concepts and AIOps.
- Proficiency in programming languages such as Python, Java, or Go.
- Ability to work independently and as part of a team, demonstrating leadership when required.
- Relevant cloud certifications are required.
Regards,
Manoj
Derex Technologies INC
Contact: 973-834-5005 Ext 206