Cloud Operation Center Engineer - Bilingual in Korean

SBT Global, Inc.

Cloud Operation Center Engineer - Bilingual in Korean

Irvine, CA
Full Time
Paid
  • Responsibilities

    Job Description

    Roles & Responsibilities

    1. Firewall Operations & Troubleshooting

    • Create, modify, and manage Palo Alto firewall policies
    • Review traffic logs and validate firewall policy changes
    • Perform network troubleshooting using packet capture / traffic dumps
    • Diagnose issues based on NAT, security policies, and session states

    2. Load Balancer Operations

    • Configure and manage Citrix ADC VPX-based load balancers (LB/CLB)
    • Manage VIPs, services, service groups, and health checks
    • Resolve LB incidents such as backend server failures and connection issues
    • Analyze logs using nstrace, tcpdump, and related tools

    3. OpenStack Private Cloud Operations

    • Provision, deploy, and manage OpenStack instances
    • Manage volumes, shared volumes, networks, and security groups
    • Troubleshoot instance boot failures, network issues, and volume attachment problems
    • Support compute node failures and live migration operations

    4. Server Operations & Vendor Support

    • Operate Dell servers used as OpenStack compute nodes
    • Monitor server health via iDRAC and perform basic maintenance
    • Handle hardware failures and coordinate with Dell support
    • Support hardware replacement and vendor service activities

    5. Storage Operations

    • Manage NetApp ONTAP storage systems
    • Monitor nodes, SVMs, volumes, and aggregates
    • Analyze performance metrics such as latency, IOPS, and throughput
    • Respond to storage incidents and performance degradation issues

    6. Linux OS Operations

    • Perform Linux system administration and incident troubleshooting
    • Handle root password changes, mount issues, repository issues, etc.
    • Resolve boot failures, filesystem issues, and network interface problems
    • Analyze system logs and systemd service issues

    7. Monitoring & Observability

    • Operate monitoring platforms such as Zabbix, Grafana, and Prometheus
    • Monitor servers, networks, storage, OpenStack, LB, and firewall systems
    • Manage alerts, dashboards, metrics, and triggers
    • Perform root cause analysis using logs, metrics, and alerts

    8. Automation & Operational Improvement

    • Automate deployments and updates using Ansible
    • Automate repetitive operational tasks
    • Build workflow automation using Microsoft Teams / Power Automate
    • Automate alerts, reporting, approval, and request workflows
    • Develop scripts using Python, Shell, or PowerShell
    • Leverage AI coding tools for scripting, log analysis, and documentation
  • Qualifications

    Qualifications

    • Experience in Linux OS administration and troubleshooting
    • Understanding of TCP/IP, routing, NAT, and firewall policies
    • Hands-on experience with Palo Alto or similar firewalls
    • Experience with Citrix ADC VPX or similar load balancers
    • Experience with OpenStack or private cloud environments
    • Experience with NetApp ONTAP or enterprise storage systems
    • Experience with Dell servers and iDRAC-based operations
    • Packet capture / traffic dump troubleshooting experience
    • Experience with Ansible or scripting for automation
    • Ability to perform root cause analysis using logs, metrics, and network data

    Preferred Qualifications

    • Experience with Zabbix, Grafana, or Prometheus
    • OpenStack component experience (Nova, Neutron, Cinder, Glance)
    • NetApp ONTAP CLI and performance tuning experience
    • Citrix ADC nstrace / tcpdump troubleshooting experience
    • Microsoft Teams / Power Automate workflow automation experience
    • Scripting skills in Python, Shell, or PowerShell
    • Understanding of REST API, Webhook, JSON, YAML
    • Git-based version control experience
    • Experience with AI-assisted development tools (ChatGPT, Copilot, etc.)
    • Experience using AI for RCA, log analysis, and automation

    Key Skills

    OpenStack, Palo Alto Firewall, Citrix ADC VPX, NetApp ONTAP, Dell Server/iDRAC, Linux Administration, Monitoring (Zabbix/Grafana/Prometheus), Ansible, Automation Tools, Python/Shell/PowerShell, TCP/IP Networking, Packet Analysis, Incident Troubleshooting, Vendor Coordination, AI-assisted Operations

    Additional Information

    All your information will be kept confidential according to EEO guidelines.