AI/ML Engineer

CCS INC

AI/ML Engineer

Plano, TX
Full Time
Paid
  • Responsibilities

    Benefits:

    Bonus based on performance

    Dental insurance

    Health insurance

    Vision insurance

    Qualifications

    Bachelor’s Degree

    6+ years cloud architecture experience

    3+ years building production GenAI/LLM systems on AWS.

    Strong Python and AWS expertise, including Lambda, ECS/EKS, S3, SageMaker, Docker and Kubernetes.

    Production experience with vector databases and designing ingestion + embedding pipelines for both batch and streaming workloads.

    Hands-on with prompt design, evaluation, LLM orchestration, and RAG implementation patterns.

    Experience deploying and operating model- serving or MCP – like server infrastructure (selfhosted or managed).

    Proficient with IaC and delivery tooling, including Terraform/CloudFormation, GitOps, and CI pipelines.

    Experience with model-serving infrastructure, such as Amazon SageMaker, NVIDIA Triton, Ray Serve, or similar platforms.

    Hands-on experience with GenAI libraries and frameworks, including LangChain, LlamaIndex, Hugging Face, and OpenAI APIs.

    Deep operational expertise with vector databases, such as Pinecone, Milvus, Weaviate, or Qdrant.

    AWS Solutions Architect, AWS DevOps Engineer, or equivalent industry certifications.

    Responsibilities

    Cloud Architecture & Infrastructure, Design scalable, secure AWS architectures

    LLM & GenAI Platforms, Lead integration of API-based and self-hosted LLMs, implement RAG solutions

    Prompting & Evaluation, Develop prompt engineering strategies, reusable templates, and evaluation frameworks

    Vector Databases & Retrieval Pipelines, Implement and maintain vector stores (OpenSearch, Pinecone, Milvus, Qdrant)

    Data Ingestion & Processing Pipelines

    Microservices & Serverless Systems

    Python Development & AI Tooling

    Security, Governance & Cross-Functional Leadership