AI Engineer - AI Department

DESIGNLIBRO INC

AI Engineer - AI Department

San Jose, CA
Full Time
Paid
  • Responsibilities

    Benefits:

    401(k)

    401(k) matching

    Competitive salary

    Dental insurance

    Health insurance

    Opportunity for advancement

    Paid time off

    Relocation bonus

    Vision insurance

    Wellness resources

    Petlibro | San Jose, CA

    About Petlibro

    Petlibro is a design-thinking company creating products that nurture the intertwined lives of pets and their people. We launched with a philosophy that good design, in form and in function, can make a difference. Petlibro innovates with the latest technology to solve everyday problems for modern pet parents and revolutionize how we care for our pets.

    Since 2019, Petlibro has grown into one of the best-selling pet tech brands globally. From smart feeders with app insights to ultra-filtered automatic fountains to pet-health-focused smart apps, our products are engineered to magnify the bond between your pet and you. We are now building AI-powered products and services for modern pet care.

    Job Summary

    We are looking for an AI Engineer to design, train, and deploy AI models that power intelligent pet care and animal monitoring products. You will work across the full model lifecycle, from data curation and fine-tuning to production serving and on-device deployment. The role spans computer vision, vision-language models (VLMs), large language models (LLMs), and agentic AI systems, with a strong focus on animal behavior understanding and multimodal reasoning.

    Key Responsibilities

    Fine-tune LLMs and VLMs for pet care and animal behavior domains using modern post-training methods (SFT, DPO, GRPO/DAPO)

    Develop and evaluate computer vision models for animal detection, pose estimation, activity recognition, and health monitoring

    Build multimodal AI pipelines that combine video, audio, weight sensor data, and other sensor modalities for real-time animal behavior analysis

    Design and operate model serving infrastructure with high throughput and low latency (vLLM, TensorRT-LLM)

    Implement RAG systems and agentic workflows for domain-specific knowledge retrieval and automated decision-making

    Optimize models for on-device / edge deployment using quantization (GPTQ, AWQ, INT4/INT8), distillation, etc.

    Curate and manage training datasets including synthetic data generation and annotation pipelines for animal imagery and video

    Develop evaluation frameworks and benchmarks specific to animal AI tasks (behavior classification accuracy, false alert rates, etc.)

    Collaborate with firmware and embedded engineers to ship models on resource-constrained hardware

    Monitor model performance in production, set up drift detection, and maintain model versioning and rollback processes

    Qualifications & Skills

    Bachelor's or Master's degree in Computer Science, AI/ML, Electrical Engineering, or a related field

    2+ years of hands-on experience training and deploying ML models in production

    Strong proficiency in Python; familiarity with C++ for performance-critical paths

    Deep experience with PyTorch and the Hugging Face ecosystem (Transformers, PEFT, TRL, Datasets)

    Practical experience with the modern post-training pipeline: SFT, preference optimization, and RL-based methods

    Understanding of modern model architectures: Mixture of Experts, long-context models, multimodal encoders

    Experience with inference optimization: quantization, speculative decoding, KV-cache management, batching strategies

    Familiarity with MLOps tooling: experiment tracking (W&B, MLflow), model registries, CI/CD for ML

    Knowledge of cloud GPU infrastructure (AWS, GCP) and cost-efficient training/serving strategies

    Strong problem-solving skills and ability to work in a fast-paced, cross-functional environment

    Background in animal behavior, veterinary science, or bioinformatics is a strong plus

    Nice to Have

    Experience with video understanding models and temporal reasoning over long sequences

    Hands-on experience building agentic AI systems or tool-using LLM workflows

    Experience with structured outputs, function calling, and LLM-as-judge evaluation patterns

    Familiarity with on-device ML frameworks (Core ML, TensorFlow Lite, ONNX Runtime, ExecuTorch)

    Experience with synthetic data generation for training data augmentation

    Understanding of distributed training (FSDP, DeepSpeed) for large-scale model training

    Knowledge of AI safety, alignment techniques, and responsible AI practices

    Experience with Docker, Kubernetes, and production deployment pipelines

    Why Join Us

    Work directly on AI that improves animal welfare and pet care. Your models have real-world impact on pets and their owners

    Full-stack AI role, from research and training to production serving and on-device deployment

    Access to GPU compute and cloud infrastructure for experimentation and training

    Collaborative team that values shipping over slides

    Competitive salary, equity, and benefits

    We're building the future of intelligent pet care. If you want to push the boundaries of multimodal AI and apply it to a domain that genuinely matters, we want to talk.