← all jobs

[Remote] AI Evaluations Engineer

Work from home Full-time role Hiring

Note: The job is a remote job and is open to candidates in USA. Ellipsis Health is a health technology company seeking an AI Evaluations Engineer to join their AI Evaluation team. The role involves building infrastructure and tooling for AI system evaluations, developing evaluation frameworks, and improving developer experience.

Responsibilities

  • Build and maintain infrastructure and tooling for the AI evaluations platform used by internal teams, including automated testing platform for AI voice agents, debugging and observability tools
  • Develop and productionalize evaluation frameworks for individual system components such as ASR, LLMs, TTS, knowledge bases, and guardrails
  • Partner with ML, engineering and QA teams to translate evaluation requirements into robust, maintainable infrastructure and tooling
  • Improve developer experience by making evaluation systems easy to extend, well-documented, and reliable in day-to-day use
  • Ensure evaluation tooling meets production standards for reliability, performance, and maintainability

Skills

  • 5+ years of professional software engineering experience, with a strong focus on building backend systems, platforms, or developer tooling
  • Proven experience designing and maintaining production-grade infrastructure with code, including APIs, services, and data pipelines
  • Strong proficiency in at least one general-purpose programming language (e.g., Python, Typescript/Javascript, Java, or similar)
  • Experience using test automation frameworks, evaluation pipelines, or CI/CD-integrated testing systems
  • Familiarity with observability and debugging tools (logging, metrics, tracing) and building internal tools that improve developer and QA workflows
  • Strong debugging skills and a methodical approach to diagnosing production and evaluation issues
  • Ability to collaborate effectively across engineering, QA, and operations teams, translating requirements into reliable, maintainable systems
  • Product-minded approach to infrastructure, with attention to usability, documentation, and long-term maintainability
  • Experience working with complex, multi-component systems (e.g., ASR, LLMs, TTS, or other ML-powered services)
  • Experience working in healthcare or other regulated environments, including awareness of HIPAA and PHI handling
  • Familiarity with conversational AI or voice agents, including multi-turn dialogue, latency constraints, and error recovery
  • Familiarity with LLM observability or evaluation tools (e.g., Langfuse, prompt eval frameworks)
  • Background in digital health, care coordination, or patient-facing systems

Benefits

  • 401(k) matching
  • Health, vision, and dental insurance
  • Very flexible paid time off

Company Overview

  • AI Nursing Care Manager It was founded in 2017, and is headquartered in San Francisco, California, USA, with a workforce of 11-50 employees. Its website is http://www.ellipsishealth.com.
  • Company H1B Sponsorship

  • Ellipsis Health has a track record of offering H1B sponsorships, with 2 in 2026, 6 in 2025, 1 in 2024, 2 in 2023, 1 in 2021. Please note that this does not guarantee sponsorship for this specific role.
  • More open positions

    [Remote] Account Manager II - TPA (Remote)

    Work from home Full-time role

    [Remote] Principal Solutions Architect - Healthcare Pharmacy (Remote)

    Work from home Full-time role

    [Remote] Systems Engineering Manager (Remote)

    Work from home Full-time role

    [Remote] Senior Technical Analyst - Provider Data Management Systems (Remote)

    Work from home Full-time role

    [Remote] Lead Data Analyst - Value Based Care Reporting (Remote)

    Work from home Full-time role

    Digital Production Specialist (Temporary) - Alfred Music

    Work from home Full-time role

    Licensed Crisis Counselor - Fully Remote in Deming, NM

    Work from home Full-time role

    Cannabis Internet Dispensary

    Work from home Full-time role

    [Remote] Application Engineering Manager

    Work from home Full-time role

    [Remote] Financial Analyst III

    Work from home Full-time role

    Experienced Full Stack Data Scientist – Retail Strategic Health Analytics at careerzynith

    Work from home Full-time role

    Mging Consul Eng- DV

    Work from home Full-time role

    Remote Customer Service Representative – Aviation Passenger Support & Travel Solutions – Work‑From‑Home Role at careerzynith

    Work from home Full-time role

    Clinical Informatics (RN), Principal - Faulkner

    Work from home Full-time role

    [Remote] Global Lead Intelligence Analyst

    Work from home Full-time role

    [Remote] Senior Software Engineer, Strategy Research Analytics

    Work from home Full-time role

    Medical Director (Ortho/Total Joint)

    Work from home Full-time role

    Search Engineer, Technical Support (Government Sector) - EVERGREEN ROLE

    Work from home Full-time role

    2026 Raytheon Part Time Co-op - Software Engineer (Remote)

    Work from home Full-time role

    Experienced Social Media Customer Support Representative - Work From Home Opportunity at careerzynith

    Work from home Full-time role

    [Remote] Sales Executive

    Work from home Full-time role