• Staff, Software Engineer - Conversational AI

    Walmart (Sunnyvale, CA)
    …(more computations) and model serving latency. So, we are always on a quest to crunch more numbers while preserving our SLAs and controlling the operational ... in terms of architecture, tooling (TensorFlow Serving? / ONNX? / Triton?) and infrastructure (CPU? / GPU?, GCP? / Azure?) for model serving based on the latest…
    Walmart (05/16/25)