• Etched.ai, Inc. (San Jose, CA)
    A tech company specializing in AI inference seeks a technical team member to build installation guides, SDK flows, and reproducible performance metrics. This role ... requires deep knowledge of ML inference infrastructure, software engineering, and the ability to work collaboratively across teams. Candidates should possess… more
    job goal (01/13/26)
    - Save Job - Related Jobs - Block Source
  • Hamilton Barnes Associates Limited (San Francisco, CA)
    …of the most advanced AI workloads worldwide. They're now building a serverless inference platform , beginning with cost-efficient batch inference and ... want to miss this opportunity! Key Responsibilities Take ownership of the inference platform architecture, from batch to low-latency workloads. Design, build,… more
    job goal (01/13/26)
    - Save Job - Related Jobs - Block Source
  • Motion Recruitment Partners LLC (San Jose, CA)
    …reimagining how teams communicate using real-time AI companions and we're looking for a Senior Machine Learning Engineer to help scale our ML systems into ... systems and AI. You'll be part of a small, senior team building out the backend for our intelligent...team building out the backend for our intelligent assistant platform , handling real‑time ML inference , streaming data,… more
    job goal (01/12/26)
    - Save Job - Related Jobs - Block Source
  • Otter.ai (Mountain View, CA)
    Senior Software Engineer , Infrastructure (ML and Real-Time Speech) Mountain View, CA The Opportunity We're looking for an experienced Senior Software ... self, and do your best work every day. At Otter, we've built a platform that simplifies note taking, saves time, improves productivity and accessibility, and makes… more
    job goal (01/13/26)
    - Save Job - Related Jobs - Block Source
  • Roku, Inc. (San Jose, CA)
    …work. Roku is changing how the world watches TV Roku is the #1 TV streaming platform in the US, Canada, and Mexico, and we've set our sights on powering every ... TV. Our mission is to be the TV streaming platform that connects the entire TV ecosystem. We connect...We seek an outstanding, creative, and passionate Machine Learning engineer to join Roku's Recommendation team. You will be… more
    job goal (01/12/26)
    - Save Job - Related Jobs - Block Source
  • X Development, LLC (Mountain View, CA)
    …You will be part of a talented, interdisciplinary engineering team helping build an AI platform from the ground up. How you will make 10X Impact Design and implement ... robust, end to end production-level software to enable inference , optimization, and other complex services as part of...practical experience 3+ years (or 5+ years for more senior levels) of experience building large, complex software systems… more
    job goal (01/13/26)
    - Save Job - Related Jobs - Block Source
  • Aera Technology (Mountain View, CA)
    …ability to generate value and unlock opportunities that were previously unattainable. As a Senior Infrastructure Platform Engineer , you will play a crucial ... enterprise-grade AKS clusters built for high concurrency, performance, and real-time AI inference , ensuring the platform is globally distributed and highly… more
    job goal (01/13/26)
    - Save Job - Related Jobs - Block Source
  • Neara (Palo Alto, CA)
    Job type: Full Time Department: Backend Engineer Work type: On-Site About A rchetype AI Archetype AI is developing the world's first AI platform to bring AI into ... jobsarchetypeaiio. About the Role Were looking for a highly motivated backend engineer with a passion for building performant, scalable, and resilient distributed… more
    job goal (01/12/26)
    - Save Job - Related Jobs - Block Source
  • NVIDIA Corporation (Santa Clara, CA)
    Senior Data Center Performance Engineer - Benchmarking and Optimization page is loaded## Senior Data Center Performance Engineer - Benchmarking and ... to do their best work.NVIDIA has a rapidly expanding ecosystem of data center platform designs. From single node HGX/DGX systems all the way up to large multi-node… more
    job goal (01/13/26)
    - Save Job - Related Jobs - Block Source
  • NVIDIA Corporation (Santa Clara, CA)
    We are now looking for a TensorRT-LLM Software Development Engineer ! NVIDIA is hiring software engineers for its TensorRT-LLM team. Academic and commercial groups ... tests and performance tests for different stages of the inference pipeline. Collaborate across the company to guide the...Come join us and help build the GPU-accelerated DL platform used worldwide. #LI-Hybrid Your base salary will be… more
    job goal (01/13/26)
    - Save Job - Related Jobs - Block Source
  • NVIDIA Corporation (Santa Clara, CA)
    Senior Technical Marketing Engineer - GPU and System Architecture page is loaded## Senior Technical Marketing Engineer - GPU and System ... power AI at scale. We are looking for a Senior Technical Marketing Engineer focused on GPUs...rack-scale innovations that maximize performance and efficiency for AI inference & training.**What you'll be doing:**In this role, you… more
    job goal (01/12/26)
    - Save Job - Related Jobs - Block Source
  • NVIDIA Corporation (Santa Clara, CA)
    … upon which every new AI-powered application is built. We are seeking a senior engineer to design and build factory infrastructure and automation for NVIDIA ... Senior Software Engineer , Distributed Systems - NIM Factory page is loaded##... Inference Microservices (NIMs). The right person for this role brings… more
    job goal (01/13/26)
    - Save Job - Related Jobs - Block Source
  • Nvidia Corporation (Santa Clara, CA)
    …broadcasting. Our models are deployed on the NVIDIA Maxine platform for real-time video communication and content creation (https://developer.nvidia.com/maxine). Our ... for deep learning acceleration Deploying deep learning models and optimize the inference stack for real-time performance Deliver the benefits of NVIDIA's latest… more
    job goal (01/13/26)
    - Save Job - Related Jobs - Block Source
  • NVIDIA Corporation (Santa Clara, CA)
    Senior Engineer - AI and HPC Observability page is loaded## Senior Engineer - AI and HPC Observabilitylocations: US, CA, Santa Clara: US, TX, Austin: US, ... heart of this transformation. We are looking for a Senior AI & HPC Observability Engineer to...Engineer to design and build the next-generation observability platform for large-scale AI workloads, GPU clusters, and high-performance… more
    job goal (01/12/26)
    - Save Job - Related Jobs - Block Source
  • PlusAI (Santa Clara, CA)
    …with our autonomy and runtime teams to improve our redundant on-vehicle platform and autonomous software stack. Develop perspectives on where opportunities and gaps ... and robustness of different autonomous software component into redundant on-vehicle platform . Design and develop fault detection and fault handling strategies for… more
    job goal (01/13/26)
    - Save Job - Related Jobs - Block Source
  • NVIDIA Corporation (Santa Clara, CA)
    Senior Technical Marketing Engineer - Data Center Scale Out page is loaded## Senior Technical Marketing Engineer - Data Center Scale Outlocations: US, ... help customers build these factories of the future, we are seeking a Senior Technical Marketing Engineer focused on scale-out architectures-covering our AI… more
    job goal (01/12/26)
    - Save Job - Related Jobs - Block Source
  • GEICO (Palo Alto, CA)
    …Great Rewards and Great Careers.**GEICO AI ML Infrastructure team is seeking an exceptional Senior ML Platform Engineer to build and scale our machine ... Hands-on experience with inference optimization using vLLM, TensorRT-LLM, Triton Inference Server, or similarDevOps & Platform Skills* Advanced experience… more
    job goal (01/12/26)
    - Save Job - Related Jobs - Block Source
  • quadric.io, Inc (Burlingame, CA)
    …executes both NN graph code and conventional C++ DSP and control code. Role: The AI Inference Engineer in Quadric is the key bridge between the world of AI/LLM ... models and Quadric unique platforms. The AI Inference Engineer at Quadric will [1] port...platform ; [2] optimize the model deployment for efficient inference ; [3] profile and benchmark the model performance. This… more
    job goal (01/13/26)
    - Save Job - Related Jobs - Block Source
  • NVIDIA Corporation (Santa Clara, CA)
    …low-latency inference .* Partner closely with GPU architecture, networking, and platform teams to exploit GPUDirect, RDMA, NVLink, and similar technologies for ... Principal Software Engineer - Large-Scale LLM Memory and Storage Systems...Todayjob requisition id: JR2010271NVIDIA Dynamo is a high-throughput, low-latency inference framework for serving generative AI and reasoning models… more
    job goal (01/13/26)
    - Save Job - Related Jobs - Block Source
  • Pathway Genomics Corporation (Palo Alto, CA)
    …is headquartered in Palo Alto, California. The opportunity We are looking for a Senior ML Infrastructure / DevOps Engineer who loves Linux, distributed systems, ... by the R&D team for large‑scale training and low‑latency inference . Design, build, and automate the ML platform... inference . Design, build, and automate the ML platform rather than just run pre‑defined playbooks. Work across… more
    job goal (01/13/26)
    - Save Job - Related Jobs - Block Source