- Jobright.ai (San Francisco, CA)
- …8 or more years of experience as a software reliability engineer or software engineer working on large-scale, internet-facing production services ... Site Reliability Engineer - Inference role at Jobright.ai ... San Francisco, CA $160,000.00-$180,000.00 4 days ago Software Engineer, Infrastructure, Early Career San Francisco,… more
- Together AI (San Francisco, CA)
- Role: Together AI is seeking a Machine Learning Engineer to join our Inference Engine team, focusing on optimizing and enhancing the performance of our AI ... inference systems. This role involves working with state-of-the-art large ... of low-level operating systems concepts including multi-threading, memory management, networking, storage, performance, and scale. Preferred: Knowledge of existing… more
- Red Hat (Boston, MA)
- Principal Machine Learning Engineer, AI Inference. Remote type: Hybrid. Locations: ... open-source LLMs and vLLM to every enterprise. Red Hat Inference team accelerates AI for the enterprise and brings ... optimize, and scale LLM deployments. As a Principal Machine Learning Engineer focused on vLLM, you will be at the… more
- Together AI (San Francisco, CA)
- About the Role: Together AI is seeking a Distributed ML Systems Engineer to design and build scalable machine learning systems that power our accelerated AI ... of low-level operating systems concepts including multi-threading, memory management, networking, and storage, performance, and scale. Experience with cloud… more
- Oracle (Seattle, WA)
- Sr Principal Software Engineer, Networking - AI Infrastructure Innovation. OCI (Oracle Cloud) AI Infrastructure Innovation team is pioneering the creation of ... You will define architecture, lead complex system design, and implement innovative networking software that advances RDMA for GPUs and accelerates storage… more
- GEICO (Palo Alto, CA)
- …Careers. GEICO AI ML Infrastructure team is seeking an exceptional Senior ML Platform Engineer to build and scale our machine learning infrastructure with a focus ... Design, implement, and maintain feature stores for ML model training and inference pipelines. Build and optimize LLM inference systems using frameworks… more
- NVIDIA Corporation (Santa Clara, CA)
- Principal Software Engineer - Large-Scale LLM Memory and Storage Systems ... storage (GPU, CPU, local disk, and remote memory) for high-throughput, low-latency inference. Partner closely with GPU architecture, networking, and platform… more
- Baseten (San Francisco, CA)
- Senior Software Engineer - Enterprise Platform at Baseten. Base Pay Range ... $200,000.00/yr - $270,000.00/yr About Baseten: Baseten powers mission‑critical inference for the world's most dynamic AI companies, including Cursor, Notion,… more
- Baseten (San Francisco, CA)
- …build the platform engineers turn to to ship AI products. The Role: As a Senior Software Engineer on the Core Product team at Baseten, you will be building and ... pay range $190,000.00/yr - $250,000.00/yr About Baseten: Baseten powers mission‑critical inference for the world's most dynamic AI companies, like Cursor, Notion,… more
- NVIDIA (Santa Clara, CA)
- …storage (GPU, CPU, local disk, and remote memory) for high-throughput, low-latency inference. Partner closely with GPU architecture, networking, and platform ... NVIDIA Dynamo is a high-throughput, low-latency inference framework for serving generative AI and reasoning ... cutting-edge LLM workloads. We are seeking a Principal Systems Engineer to define the vision and roadmap for memory… more
- Oracle (Redwood City, CA)
- …agents that integrate seamlessly with cloud services. Role Summary As a Principal Software Engineer (IC4), you will contribute to the design and implementation ... will work in a collaborative environment with applied scientists, ML engineers, and software teams to deliver performant and reliable AI infrastructure. This is a… more
- Mundanelabs (Palo Alto, CA)
- Principal Software Engineer, Embodied Systems. Mundane is a venture-backed seed-stage robot learning startup founded by a team of Stanford researchers and ... team of engineers, roboticists, and dreamers. About the Role: As a Principal Software Engineer on the Embodied Systems team, you'll architect and… more
- Sierra Business Solution (San Francisco, CA)
- Software Engineer, Site Reliability (SRE) at Sierra Business Solution. About Us: We are an in‑person company ... experience with Terraform, AWS services, container orchestration, and cloud networking (IAM, VPC). Strong background in observability systems (Prometheus, Grafana,… more
- Sierra (San Francisco, CA)
- …Clay led the product and design teams for Google Workspace. What you'll do As a Software Engineer on our Site Reliability team at Sierra, you will be responsible ... Deep experience with Terraform, AWS services, container orchestration, and cloud networking (including IAM and VPC architecture). Strong background in observability… more
- Hp Iq (San Francisco, CA)
- …seamlessly integrating with cloud infrastructure. We are looking for a Senior Software Engineer to design and develop high‑performance, scalable services to ... edge devices. Optimize data pipelines and storage solutions for real‑time AI inference and processing. Implement security and privacy best practices for distributed… more
- Gauss Labs (Palo Alto, CA)
- Software Engineer, AI Platform - New Grad Machine Learning Engineer ... training/inference workflows, and deployment automation. Solid understanding of software engineering best practices: version control (Git), unit testing, code… more
- Slab Inc. (Palo Alto, CA)
- …training/inference workflows, and deployment automation. Solid understanding of software engineering best practices: version control (Git), unit testing, code ... or edge deployments). Experience in distributed/parallel systems, information retrieval, networking, and systems software development. Development experience in… more
- CompScience (San Francisco, CA)
- About CompScience: At CompScience, we're not just building software, we're saving lives. We're a high-growth startup on a mission to prevent 1 million workplace ... and engineering teams are composed of distinguished computer vision engineers, software architects, data scientists and product and design leaders from Amazon… more
- Together AI (San Francisco, CA)
- This role focuses on enabling custom models and dedicated inference on Together. We are responsible for optimizing autoscaling, minimizing cold starts, achieving the ... fault tolerant, distributed systems and API microservices Experience running serverless inference platforms, doing model bring-up on short notice, being on call,… more
- Atlassian (San Francisco, CA)
- …leverage the power of AI & ML without any complications. As a Machine Learning Systems Engineer on the AI & ML Platform team, you will build and scale the core ... infrastructure to allow software engineers, ML engineers & data scientists to develop, ... and tools. Understanding of LLMs, best deployment practices and inference optimisation. Experience in building and implementing high-performance RESTful… more