- Acceler8 Talent (San Francisco, CA)
- A dynamic technology company in San Francisco is seeking a Machine Learning Engineer (Inference) to enhance AI inference efficiency. This mid-senior level ... role focuses on building and optimizing inference stacks, requiring experience with tools such as vLLM and TensorRT. The compensation includes a competitive salary,… more
- F. Hoffmann-La Roche Gruppe (Pleasanton, CA)
- …to come. Join Roche, where every voice matters. The Position Principal DevOps Engineer - ML/AI Algorithms Developing software is great, but developing software ... a purpose is even better! As a Principal DevOps Engineer - ML/AI Algorithms, you will work...and system optimization. Proficiency in Python for automating production systems, including Git, GitLab, Git actions, GitHub CI/CD, familiarity… more
- Together AI (San Francisco, CA)
- About the Role Together AI is seeking a Distributed ML Systems Engineer to design and build scalable machine learning systems that power our accelerated ... high-load and high-performance requirements. If you are passionate about designing ML systems that operate at scale and eager to create impactful solutions,… more
- Serve Robotics (San Francisco, CA)
- Sr. Software Engineer, ML Edge Inference Engineer ... seeking a highly skilled Sr. Software Engineer, ML Edge Inference Engineer to...as NVIDIA Jetson platforms. You will work closely with ML researchers, embedded systems engineers, and robotics… more
- Aldea Inc (San Francisco, CA)
- …AI capabilities. The role involves building and optimizing infrastructure for training and inference systems at scale. Ideal candidates will have experience with ... deep learning frameworks, large model training, and production-grade systems. The company offers comprehensive benefits including equity participation and flexible… more
- Together AI (San Francisco, CA)
- A leading AI research company in San Francisco is seeking a skilled software engineer to focus on optimizing AI systems and ensuring robust performance. ... Candidates should have extensive experience in building scalable distributed systems and proficiency in modern programming languages. The role also involves new… more
- Apple Inc. (San Francisco, CA)
- …equivalent experience). 5+ years in software engineering focused on ML inference, GPU acceleration, and large-scale systems. Expertise in deploying and ... Senior Software Engineer, Model Inference San Francisco Bay...Continuously evaluate emerging research and industry trends in LLM inference, distributed systems, and ML… more
- Jobright.ai (San Francisco, CA)
- Site Reliability Engineer - Inference role at Jobright.ai ...models and building a high-throughput, low-latency API for distributed systems. Responsibilities: * Work on our Inference… more
- Parafin (San Francisco, CA)
- …deploy to batch or real-time targets with minimal boilerplate. Build our real-time ML inference platform. Stand up and scale low-latency model serving. Expand ... batch ML inference. Improve scheduling, parallelism, cost controls,...platforms, shadow/canary deployments, and automated rollback. Experience with low‑latency inference systems. What We Offer Salary Range:… more
- SoFi (San Francisco, CA)
- Senior Software Engineer, ML Platform role at SoFi ...platforms (AWS preferred) and containerization (Docker, Kubernetes). Familiarity with ML workflows, including model training, batch/online inference,… more
- Scale AI, Inc. (San Francisco, CA)
- Machine Learning Research Engineer, Enterprise ML Systems Ready to Apply? Join the team shaping the future of AI at Scale. AI is becoming vitally important ... that serve all of our enterprise clients. As an ML Sys Research Engineer, you'll work on...You will: Build, profile and optimize our training and inference framework. Post‑train state-of-the-art models, developed… more
- Genies (San Francisco, CA)
- …the visual and interactive layer for the LLM-powered internet. Genies is looking for an ML Infra and Model Optimization Engineer to join our R&D team. Based ... APIs/services (e.g., FastAPI, Flask, gRPC). Hands-on experience deploying and operating ML models at scale, including: GPU-based inference services concurrency… more
- Waymo (San Francisco, CA)
- …through major evolutions while supporting high production demands. Experienced at modern ML data, training, and inference systems for large models. ... Perception engineers and build a deep understanding of the ML development workflows and systems, their requirements...extraction for model training, large model training pipelines, model inference systems, etc. Work closely with all… more
- Proximity Works (San Francisco, CA)
- …ML systems optimized for low latency, high throughput, and cost‑efficient inference. Implement ML pipeline best practices (versioning, monitoring, A/B testing, ... About the Role This role is for a hands‑on ML Engineer who can design, train, and...vector databases TensorFlow / PyTorch Proven experience building production‑grade ML systems at scale. Familiarity with LLMs,… more
- Amazon (San Francisco, CA)
- Firmware Engineer, Annapurna Labs, ML Acceleration - Performance Instrumentation & Developer Tools AWS Utility Computing (UC) provides product Annapurna Labs ... pipelines and scripting (Python, shell) for algorithm validation Understanding of ML training/inference workloads and their performance characteristics Takes… more
- Dynamo AI (San Francisco, CA)
- …top-tier AI innovations. We are looking for a highly talented and experienced Lead Software Engineer to lead the development of our ML services on the cloud. ... engineering team to design, develop, and deploy high-performance, secure, and reliable ML services adhering to industry-leading standards. Partner closely with the … more
- Apple Inc. (San Francisco, CA)
- …and neural machine translation, language modeling, sequence-to-sequence models, etc. Experience developing AI/ML systems at scale in production or in high-impact ... AI/ML - Machine Learning Research Engineer,...real-world, large-scale, user-facing machine translation or large language model systems. Strong spoken and written communication skills. At Apple,… more
- Serve Robotics (San Francisco, CA)
- A pioneering robotics firm in San Francisco seeks a skilled Sr. Software Engineer specializing in ML Edge Inference. This role requires a strong background ... in deploying ML models on NVIDIA Jetson platforms, optimizing performance for robotics systems, and working collaboratively with cross-functional teams. The successful… more
- Apple Inc. (San Francisco, CA)
- …and work on meaningful, challenging and novel problems. Description As a Machine Learning Systems Engineer, you will work closely with Siri modeling teams and ... other cross‑functional teams to optimize model training and inference. You will be working across the ML...how tiny details impact the model Can understand complex ML systems that include data, training pipeline,… more
- Crusoe Energy Systems LLC (San Francisco, CA)
- …resilient fault-tolerant queues, model catalogs, and scheduling mechanisms. Build and optimize systems for high-volume AI inference, handling millions of API ... AI team seeks an ambitious and experienced Senior Software Engineer to join their team. You'll have a pivotal...vLLM or similar frameworks. (Preferred) Performance optimizations on GPU systems and inference frameworks. (Preferred) Soft Skills:… more