- Micron Technology, Inc. (San Jose, CA)
- …This role is tightly coupled to the company vision to drive Micron's transformation cloud memory and influence industry trends. As a Business Development Leader, ... profitability, and mitigate risks **Strategic Leadership:** Set the vision for cloud memory business development, aligning cross-functional teams to achieve… more
- NVIDIA (Santa Clara, CA)
- …layer that spans GPU memory , pinned host memory , RDMA-accessible memory , SSD tiers, and remote file/object/ cloud storage to support large-scale LLM ... Dynamo is a high-throughput, low-latency inference framework for serving generative AI and reasoning models across multi-node distributed environments. Built in Rust… more
- Microsoft Corporation (Mountain View, CA)
- …365, OneDrive, Skype, Teams and Xbox Live. We are looking for a Principal AI Architect to join our team! **Responsibilities** + Model Bring-Up & Characterization ... of AI accelerator and GPU architectures, including compute pipelines, memory hierarchies, and interconnects. + Proficiency with PyTorch, CUDA, Triton, or similar… more
- NVIDIA (Santa Clara, CA)
- NVIDIA is building the world's leading AI company, and we are looking for an expert AV and GenAI Solutions Architect to help assist customers with adoption of ... as well as building and deploying solutions around Generative AI and Physical AI and other related...and ride sharing algorithms among other things. A Solutions Architect is the first line of technical expertise between… more
- Deloitte (San Jose, CA)
- … Architect - Professional; Professional Machine Learning Engineer, Professional Cloud Architect + Experience with LLM prompt engineering, fine-tuning, ... to build, deploy, and operate integrated/verticalized sector solutions in software, data, AI , network, and hybrid cloud infrastructure. These solutions are… more
- Deloitte (San Jose, CA)
- … Architect - Professional; Professional Machine Learning Engineer, Professional Cloud Architect + Experience with LLM prompt engineering, fine-tuning, ... to build, deploy, and operate integrated/verticalized sector solutions in software, data, AI , network, and hybrid cloud infrastructure. These solutions are… more
- NVIDIA (Santa Clara, CA)
- …seeking highly skilled and motivated software engineers to join us and build AI inference systems that serve large-scale models with extreme efficiency. You'll ... architect and implement high-performance inference stacks, optimize GPU kernels...industry benchmarks, and scale workloads across multi-GPU, multi-node, and multi- cloud environments. You'll collaborate across inference, compiler, scheduling, and… more
- Cadence Design Systems, Inc. (San Jose, CA)
- …solutions, and networking to ensure optimal performance, scalability, and reliability for all our AI workloads. + Cloud AI Service Integration: Support and ... secure the use of public cloud AI services, including Azure OpenAI services...integration with Docker for containerized job submission. + Full-Stack AI Tech Stack Development & Operations: Architect … more
- Google (Sunnyvale, CA)
- …optimize kernels, manage memory usage, and reduce latency to ensure our AI solutions are not just powerful, but economically viable and at scale. + Provide ... Staff Software Engineer, AI , Infrastructure, Applied AI _corporate_fare_ Google...access to Global 1000 customers via our existing Google Cloud relationships. The opportunity in this space is tremendous.… more
- Honeywell (San Jose, CA)
- …seeking a highly skilled Artificial Intelligence & Machine Learning Systems Engineer to architect , design, and develop advanced AI /ML systems that power our next ... application programming (no Python-only backgrounds) + Proven track record of deploying AI /ML solutions to cloud and edge/constrained devices + Strong systems… more
- NVIDIA (Santa Clara, CA)
- …development, and middleware development, with customer-facing responsibilities to enable cloud service providers with next-generation computing platforms. You will ... + Lead hardware bring-up activities, BSP development, and hardware-software co-design for Cloud Service Provider deployments. + Partner directly with CSPs to deliver… more
- NVIDIA (Santa Clara, CA)
- …Docker). + Experience with large-scale inference serving, LLMs, or similar high-performance AI workloads. + Background with memory management, data transfer ... GPU resource management, and intelligent request handling, Dynamo achieves high-performance AI inference for demanding applications. Our team is addressing the most… more
- Google (Sunnyvale, CA)
- …architecture. + Experience with modern GPU architectures (NVIDIA, AMD, or other AI accelerators), memory hierarchies, and performance bottlenecks. + Experience ... architect truly transformative solutions, shaping the future of AI and accelerated computing for Google and the world....teams working on the development of our TPUs, Vertex AI for Google Cloud , Google Global Networking,… more
- General Motors (Sunnyvale, CA)
- …how we scale AI to achieve autonomy. **What You'll Do:** + Architect , build, and optimize core AI /ML platform infrastructure to support massive-scale model ... **Job Description** **The Role:** We are seeking a Principal AI Engineer to lead the design and advancement of our AI platform. You will play a key role in… more
- Google (Sunnyvale, CA)
- …to this technology that powers Google's AI /ML ambitions and enables the AI /ML applications for Google and Cloud customers. You will develop C++ code ... teams working on the development of our TPUs, Vertex AI for Google Cloud , Google Global Networking,...build firmware running on 32/64-bit embedded processors with limited memory footprints on the accelerator ASICs. + Architect… more
- Walmart (Sunnyvale, CA)
- **Position Summary ** **What you'll do ** We are building the next generation of AI powered experiences on smart TVs think on device large language models, real time ... defined problems through data analysis. We are seeking a Staff Software Engineer to architect the future of TV advertising. As viewers shift from linear TV to… more
- Celestica (San Jose, CA)
- …Software Engineers to contribute to our next-generation data center networking, and AI compute blade systems. You will be instrumental in designing, developing, and ... networking and compute engineers to optimize overall data center efficiency + Architect solutions for customer's data center management needs working with multiple… more
- Oracle (Santa Clara, CA)
- …storage, and communications companies to design unparalleled servers for Oracle's Cloud deployments,_ _database appliance, middleware, and AI solution offerings. ... blocks for some of Oracle's market leading and growing Cloud Systems & Appliances._ _The FPGA requirements of our...and power sequence of high-performance CPU processors, GPUs, DDR memory DIMMs, PCIe Cards, Flash modules, disk drives etc._… more
- Amazon (Cupertino, CA)
- …hardware knowledge with ML expertise to push the boundaries of what's possible in AI acceleration. The AWS Neuron SDK, developed by the Annapurna Labs team at AWS, ... computing, and distributed architectures, where you'll help shape the future of AI acceleration technology This is an opportunity to work on cutting-edge products… more
- NVIDIA (Santa Clara, CA)
- …in embedded firmware development with customer-facing responsibilities to enable cloud service providers with next-generation computing platforms. You will work ... protocol stacks (Redfish, PLDM, MCTP, NSM) and hardware-software co-design for Cloud Service Provider deployments. + Debug and troubleshoot NVIDIA GPU firmware… more