- NVIDIA (Santa Clara, CA)
- …Libraries and Networking team at NVIDIA. We deliver communication runtimes like NCCL and NVSHMEM for Deep Learning and HPC applications. We are looking ... Engineer to guide our key partners and customers with NCCL . Most DL/HPC applications run on large clusters with...Develop tools and automation to isolate issues on new systems and platforms, including cloud platforms (Azure, AWS, GCP,… more
- NVIDIA (Santa Clara, CA)
- …apply today! We're looking for a highly motivated, creative engineer with strong experience in system software to join the DGX Cloud Software Team. You will ... a strong programming background, a deep understanding of distributed systems , familiarity with software testing and deployment,...more high-level languages (C, C++, Go, Rust, etc) + System -level experience with both hardware and software … more
- NVIDIA (Santa Clara, CA)
- …the workload and the system , and empower them to find opportunities in software and hardware, build high level models to propose and deliver the best hardware ... forward-thinking, hard-working, and creative people to join a multifaceted software team with high standards! This software ...to help improve the performance and efficiency of the system . What you'll be doing: + Build internal profiling… more
- NVIDIA (Santa Clara, CA)
- …+ At least 5+ years of engineering experience with multi-GPU platforms + Strong system software (firmware, BIOS, kernel, driver, operating system ) expertise ... direct customer interaction, and the reward of contributing to software and products, to join our team of Solution...or HPC data center technologies including Upper Layer Protocols ( NCCL , MPI) Your base salary will be determined based… more
- NVIDIA (Santa Clara, CA)
- …software architects in the field of AI and high-performance networking and system software . We research, develop, and deploy solutions in networking hardware, ... programming environments, and system software to make current and future...software to make current and future high-end computer systems more performant, scalable, and usable. + Creating proofs-of-concept… more
- NVIDIA (Santa Clara, CA)
- …Future of Data Center Networking. NVIDIA is seeking a visionary and experienced Software Architect to join our CTO Architecture Group, where we drive the innovation ... at the forefront of architecting next-generation GPU networking, defining the software architecture for groundbreaking technologies in areas like network programming… more
- NVIDIA (Santa Clara, CA)
- …+ Familiarity with GPU programming and performance: CUDA, memory hierarchy, streams, NCCL ; proficiency with profiling/debug tools (eg, Nsight Systems /Compute). + ... We are seeking highly skilled and motivated software engineers to join us and build AI... engineers to join us and build AI inference systems that serve large-scale models with extreme efficiency. You'll… more
- NVIDIA (Santa Clara, CA)
- …UCX that are crucial for scaling Deep Learning and HPC. We're seeking a Senior Software Architect to help co-design next-gen data center platforms and scalable ... communications software . DL and HPC applications have a huge compute...NVSHMEM, OpenSHMEM, UCX, UCC). + Deep understanding of operating systems , computer and system architecture. + Solid… more
- NVIDIA (TX)
- We are seeking Software Engineers with previous experience building and running private and public clouds at production scale. As part of the DGX Cloud team, you'll ... on-call rotation. + Consult with and provide consultation for peer teams on systems design best practices. + Participate in a supportive culture of values-driven… more
- NVIDIA (Santa Clara, CA)
- …platforms have already made a significant impact in the AI and Software -Defined Networking fields which are broadly used across leading academic institutions, ... engineering role is for an expert in AI networking systems who will provide leadership to our customers who...transform the industry. We are looking for an extraordinary Software Engineer focused on Networking and AI factory challenges.… more
- NVIDIA (Santa Clara, CA)
- We are now looking for a Senior Software Engineer for AI Resiliency. At NVIDIA, we are pushing the boundaries of what's possible in AI. We are currently seeking ... a Senior Software Engineer to lead the development...AI Software Resiliency Features: Implement and optimize software features that improve AI system reliability… more
- NVIDIA (Santa Clara, CA)
- …passionate about accelerated computing. We are looking for you - a Networking Software Architect, to develop the next generation of networking protocols for AI. We ... are developing RDMA Transport protocols within the Networking software architecture team at NVIDIA. We build the underlying...to grow with the increasing scale of next generation systems . This is an outstanding opportunity to advance the… more
- NVIDIA (CA)
- …how you can make a lasting impact on the world. We are now looking for a Senior System Software Engineer to work on user facing tools for Dynamo Inference ... or able to quickly gain expertise in vLLM, SGLang, PyTorch, NVIDIA GPUs, and supporting software stacks such as NIXL, NCCL , CUDA, as well as HPC technologies… more
- NVIDIA (Santa Clara, CA)
- …the next wave of artificial intelligence. We are looking for a highly motivated senior software engineer for an exciting role in our communication libraries and ... programming interface specifications like MPI/OpenSHMEM. + Design, implement and maintain system software that enables interactions among GPUs and interactions… more
- Microsoft Corporation (Redmond, WA)
- …hands on experience with production ML systems , large-scale training infrastructure, NCCL , CUDA libraries and tools. \#CoreAI Software Engineering IC4 - The ... OAI and OSS models, and many more. As a Senior Software Engineer on the training infrastructure...software . + 2+ years of experience with distributed systems and cloud-based infrastructure. + 1+ year of experience… more
- NVIDIA (Santa Clara, CA)
- …fit for you, we'd love to hear from you! NVIDIA is seeking a Senior High Performance Computing (HPC) and AI Networking Performance Research and Analysis Engineer to ... types of hardware and platforms, such as HCAs, Switches, CPUs, GPUs, and Systems . You will develop performance analysis tools and methodologies to dive deeply into… more
- Oracle (Topeka, KS)
- …Oracle and our customers to build and deploy AI at scale. We are looking for a ** Senior Software Engineer** to join our growing team and help shape the future of ... management** , **self-service ML infrastructure** , and **model training and serving systems ** . Work on critical AI infrastructure that powers Oracle's GenAI and… more
- NVIDIA (Santa Clara, CA)
- We are seeking a Senior AI/ML Performance and Efficiency Engineer, GPU Clusters at NVIDIA to join our AI Efficiency efforts. As an Engineer, you will have a pivotal ... engineering organizations to deliver efficiency in our usage of hardware, software , and infrastructure + Proactively monitor fleet wide utilization patterns, analyze… more
- NVIDIA (Santa Clara, CA)
- …Good understanding of computer system architecture, HW-SW interactions and operating systems principles (aka systems software fundamentals) + Implement ... and Networking team at NVIDIA. We deliver libraries like NCCL , NVSHMEM, UCX for Deep Learning and HPC. We...parallel programming and at least one communication runtime (MPI, NCCL , UCX, NVSHMEM) + Experience conducting performance benchmarking and… more
- NVIDIA (Santa Clara, CA)
- …Provide leadership and strategic mentorship on the management of large-scale HPC systems including the deployment of compute, networking, and storage. + Develop and ... they occur. + Build innovative tooling to accelerate researchers' velocity, debugging and software performance at scale. What we need to see: + Bachelor's degree in… more