Description
NVIDIA has continuously reinvented itself. Our invention of the GPU sparked the growth of the PC gaming market, redefined modern computer graphics, and revolutionized parallel computing. Today, research in artificial intelligence is booming worldwide, which calls for the highly scalable, massively parallel computation horsepower at which NVIDIA GPUs excel.
NVIDIA is a “learning machine” that constantly evolves by adapting to new opportunities that are hard to solve, that only we can address, and that matter to the world. This is our life’s work, to amplify human creativity and intelligence. As an NVIDIAN, you’ll be immersed in a diverse, supportive environment where everyone is inspired to do their best work. Come join our diverse team and see how you can make a lasting impact on the world!
We are looking for a Senior Software Engineer to join our mission to continue improving our HPC infrastructure. Our team builds and operates sophisticated infrastructure to enable business-critical services and AI applications. You will be working with a team of passionate and skilled engineers who are continuously working to provide better tools to build and manage this infrastructure. The ideal candidate is strong in software development and in designing and creating reliable distributed systems, and is able to implement a well-thought-out long-term maintenance strategy.
What you’ll be doing:
Design highly available and scalable systems to meet the demands of our HPC clusters
Evaluate new and innovative technologies as the landscape evolves
Continuously improve infrastructure provisioning and management using automation
Support a globally distributed, multi-cloud hybrid environment - AWS, GCP and On-prem
Build strong cross-functional relationships and align with partners across various business units
Ensure the highest level of uptime and Quality of Service (QoS) for our users through operational excellence
Participate in the team's on-call rotation and be a contact for service incidents
What we need to see:
5+ years of experience in design, implementation, and delivery of large engineering projects
Comfortable with at least two of the following programming languages: Golang, Java, C/C++, Scala, Python, Elixir.
Understands scalability challenges and performance of server-side code. Able to craft and develop horizontally scalable, resilient and performing-under-load systems.
Versatile technologist with experience in full software development lifecycle – from inception and design to deployment, operation, and iterative development.
Proficient in cloud computing and hands-on with at least one cloud platform: GCP, AWS, or Azure.
Proficient in modern CI/CD techniques, GitOps, and Infrastructure as Code (IaC)
Strong work ethic and a passion for problem solving
B.S. degree in Computer Science or related technical field (or equivalent experience)
Detail-oriented with great communication and collaboration skills
Ways to stand out from the crowd:
Prior experience building solutions for HPC clusters based on Slurm or Kubernetes
Strong understanding of the Linux operating system and TCP/IP fundamentals
Company
NVIDIA
Since its founding in 1993, NVIDIA (NASDAQ: NVDA) has been a pioneer in accelerated computing. The company’s invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined computer graphics, ignited the era of modern AI and is fueling the creation of the metaverse. NVIDIA is now a full-stack computing company with data-center-scale offerings that are reshaping industry.