Software Engineer, Fleet Health Instrumentation Intern - Fall 2025 - #257967

NVIDIA


Date: 6 hours ago
City: Santa Clara, CA
Contract type: Full time
Our work at NVIDIA is dedicated towards a computing model focused on visual and AI computing. For two decades, NVIDIA has pioneered visual computing, the art and science of computer graphics, with our invention of the GPU. The GPU has also shown to be spectacularly effective at solving some of the most complex problems in computer science. Today, NVIDIA’s GPU simulates human intelligence, running deep learning algorithms and acting as the brain of computers, robots and self-driving cars that can perceive and understand the world. We are looking to grow our company and teams with the smartest people in the world and there has never been a more exciting time to join our team! Join the opportunity to design, prototype, and ship high-impact features that keep NVIDIA's GPU-accelerated platforms running smoothly at global scale.

You’ll enter the same engineering culture that powers NVIDIA’s services, applying modern software practices—from service design and development to system instrumentation and data-pipeline engineering. Our internship focuses on writing robust, performant code (Golang / Python) and automating everything that can be automated, so NVIDIA’s cloud offerings deliver world-class reliability.

What You Will Do

  • Design and build software that collects, transforms, and publishes health data about our global GPU fleet.
  • Develop micro-services and data pipelines in Go or Python that ingest and normalize data from many diverse sources—routing millions of records per day (Kafka, Airflow, Kinesis).
  • Instrument production infrastructure and workloads running on Kubernetes and bare-metal clusters; add tracing and metrics hooks for deeper insights.
  • Automate deployments and testing with CI/CD (GitLab, Argo) and IaC (Terraform), ensuring repeatable, low-touch releases.
  • Participate in the full lifecycle of cloud services —from design docs and code reviews through deployment, monitoring, and continuous improvement.
  • Collaborate with other engineers to debug live issues and turn post-incident insights into durable code fixes.
  • Contribute to internal tooling and dashboards that help engineers visualize fleet health, utilization, and capacity trends.

What We Need To See

  • Actively pursuing a BS or MS in Computer Science, Computer Engineering, or a closely related quantitative field (e.g., Physics or Mathematics).
  • Solid understanding of distributed‑systems fundamentals , modern software‑engineering practices, and data‑modeling principles.
  • Proficiency in at least one programming language—preferably Python or Go .
  • Working knowledge of Linux , basic networking concepts, and Kubernetes container orchestration.

Ways To Stand Out From The Crowd

  • A systematic, analytical problem‑solving approach paired with clear written and verbal communication skills and a strong sense of ownership.
  • Demonstrated ability to debug, optimize, and automate code or workflows with minimal guidance.
  • Hands‑on experience building, deploying, and operating services in a public‑cloud or large on‑prem environment.

NVIDIA is widely considered to be one of the technology world’s most desirable employers. We have some of the most forward-thinking and hardworking people in the world working for us. If you're creative and autonomous, we want to hear from you!

The hourly rate for our interns is 18 USD - 71 USD. Our internship hourly rates are a standard pay determined based on the position and your location, year in school, degree, and experience.

You will also be eligible for Intern benefits . NVIDIA accepts applications on an ongoing basis.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

JR1997488

How to apply

To apply for this job you need to authorize on our website. If you don't have an account yet, please register.

Post a resume

Similar jobs

Football Research & Development Analyst

San Francisco 49ers, Santa Clara, CA
$95,000 - $105,000 per year
2 days ago
The Football R&D Analyst will take a data driven approach to support Coaching, Player Personnel, and Player Performance staff as a supplement to strengthen existing methods, under the direction of the General Manager. Responsibilities and Duties: Supports Coaching staff and Player Personnel Dept with statistical analysis and reports pursuant to prep for upcoming opponents. Uses analytical methods to detail on-field...

Associate Creative Director, Events

NVIDIA, Santa Clara, CA
1 week ago
NVIDIA has been transforming computer graphics, PC gaming, and accelerated computing for more than 25 years. It’s a unique legacy of innovation that’s fueled by incredible technology—and amazing people. Today, we’re tapping into the unlimited potential of AI to define the next era of computing. An era in which our GPU acts as the brains of computers, robots, and self-driving...

Product Manager, Robotics

UnitX, Santa Clara, CA
2 weeks ago
Title: Product Manager, Robotics About US: UnitX builds cutting-edge industrial automation solutions leveraging advanced robotics, AI, and vision systems. We are a fast-paced, ambitious startup committed to redefining how industries automate and scale. We’re hiring a Product Manager specialized in 3D Robotics to lead our motion robotics product initiatives. This role involves close interaction with engineering, field operations, and customers...