At NVIDIA, we’re building the foundation for the next generation of Physical AI. Our Cosmos team is passionate about developing robust, production-grade open-source Machine Learning software to support machine learning and generative-AI research at scale. We’re looking for expert-level
Python engineers who are passionate about building production-ready systems and want to make a lasting impact through open-source contributions.
If you care deeply about software craftsmanship, maintainability, and performance—and have hands-on experience building ML systems—this role is for you.
What You’ll Be Doing
- Develop and maintain high-quality, modular, and well-tested Python code for large-scale ML infrastructure. See https://github.com/nvidia-cosmos
- Design and optimize post-training, inference, and data processing pipelines used by ground breaking ML models.
- Collaborate with research and product teams to bring ML systems from prototype to production.
- Contribute to open-source projects and build internal tools that enable scalable AI experimentation.
- Improve performance, reliability, and observability of large distributed systems.
- Support teammates through design reviews, code reviews, and collaborative development.
What We Need To See
- Pursuing BS, MS or PhD degree in Computer Science, Engineering, or a related field, or equivalent experience
- Strong proficiency in Python and a track record of delivering production-quality software.
- Experience with PyTorch (or similar frameworks such as JAX or TensorFlow), especially in real-world applications.
- Deep understanding of ML system design, training loops, data loaders, evaluation, and model serving.
- Familiarity with containerization, CI/CD, and maintaining in production environments
- Comfortable working with large codebases, building reusable libraries, and writing documentation and tests.
Ways To Stand Out In The Crowd
- Contributions to open-source ML or Python infrastructure projects.
- Background in scaling ML training and inference systems across GPUs as well as experience building libraries that wrap or extend PyTorch functionality.
- Prior exposure to multimodal models or simulation environments (vision, language, audio).
- Familiarity with NVIDIA’s GPU compute stack or high-performance computing clusters.
- Experience with distributed computing tools like DDP, FSDP, ZeRO, or Ray.
Are you dedicated, upbeat and dynamic with excellent analytical ability? Are you an engineer passionate and highly motivated about solving complex problems? If so, you may be a perfect fit for NVIDIA!
The base salary range is 104,000 USD - 189,750 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.
You will also be eligible for equity and benefits .
NVIDIA accepts applications on an ongoing basis.
NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.
JR1998235