We are looking for a Software Engineering Manager to lead the development efforts for the Triton Inference Server team! Academic and commercial groups around the world are using GPUs to power a revolution in deep learning, enabling breakthroughs in problems from image classification to recommenders to large language models. We are a fast-paced, agile team building tools and software to make the design and deployment of new deep learning models easier and more accessible to inference solution providers and data scientists.
In this role, you will manage an engineering team designing, developing, and optimizing software that streamlines AI inferencing. Ideal candidates will not only have experience leading an agile system software engineering team, but will also be motivated to push the boundaries of what is possible with AI inferencing on both CPUs and GPUs. If this sounds exciting, we would love to hear from you!
What you'll be doing:
- Lead, mentor, and grow the Triton engineering team and be responsible for planning and execution of projects as well as the quality and performance of the Triton Inference Server.
- Work closely with Product and Program Management to establish feature roadmaps, coordinate project dependencies, load-balance asynchronous requests across available resources, and collaborate on all feature designs.
- Engage with internal and external partners and customers to understand their use cases and requirements.
What we need to see:
- Master's or PhD, or equivalent experience, in Computer Science, Computer Architecture, or a related field.
- 8+ years of overall experience in developing customer-facing software.
- 3+ years of experience recruiting, training, and leading software engineering teams.
- Strong fundamentals in building and deploying cloud services using HTTP REST, gRPC, protobuf, and related technologies.
- Excellent C/C++ and Python programming and software design skills, including debugging, performance analysis, and test design. Emphasis on clean and SOLID object-oriented programming principles is a plus.
- Experience running a large open source project: use of GitHub, bug tracking, branching and merging code, OSS licensing issues, handling patches, etc.
- Experience with agile software development practices is required, including familiarity with tools such as JIRA and AHA.
Ways to stand out from the crowd:
- Experience working in a globally distributed organization.
- Experience with machine learning algorithms and frameworks, especially frameworks such as TensorFlow, PyTorch, ONNX, TensorRT, OpenVINO, and vLLM.
- Good knowledge of CPU and/or GPU hardware architecture.
- Knowledge of GPU programming models such as OpenCL or CUDA.
NVIDIA is widely considered to be one of the technology world’s most desirable employers. We have some of the most experienced and passionate people in the world working for us. Are you creative and autonomous? Do you love a challenge? If so, we want to hear from you. Come help us build the real-time, efficient computing platform driving our success in the multifaceted and quickly growing fields of Deep Learning and Artificial Intelligence!
The base salary range is 216,000 USD - 333,500 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.
You will also be eligible for equity and benefits. NVIDIA accepts applications on an ongoing basis.
NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.