Deep Learning Performance Architect
NVIDIA
NVIDIA is developing processors and system architectures that accelerate deep learning on edge devices, workstations and data center GPUs for a variety of applications, including automotive, robotics, large language models (LLMs) and AI generative models. We are looking for an expert deep learning system performance architect to join our modelling, efficiency optimization, performance projections and analysis effort. In this position, you will have the chance to optimize deep learning hardware and software architecture and make the significant impact in a dynamic technology focused company
What you’ll be doing:
Analyze performance and efficiency of various machine learning/deep learning algorithms on different architectures
Identify architecture and software performance bottlenecks and propose optimizations
Explore new features and hardware capabilities on deep learning applications
What we need to see:
BSc. MS or PhD in relevant discipline (CS, EE, Math, etc.,)
4+ years of working experience in relevant directions (e.g., performance models and optimizations) will be a plus
Be familiar with deep learning platform architecture (e.g., GPU)
A strong background in computer architecture
Be familiar with LLM or generative AI deep learning algorithms
Experience on system performance or energy efficiency model development and analysis
Familiar with machine learning and deep learning frameworks