포지션 상세
Intelligent machines powered by artificial intelligence—computers that can learn, reason, and interact with people—are transforming every industry. GPU-accelerated deep learning provides the foundation for machines to perceive, reason, and solve complex problems. We are seeking an exceptional Senior Perception Engineer to help design and productize NVIDIA's next-generation autonomous driving perception stack. You will work on the core 3D obstacle perception pipeline, contribute to architecture and algorithm design, and remain deeply hands-on with implementation, including modern transformer-based, multi-modal, and vision-language techniques where they add real value.
• Design and implement advanced 3D perception models using multi-camera inputs and/or multi-sensor fusion (camera, radar, lidar) for obstacle detection and tracking, including opportunities to explore BEV and transformer-based 3D perception.
• Build efficient, production-grade deep learning models by defining objectives with the team. Select and prototype architectures, run experiments, and follow training and evaluation guidelines. Use techniques like large-scale pretraining, distillation, and parameter-efficient fine-tuning (e.g., LoRA).
• Help define and maintain KPI frameworks to quantify perception performance; analyze large-scale real and synthetic datasets to identify failure modes and systematically improve accuracy, robustness, and efficiency, incorporating approaches like self-supervised and representation learning when beneficial.
• Contribute to the data strategy for perception by specifying data and labeling requirements. Help prioritize data collection and annotation. Collaborate with data and ground-truth teams, including model-assisted workflows such as active learning, auto-labeling, and multimodal AI systems combining vision and language.
• Collaborate with safety, systems, and software teams to ensure perception solutions meet product requirements for safety, latency, resource usage, and software robustness, and are ready for deployment at scale.
• Hands-on experience developing deep learning–based perception or closely related systems for complex real-world problems, with strong proficiency in frameworks such as PyTorch and a track record of taking models from prototype to production.
• Proven experience in data-driven development, including close collaboration with data, labeling, and validation teams on data strategy, labeling quality, and iterative model improvement.
• Strong programming skills in Python and/or C++, with experience building reliable, high-performance, production-quality software.
• Excellent communication and collaboration skills, with the ability to work effectively across multidisciplinary teams.
주요업무
• Develop and improve the technical build, architecture, and roadmap for 3D obstacle perception to support end-to-end autonomous driving. Use innovative CNN and transformer-based architectures when appropriate.• Design and implement advanced 3D perception models using multi-camera inputs and/or multi-sensor fusion (camera, radar, lidar) for obstacle detection and tracking, including opportunities to explore BEV and transformer-based 3D perception.
• Build efficient, production-grade deep learning models by defining objectives with the team. Select and prototype architectures, run experiments, and follow training and evaluation guidelines. Use techniques like large-scale pretraining, distillation, and parameter-efficient fine-tuning (e.g., LoRA).
• Help define and maintain KPI frameworks to quantify perception performance; analyze large-scale real and synthetic datasets to identify failure modes and systematically improve accuracy, robustness, and efficiency, incorporating approaches like self-supervised and representation learning when beneficial.
• Contribute to the data strategy for perception by specifying data and labeling requirements. Help prioritize data collection and annotation. Collaborate with data and ground-truth teams, including model-assisted workflows such as active learning, auto-labeling, and multimodal AI systems combining vision and language.
• Collaborate with safety, systems, and software teams to ensure perception solutions meet product requirements for safety, latency, resource usage, and software robustness, and are ready for deployment at scale.
자격요건
• PhD with 4+ years, MS with 6+ years, or BS (or equivalent experience) with 8+ years of relevant experience in Computer Science, Computer Engineering, or a related technical field.• Hands-on experience developing deep learning–based perception or closely related systems for complex real-world problems, with strong proficiency in frameworks such as PyTorch and a track record of taking models from prototype to production.
• Proven experience in data-driven development, including close collaboration with data, labeling, and validation teams on data strategy, labeling quality, and iterative model improvement.
• Strong programming skills in Python and/or C++, with experience building reliable, high-performance, production-quality software.
• Excellent communication and collaboration skills, with the ability to work effectively across multidisciplinary teams.



