Senior C++/Deep Learning Engineer, GPU Optimization
Remote - US
About the Company
At Torc, we have always believed that autonomous vehicle technology will transform how we travel, move freight, and do business.
A leader in autonomous driving since 2007, Torc has spent over a decade commercializing our solutions with experienced partners. Now a part of the Daimler family, we are focused solely on developing software for automated trucks to transform how the world moves freight.
Join us and catapult your career with the company that helped pioneer autonomous technology, and the first AV software company with the vision to partner directly with a truck manufacturer.
Meet the team
Torc's virtual driver software utilizes cutting-edge deep learning techniques to perceive the vehicle's environment, predict the movements of other vehicles, and execute accurate driving decisions. We are actively seeking a highly experienced senior engineer to join the hardware acceleration team. This is an exceptional opportunity for you to have a significant impact on the future of the autonomous vehicle industry by enhancing AI performance.
What you'll do:
- Optimize machine learning inference models for NVIDIA Orin execution
- Leverage data parallelism and CUDA programming
- Implement TensorRT plugins
- Stay abreast of the latest advancements in PyTorch, maximizing their potential for target hardware execution
- Collaborate with machine learning engineers to develop innovative and performant deep learning solutions
- Analyze and optimize deep learning inference using profiling and optimization tools, identifying and eliminating performance bottlenecks
- Contribute to the development of internal tools and libraries to further enhance deep learning performance on the target hardware
- Document your work clearly and concisely, sharing knowledge effectively with team members
What you'll need to succeed:
- Bachelor's degree in computer science, data science, artificial intelligence, or a related field with 6+ years of professional experience, or a master's degree with 3+ years of experience
- Mastery of modern C++ (C++14 or later) and Python, with the ability to write efficient and maintainable code for both performance and flexibility
- Familiarity with object-oriented software design patterns and their implementation in C++
- In-depth knowledge of CUDA programming and experience with optimizing deep learning kernels
- Excellent understanding of parallel computing (GPGPU) and high-performance computing (HPC) concepts
- …
Job Profile
Benefits: 100% paid medical, dental, and vision premiums; 401(k) with employer match; collaborative environment; competitive compensation package; diverse and inclusive workplace; flexible schedule; generous paid vacation; life insurance; stock options
Tasks:
- Coding
- Design
- Implementation
- Software design
Skills: Agile, Artificial Intelligence, Autonomous driving, Autonomous Technology, Autonomous vehicle technology, C, C++, Coding standards, Compensation, CUDA, Data Science, Deep Learning, Git, HPC, Insurance, Jira, Machine Learning, NVIDIA, Object-oriented design, Parallel Computing, Python, PyTorch, R, TensorFlow, TensorRT
Experience: 6 years
Education: Bachelor's or master's degree in computer science, data science, artificial intelligence, or a related field
Timezones: America/Anchorage, America/Chicago, America/Denver, America/Los_Angeles, America/New_York, Pacific/Honolulu, UTC-10, UTC-5, UTC-6, UTC-7, UTC-8, UTC-9