Solutions Architect, Generative AI
US, CA, Santa Clara
We are looking for a AI Solution Architect Engineer with experience in Generative AI software development and deployment. As part of the Solution Architect organization, we work with the most exciting computing hardware and software, driving the latest breakthroughs in deep learning and AI with NVIDIA’s key customers. This role offers an excellent opportunity to build your career in the rapidly growing field of AI while working with the world's most successful technology companies. Primary responsibilities will be to lead software customer technical engagements with NVIDIA products and technologies. Join us in this exciting endeavor!
What you’ll be doing:
Develop and demonstrate software solutions based on NVIDIA’s ground breaking AI software and hardware technologies to customers. Develop GenAI model pipeline and perform in-depth analysis and optimization to ensure the best performance on current- and next-generation GPU architectures
Develop and debug software for NVIDIA and OSS AI frameworks and libraries
Lead and develop proof-of-concepts (PoCs) for software solutions applied to Consumer Internet industry use-cases such as NLP/LLM, retrieval, recommender, etc. by working closely with customer's AI developers. Build collateral (notebook/code) for PoCs
Work closely with business development team owning the technical relationship and enabling customer in building innovative solutions based on NVIDIA technologies
Partner with NVIDIA software engineering, product, sales teams to secure design wins at customers. Enable development and growth of NVIDIA product features through customer feedback and PoC evaluations
What we need to see:
BS, MS, or PhD in Computer Science, Electrical/Computer Engineering, Physics, Mathematics, or other Engineering fields or equivalent experience
5+ years of experience as an AI/Software Engineer with proven track record coding in Python and/or C++ with popular AI software libraries and GPUs
Experience with GenAI applications and LLM training/fine-tuning, inference optimization and/or RAG pipelines
Ability to communicate your ideas/code clearly through GitHub, documentation
Great teammate who enjoys collaborating with teams across the organization such as Engineering/Research, Sales, Product, and Marketing
Effective verbal/written communication, and technical presentation skills
Self-starter with passion for growth, enthusiasm for continuous learning and sharing findings across the team
Ways to stand out from the crowd:
Experience working with enterprise developers and customer facing skills
Experience with large-scale production data pipelines and AI model training/deployment
Knowledge of MLOps technologies such as containers, Kubernetes, data center deployments etc.
Able to think creatively to debug and solve complex problems
We make extensive use of conferencing tools, but occasional travel is required for local on-site visit to customers and data science conferences. We are open to remote work location. We look forward to have you join our team!
With highly competitive salaries, a comprehensive benefits package, and an excellent engineering work culture, NVIDIA is widely considered to be one of the technology industry's most desirable employers. NVIDIA has some of the most innovative people working on meaningful problems that are defining the field of ML/DL, data science, robotics, and graphics.
The base salary range is 148,000 USD - 230,000 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.You will also be eligible for equity and benefits. NVIDIA accepts applications on an ongoing basis.
NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law. ApplyJob Profile
Occasional travel required Open to remote work
Benefits/PerksBenefits Competitive salaries Comprehensive benefits Comprehensive benefits package Diversity Eligible for Equity Equity Equity and benefits Excellent engineering work culture Remote work Work environment
Tasks- Collaborate with business development
- Debug software for ai frameworks
- Develop proof-of-concepts
- Develop software solutions
- Lead software customer technical engagements
- Optimize genai model pipeline
- Partner with engineering and sales teams
- Training
AI AI software Ai software libraries Business Development C C++ Communication Containers Data center Data center deployments Data Pipelines Data Science Deep Learning Deployment Documentation Engineering Genai applications Generative AI GitHub GPU GPUs Inference optimization Kubernetes LLM Llm training Marketing ML MLOps NLP NVIDIA NVIDIA products Presentation Python Rag pipelines Robotics Sales Software Development Software Engineering Technical Presentation Training
Experience5 years
EducationB.S. Computer Engineering Computer Science Data Science Deep Learning Engineering Equivalent Equivalent experience Marketing Mathematics MS Ph.D. Physics Software Engineering
TimezonesAmerica/Anchorage America/Chicago America/Denver America/Los_Angeles America/New_York Pacific/Honolulu UTC-10 UTC-5 UTC-6 UTC-7 UTC-8 UTC-9