Sr ML Training Engineer
Changing the world through digital experiences is what Adobe’s all about. We give everyone—from emerging artists to global brands—everything they need to design and deliver exceptional digital experiences! We’re passionate about empowering people to create beautiful and powerful images, videos, and apps, and transform how companies interact with customers across every screen.
We’re on a mission to hire the very best and are committed to creating exceptional employee experiences where everyone is respected and has access to equal opportunity. We realize that new ideas can come from everywhere in the organization, and we know the next big idea could be yours!
What you will be working on:
- Optimize PyTorch-based training code for large-scale distributed training
- Enhance existing training frameworks to better accommodate FP8 and mixed precision
- Ensure efficient utilization of GPU resources for large-scale distributed training
- Ensure efficient setup and utilization of the network for large-scale distributed training
- Analyze quality and performance differences between data types such as BF16 and FP8 for large deep learning models
- Write high-quality, product-level code that is easy to maintain and test while following standard development methodologies
What you will need to succeed:
- B.S., M.S., or Ph.D. in Computer Science, Computer Engineering, or a related area
- Proficiency in Linux, Docker
- Understanding of modern transformer-based model architectures
- Expert in Python and PyTorch
- In-depth experience with DDP, FSDP, ring attention, and related distributed training strategies
- Experience with NCCL
- Experience with DeepSpeed, MS-AMP, Colossal-AI, Megatron, and related distributed training technologies
- Expert in PyTorch profiling tools
- Experience with network performance analysis
#FireflyGenAI:
Our compensation reflects the cost of labor across several U.S. geographic markets, and we pay differently based on those defined markets. The U.S. pay range for this position is $150,700 — $284,400 annually. Pay within this range varies by work location and may also depend on job-related knowledge, skills, and experience. Your recruiter can share more about the specific salary range for the job location during the hiring process.
At Adobe, starting salaries for sales roles are expressed as total target compensation (TTC = base + commission), and short-term incentives are in the form of sales commission plans. Starting salaries for non-sales roles are expressed as base salary, and short-term incentives are in the form of the Annual Incentive Plan (AIP).
In addition, certain roles may be eligible for long-term incentives in the form of a new hire equity award.
Adobe will consider qualified applicants with arrest or conviction records for employment in accordance with state and local laws and “fair chance” ordinances.
Adobe is proud to be an Equal Employment Opportunity and affirmative action employer. We do not discriminate based on gender, race or color, ethnicity or national origin, age, disability, religion, sexual orientation, gender identity or expression, veteran status, or any other applicable characteristics protected by law.
Adobe aims to make Adobe.com accessible to any and all users. If you have a disability or special need that requires accommodation to navigate our website or complete the application process, email accommodations@adobe.com or call (408) 536-3015.
Adobe values a free and open marketplace for all employees and has policies in place to ensure that we do not enter into illegal agreements with other companies to not recruit or hire each other’s employees.