Software Engineer, ML Systems - AI Infra

20 Sep 2025

Vacancy expired!

Summary: Meta is seeking a software engineer to join our AI Infra. In this role, you will have the opportunity to make a significant contribution to the field of scalable ML systems image generation by developing novel techniques and algorithms for efficiently scaling out ML training, including mitigation of training instability. These techniques will have a large direct impact on Meta’s top ML and recommendation models, impacting recommendations of Ads and content to the Family of Apps’ 3 billion plus users. You will design and develop cutting-edge ML training stability techniques to production to solve real world problems. At Meta, you work alongside and learn from top minds in the field and have access to uniquely large scale computation resources. Join us for this exciting and rewarding role and we are hiring in multiple locations.Required Skills: Software Engineer, ML Systems - AI Infra Responsibilities:

Designing and developing ML training stability and scalability techniques in AI Infra

Consistently and sustainably advance the state of AI, including setting and executing against roadmaps for 6-month plus timeframes

Work towards long-term ambitious research, development and productionization goals, planning and successfully executing and intermediate milestones

Solve critical problems and provide mentorship to other team members

Work and collaborate with cross-functional teams, build relationships with stakeholders

Define use cases and develop methodology and benchmarks to evaluate different approaches

Minimum Qualifications: Minimum Qualifications:

Currently has, or is in the process of obtaining a Bachelor's degree in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience. Degree must be completed prior to joining Meta.

Ph.D degree in Computer Science, Computer Engineering and other related fields

Experience with modern deep learning and recommendation system algorithms and techniques

Proficiency in Python and PyTorch

Preferred Qualifications: Preferred Qualifications:

Published in ML conferences NeurIPS, ICML, etc.

Experience with large-scale distributed system and working with large amounts of data

Experience with bringing research to production on real-world applications

Experience with large-scale ML recommender system models

Public Compensation: $116,002/year to $168,000/year + bonus + equity + benefitsIndustry: InternetEqual Opportunity: Meta is proud to be an Equal Employment Opportunity and Affirmative Action employer. We do not discriminate based upon race, religion, color, national origin, sex (including pregnancy, childbirth, or related medical conditions), sexual orientation, gender, gender identity, gender expression, transgender status, sexual stereotypes, age, status as a protected veteran, status as an individual with a disability, or other applicable legally protected characteristics. We also consider qualified applicants with criminal histories, consistent with applicable federal, state and local law. Meta participates in the E-Verify program in certain locations, as required by law. Please note that Meta may leverage artificial intelligence and machine learning technologies in connection with applications for employment.Meta is committed to providing reasonable accommodations for candidates with disabilities in our recruiting process. If you need any assistance or accommodations due to a disability, please let us know at accommodations-ext@fb.com.

Full-time
  • ID: #50284841
  • State: Washington Olympia 98501 Olympia USA
  • City: Olympia
  • Salary: USD TBD TBD
  • Showed: 2023-09-20
  • Deadline: 2023-11-18
  • Category: Et cetera