Apply directly to jobs in best companies
Search Companies / Jobs

Software Engineer, Systems ML - HPC Specialist at Meta
New York City, United States


Job Descrption
Meta is seeking an AI Software Engineer to join our Research & Development teams. The ideal candidate will have industry experience working on AI Infrastructure related topics. The position will involve taking these skills and applying them to solve for some of the most crucial & exciting problems that exist on the web.

Some aspects of this role as an HPC specialist may include authoring components such as cuBLAS, cuDNN, AITemplate, FlashAttention and development of runtimes such as LLM disaggregated runtime. HPC specialists spend time optimizing the program to reduce the accelerators idle time. They also develop tools to debug (cuda-gdb), profiler utilizing the accelerated computing hardware (such as PE’s/SFU etc in MTIA or Transformer engine in H100). They are experts in systems who are able to design, debug and accelerate AI workloads from single-node scale up to multi-node scale out distributed systems. They also are able to influence the next generation of Silicon architectures (such as Tensor Core in V100. Transformer Engine in H100) based on the evolving AI workload needs.

We are hiring in multiple locations.Software Engineer, Systems ML - HPC Specialist Responsibilities
  • Apply relevant AI and machine learning techniques to build & optimize our intelligent systems that improve Metas products and experiences
  • Develop custom/novel architectures, define use cases, and develop methodology & benchmarks to evaluate different approaches
  • Apply in depth knowledge of how the machine learning system interacts with the other systems around it
  • Drive large efforts across multiple teams
  • Assist in goal setting related to project impact, AI system design, and ML excellence
  • Mentor other AI Engineers & improve the quality of AI work in the broader team
Minimum Qualifications
  • Bachelor's degree in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience.
  • 4+ years of experience in HPC and parallel computing.
  • Proficiency in GPU programming using CUDA and familiarity with CUDA libraries (cuBLAS, cuDNN, etc.).
  • Proven track record of leading successful HPC projects.
  • Proven technical expertise in HPC architectures and technologies.
  • Effective leadership and communication skills.
Preferred Qualifications
  • PhD in Computer Science, Computer Engineering, or relevant technical field.
  • Experience developing AI algorithms or AI-System infrastructure in C/C++ or Python.
  • Experience with distributed systems or on-device algorithm development.
LocationsAbout Meta Meta builds technologies that help people connect, find communities, and grow businesses. When Facebook launched in 2004, it changed the way people connect. Apps like Messenger, Instagram and WhatsApp further empowered billions around the world. Now, Meta is moving beyond 2D screens toward immersive experiences like augmented and virtual reality to help build the next evolution in social technology. People who choose to build their careers by building with us at Meta help shape a future that will take us beyond what digital connection makes possible today—beyond the constraints of screens, the limits of distance, and even the rules of physics. Meta is committed to providing reasonable support (called accommodations) in our recruiting processes for candidates with disabilities, long term conditions, mental health conditions or sincerely held religious beliefs, or who are neurodivergent or require pregnancy-related support. If you need support, please reach out to accommodations-ext@fb.com. $177,008/year to $251,000/year + bonus + equity + benefits

Individual pay is determined by skills, qualifications, experience, and location. Compensation details listed in this posting reflect the base salary only, and do not include bonus, equity or sales incentives, if applicable. In addition to base salary, Meta offers benefits. Learn more about benefits at Meta.

Complete form below to directly Send your CV / Linkedin Profile to Software Engineer, Systems ML - HPC Specialist at Meta.
@
You will receive all responses from employer on this email
Example: Application for the post of 'Accountant'
Example: Introduce your self and give purpose of your application
*All fields are mandatory.
META
1528 jobs found
AI Research Scientist - Language (Technical Leadership) at Meta
New York City, United States
Graphics Software Engineer- Pipeline/Tooling, Reality Labs (Avatars) at Meta
Bellevue, United States
Front End Data Science Developer at Meta
Fremont, United States
Associate General Counsel, Reality Labs at Meta
New York City, United States
Software Engineer - Product (Technical Leadership) at Meta
Menlo Park, United States
Program Manager, Payment Partnership Operations at Meta
New York City, United States
Software Engineer, Systems ML - HPC Specialist at Meta
New York City, United States
R&D Prototyping Technician at Meta
Pittsburgh, United States
Electrical Subject Matter Expert at Meta
Eagle Mountain, United States
Workforce Analyst, Mission Control Analytics at Meta
New York City, United States
10 Other Companies Worldwide
SumUp  
Financial Services
Spire  
Information Services
Spotify  
Musicians
Neo4j  
Software Development
Delta Capita  
Financial Services
Zscaler  
Computer and Network Security
Cohesity  
Software Development
IAG GBS  
Airlines and Aviation
England Rugby  
Spectator Sports
Faculty  
Technology, Information and Internet
1