Software Engineer - ML/LLM Inference (San Francisco) Job at Alldus, San Francisco, CA

  • Alldus
  • San Francisco, CA

Job Description

My client is searching for a talented engineer to work on ML/LLM inference and serving. They specialize in developing next-gen LLM fine-tuning and inference engines.

We are seeking a talented and motivated Software Engineer specializing in Machine Learning (ML) and Large Language Model (LLM) inference to join our dynamic ML Inference team. In this role, you will bridge the gap between AI/ML research and systems programming to build and enhance our next-generation LLM Inference Engine. You will play a crucial role in optimizing the performance, scalability, and efficiency of our LLM serving systems.

Key Responsibilities:

Develop and Enhance Inference Engine:

  • Design, implement, and optimize the next-generation LLM Inference Engine.
  • Integrate the latest LLM inference techniques from research to reduce latency and increase throughput.

Performance Optimization:

  • Conduct deep performance optimizations across multiple layers of the technology stack, including PyTorch, C++, and CUDA.
  • Analyze and improve system performance to meet the demands of various use cases.
  • Work closely with customers to understand specific performance requirements and optimize solutions accordingly.
  • Provide technical expertise and support to ensure successful deployment and operation of inference systems.

Technical Leadership:

  • Define the roadmap and technical vision for the inference stack.
  • Lead initiatives to drive innovation and maintain the competitive edge of our inference technologies.

Infrastructure Development:

  • Collaborate with partner teams to build and maintain scalable, multi-replica serving infrastructure.
  • Ensure the reliability and scalability of LLM serving systems to handle increasing workloads.

Qualifications:

Technical Skills:

  • Proficiency in systems programming languages such as C++.
  • Strong experience with machine learning frameworks, particularly PyTorch.
  • Expertise in GPU programming and CUDA for performance optimization.
  • Solid understanding of AI/ML concepts, especially related to large language models.

Experience:

  • Proven experience in developing and optimizing ML/LLM inference systems.
  • Demonstrated ability to integrate research advancements into production systems.
  • Experience with performance tuning and profiling across various technology stacks.
  • Experience with vLLM.

Seniority level

  • Mid-Senior level

Employment type

  • Full-time

Industries

  • Staffing and Recruiting and Software Development
