Senior Research Engineer - Performance Optimization

Company: Luma AI
Location: San Jose
Posted on: June 1, 2025

Job Description:

We are looking for engineers with significant problem solving experience in PyTorch, CUDA and distributed systems. You will work with Research Scientists to build & train cutting edge foundation models on thousands of GPUs.Responsibilities

Ensure efficient implementation of models & systems for data processing, training, inference and deployment.
Identify and implement optimization techniques for massively parallel and distributed systems.
Identify and remedy efficiency bottlenecks (memory, speed, utilization) by profiling and implementing high-performance CUDA, Triton, C++ and PyTorch code.
Work closely together with the research team to ensure systems are planned to be as efficient as possible from start to finish.
Build tools to visualize, evaluate and filter datasets.
Implement cutting-edge product prototypes based on multimodal generative AI.Experience
- Experience training large models using Python & Pytorch, including practical experience working with the entire development pipeline from data processing, preparation & data loading to training and inference.
- Experience optimizing and deploying inference workloads for throughput and latency across the stack (inputs, model inference, outputs, parallel processing etc.).
- Experience with profiling CPU & GPU code in PyTorch, including Nvidia Nsight or similar.
- Experience writing & improving highly parallel & distributed PyTorch code, with familiarity in DDP, FSDP, Tensor Parallel, etc.
- Experience writing high-performance parallel C++. Bonus if done within an ML context with PyTorch, like for data loading, data processing, inference code.
- Experience with high-performance Triton / CUDA and writing custom PyTorch kernels. Top candidates will be able to utilize tensor cores; optimize performance with CUDA memory and other similar skills.
- Good to have experience working with Deep learning concepts such as Transformers & Multimodal Generative models such as Diffusion Models and GANs.
- Good to have experience building inference / demo prototype code (incl. Gradio, Docker etc.).Compensation
  - The pay range for this position in California is $180,000 - $250,000yr; however, base pay offered may vary depending on job-related knowledge, skills, candidate location, and experience. We also offer competitive equity packages in the form of stock options and a comprehensive benefits plan.$200,000 - $280,000 a yearIn addition to cash base pay, you'll also receive a sizable grant of Luma's equity.The pay range for this position is for Bay Area. Base pay offered may vary depending on job-related knowledge, skills, candidate location, and experience.Your applications are reviewed by real people.
    #J-18808-Ljbffr

Keywords: Luma AI, San Leandro , Senior Research Engineer - Performance Optimization, Engineering , San Jose, California

Click here to apply!

Didn't find what you're looking for? Search again!

Let San Jose recruiters find you. Post your resume for free!

Get San Jose Engineering jobs via email.

View more San Leandro Engineering jobs

Other Engineering Jobs

Information Security Engineer, Tides
Description: Tides is a nonprofit and philanthropic organization committed to advancing social justice. We work across the social sector to shift power to communities of color and other groups historically denied (more...)
Company: ABFE
Location: San Francisco
Posted on: 05/26/2025

DevRel Engineer
Description: Exa is building an API to the world's knowledge unlike any that have ever existed. The people need to know -- through badass demos, tweets, hackathons, API docs, and more.Want to get the world excited (more...)
Company: Exa
Location: San Francisco
Posted on: 05/26/2025

Senior Applications Engineer
Description: As a Senior Application Engineer for the PI Expert team, you will be responsible for design and development of Power Integrations official design tool - . This role is highly collaborative and requires (more...)
Company: Power Integrations, Inc.
Location: San Jose
Posted on: 05/26/2025

Salary in San Leandro, California Area | More details for San Leandro, California Jobs |Salary

Engineer - Product Architecture
Description: Our vision is to transform how the world uses information to enrich life for all.Micron Technology is a world leader in innovating memory and storage solutions that accelerate the transformation of information (more...)
Company: Micron Memory Malaysia Sdn Bhd
Location: San Jose
Posted on: 05/26/2025

Sr. IT Engineer
Description: Select how often in days to receive an alert: Create AlertLocation: San Jose, California, United StatesAbout Supermicro:Supermicro is a top-tier provider of advanced server, storage, and networking (more...)
Company: Support Revolution
Location: San Jose
Posted on: 05/26/2025

Project Engineer
Description: Location 423 Construction Main Office-San JoseThis position is responsible for planning, developing, coordinating, managing onsite construction and engineering activities and ensuring alignment with Graniterock's (more...)
Company: International Executive Service Corps
Location: San Jose
Posted on: 05/26/2025

EVM Engineer - Magic Eden
Description: 180,000 - 220,000 per yearWe are a multi-chain NFT Platform, pushing boundaries on Solana, Ethereum, Polygon, and Bitcoin Our mission is to make digital ownership universal to supercharge the internet (more...)
Company: deCircle
Location: San Jose
Posted on: 05/26/2025

Senior Full-stack Engineer (C# + .NET)
Description: Our client is a startup dedicated to wellness and health. Their app is tailored to both businesses and individuals invested in their health and well-being. If you enjoy working with cutting-edge technologies (more...)
Company: Softwaremind
Location: San Jose
Posted on: 05/26/2025

Cyber Security Engineer
Description: Job Title: IT Security EngineerLocation: San Jose, CA 95110Duration: 6 Months Responsibilities: li Work with Client's Identity and Access
Company: TalentBurst Inc
Location: San Jose
Posted on: 05/26/2025

Senior Application Engineer ( AC/DC )
Description: This is a senior level position in application/system engineering. Responsibilities include research in new switching power technology, defining system power architecture, digital, analog,
Company: Analog Group
Location: San Jose
Posted on: 05/26/2025

Loading more jobs...

Senior Research Engineer - Performance Optimization

Didn't find what you're looking for? Search again!

Other Engineering Jobs

Log In or Create An Account