CUDA Kernel Optimization Specialist - AI Trainer

United States · Posted 18 days ago

Analyze and optimize GPU kernels for performance, efficiency, and hardware utilization. Use profiler metrics to guide kernel improvements. Review GPU kernel implementations to identify bottlenecks without needing extensive algorithmic background.

Location: United States

Responsibilities

  • Write, modify, and reason about C++17, Python, and GPU programming code.
  • Apply CUDA, HIP, and shader programming expertise to improve performance outcomes.
  • Document optimization decisions clearly.

Requirements

  • At least 1 year of professional or graduate-level research experience with GPUs.
  • Strong understanding of GPU profiler performance metrics for kernel optimization.
  • Ability to optimize GPU kernels without deep prior context on every algorithm.
  • Fluent in core C++ features through C++17.
  • Working knowledge of Python and Git.
  • Fluent in at least one GPU programming model, such as CUDA, HIP, Slang, HLSL, or GLSL.

Additional Information

  • Originally posted on Himalayas

Location

United States

Category

Engineering

Source

Himalayas

Posted

18 days ago
