AI Inference Engineer Job at Signify Technology, Sunnyvale, CA

N2k4SysvQ2d4ZzBlbkZybzZTRHJTWnZlcWc9PQ==
  • Signify Technology
  • Sunnyvale, CA

Job Description

AI Inference Engineer – Stealth Startup | San Fransisco Onsite

Compensation: $200K–$300K + equity

Join a stealth-stage team backed by prominent academic research and successful technical founders, working at the bleeding edge of AI infrastructure. As generative AI continues to scale rapidly, the bottleneck is no longer training—it’s inference. This team is rebuilding the core systems that power inference, from kernel-level GPU optimizations to full-stack distributed deployment.

This role is ideal for engineers who want to go deep: working on quantization, KV caching, attention mechanisms like FlashAttention, and designing new strategies for parallelism across heterogeneous compute. You'll contribute to an integrated software-hardware stack that enables large-scale model deployment with dramatically improved performance, efficiency, and quality—at production scale.

What You’ll Be Doing:

  • Research and implement state-of-the-art techniques to improve AI model inference speed and quality
  • Architect and optimize distributed AI infrastructure across both GPU kernel and software layers
  • Profile, benchmark, and debug system performance across varied hardware environments
  • Drive improvements in model execution through compiler-level tuning, caching, and runtime strategies

What They’re Looking For:

  • Bachelor's degree in Computer Science, Engineering, Applied Math, or a related field
  • Strong experience with performance optimization and systems-level thinking
  • Proficiency in Python, C++, and CUDA
  • Familiarity with AI frameworks like PyTorch, TensorFlow, ONNX, or vLLM

Nice to Have:

  • Graduate degree in a technical field
  • Experience with MLIR or other compiler frameworks
  • Hands-on work with large-scale GPU infrastructure or custom kernels

This is a hands-on, foundational role in a fast-moving environment, offering the chance to shape the backbone of the next generation of AI systems.

Job Tags

Similar Jobs

Ultimate Staffing

Part-time Administrative Assistant Job at Ultimate Staffing

 ...Lead - Emergency Services Location: Camarillo (Hybrid Role) MUST RESIDE IN VENTURA COUNTY Pay Rate: $23.00/hour Hours: Part-time, 25 hours/week Monday-Friday Duration: ASAP, Contract Assignment-5 months About the Role: We are seeking a proactive and... 

LEAP

Senior Structures Engineer Job at LEAP

 ...Your Mission Conduct in-depth structural analysis for critical rocket structures, components and assemblies, ensuring our vehicle withstands the demanding environment of sub-orbital and orbital flights. You will leverage advanced simulation tools, rigorous hand calculations... 

Motive Energy

Inside Sales Rep Job at Motive Energy

 ...Role: We are seeking a motivated and detail-oriented Inside Sales Representative to support our outside sales team. This role is...  ...verbal and written communication skills Experience using CRM software Ability to prioritize, multitask, and manage time... 

Siemens Energy

Project Manager, Steam Turbine Modifications Job at Siemens Energy

Snapshot of your Day Are you a dynamic, self-starting candidate who will be able to apply, build, and extend the needed network with all Steam Turbine product lines? Support and drive the Service business for our SU products M&U (Modifications and Upgrades), Small Industry...

Beaver Creek Ski Resort

Maintenance Crew - Housing May Be Available Job at Beaver Creek Ski Resort

 ...connections with teammates and guests from around the world. With 40+ resorts across 3 continents, you can join our team for a season or stay...  ...rates in the industry, free Epic pass(es) along with free ski and snowboard lessons, 40% retail discounts, the chance to grow...