AI Inference Engineer Job at Signify Technology, Sunnyvale, CA

N2k4SysvQ2d4ZzBlbkZybzZTRHJTWnZlcWc9PQ==
  • Signify Technology
  • Sunnyvale, CA

Job Description

AI Inference Engineer – Stealth Startup | San Fransisco Onsite

Compensation: $200K–$300K + equity

Join a stealth-stage team backed by prominent academic research and successful technical founders, working at the bleeding edge of AI infrastructure. As generative AI continues to scale rapidly, the bottleneck is no longer training—it’s inference. This team is rebuilding the core systems that power inference, from kernel-level GPU optimizations to full-stack distributed deployment.

This role is ideal for engineers who want to go deep: working on quantization, KV caching, attention mechanisms like FlashAttention, and designing new strategies for parallelism across heterogeneous compute. You'll contribute to an integrated software-hardware stack that enables large-scale model deployment with dramatically improved performance, efficiency, and quality—at production scale.

What You’ll Be Doing:

  • Research and implement state-of-the-art techniques to improve AI model inference speed and quality
  • Architect and optimize distributed AI infrastructure across both GPU kernel and software layers
  • Profile, benchmark, and debug system performance across varied hardware environments
  • Drive improvements in model execution through compiler-level tuning, caching, and runtime strategies

What They’re Looking For:

  • Bachelor's degree in Computer Science, Engineering, Applied Math, or a related field
  • Strong experience with performance optimization and systems-level thinking
  • Proficiency in Python, C++, and CUDA
  • Familiarity with AI frameworks like PyTorch, TensorFlow, ONNX, or vLLM

Nice to Have:

  • Graduate degree in a technical field
  • Experience with MLIR or other compiler frameworks
  • Hands-on work with large-scale GPU infrastructure or custom kernels

This is a hands-on, foundational role in a fast-moving environment, offering the chance to shape the backbone of the next generation of AI systems.

Job Tags

Similar Jobs

The Newport Group - Executive Recruiters

Construction Superintendent Job at The Newport Group - Executive Recruiters

 ...Were looking for Construction Superintendents with strong experience managing Type III and Type V wood-frame construction over concrete specifically on wrap and podium-style multifamily projects. Arcadia, CA multiple positions available (Podium-style project... 

Tata Electronics

Operations Program Manager Job at Tata Electronics

 ...TEPL) is a greenfield venture of the Tata Group. The Tata Group operates in more than 100 countries across six continents, with the...  ...seeking a highly motivated and experienced Operations Program Manager (OPM) to join our US team. The OPM will play a pivotal role in... 

UHY-US

Tax Manager Job at UHY-US

As a Tax Manager, you will be responsible for overseeing and reviewing financial information for clients, such as business and individual tax returns, with the benefit of gaining exposure to a diverse client base operating in a variety of industries. The Tax Manager oversees...

Safra National Bank

Senior Banking Operations Manager Job at Safra National Bank

 ...Senior Banking Operations Manager VP | Aventura, Florida About Safra National Bank of New York Safra National Bank of New York ("Safra National") is a nationally chartered U.S. Bank supervised by the Office of the Comptroller of the Currency and member of the Federal... 

Air Distribution Technologies, Inc.

ADMINISTRATIVE ASSISTANCE-REMOTE POSITION Job at Air Distribution Technologies, Inc.

 ...Summary : The Executive Administrative Assistant will provide comprehensive administrative support to senior executive members, both onsite and remote, to ensure the efficient operation of the Plano office. This role involves a mix of advanced and managerial tasks requiring...