AI Inference Engineer Job at Signify Technology, San Mateo, CA

bmg1ZWtkSDFKY1g4S2VBY3VmS1p5Z0tvVWc9PQ==
  • Signify Technology
  • San Mateo, CA

Job Description

AI Inference Engineer – Stealth Startup | San Fransisco Onsite

Compensation: $200K–$300K + equity

Join a stealth-stage team backed by prominent academic research and successful technical founders, working at the bleeding edge of AI infrastructure. As generative AI continues to scale rapidly, the bottleneck is no longer training—it’s inference. This team is rebuilding the core systems that power inference, from kernel-level GPU optimizations to full-stack distributed deployment.

This role is ideal for engineers who want to go deep: working on quantization, KV caching, attention mechanisms like FlashAttention, and designing new strategies for parallelism across heterogeneous compute. You'll contribute to an integrated software-hardware stack that enables large-scale model deployment with dramatically improved performance, efficiency, and quality—at production scale.

What You’ll Be Doing:

  • Research and implement state-of-the-art techniques to improve AI model inference speed and quality
  • Architect and optimize distributed AI infrastructure across both GPU kernel and software layers
  • Profile, benchmark, and debug system performance across varied hardware environments
  • Drive improvements in model execution through compiler-level tuning, caching, and runtime strategies

What They’re Looking For:

  • Bachelor's degree in Computer Science, Engineering, Applied Math, or a related field
  • Strong experience with performance optimization and systems-level thinking
  • Proficiency in Python, C++, and CUDA
  • Familiarity with AI frameworks like PyTorch, TensorFlow, ONNX, or vLLM

Nice to Have:

  • Graduate degree in a technical field
  • Experience with MLIR or other compiler frameworks
  • Hands-on work with large-scale GPU infrastructure or custom kernels

This is a hands-on, foundational role in a fast-moving environment, offering the chance to shape the backbone of the next generation of AI systems.

Job Tags

Similar Jobs

R.E. Beckner Construction, Inc.

CDL Driver - Ready Mix Job at R.E. Beckner Construction, Inc.

 ...time Pay: $23/hour during training Expected Hours: 40 hours per week Overview Frontier Ready Mix, Inc. is seeking a skilled and dependable Truck Driver with experience operating concrete mixer trucks and transporting materials to construction sites. The... 

Air Sea Packing Group

Art Handler Job at Air Sea Packing Group

Part-Time Art Handler If youre looking to grow in the world of fine art logistics and you have a passion for it, then we just may have the perfect role for you! Air Sea Packing Group operates multinational white-glove logistics network which transports artwork, antiques... 

Hampton

Night Auditor Job at Hampton

 ...~ High school or equivalent (Preferred) Experience: ~ Hotel experience: 1 year (Preferred) Work Location: One location Pay: $16.00 - $17.00 per hour Expected hours: 16 per week Schedule: ~8 hour shift ~ Night shift ~ Weekends only... 

Busy Bees Home Care

Life Skills Coach Job at Busy Bees Home Care

 ...Job Summary The Life Skills Coach provides direct support to individuals with intellectual and developmental disabilities, empowering them to lead more independent and fulfilling lives. This role focuses on assisting individuals both in their homes and within community... 

INSPYR Solutions

Qualitative Researcher Job at INSPYR Solutions

 ...Experience building mobile experiences and/or working with youth and teens Research experience in emerging technologies such as AR/VR...  ...quality is our commitment. As a national expert in delivering flexible technology and talent solutions, we strategically align...