Back to Jobs

[Remote] Generative AI Inference Engineer

Remote, USA Full-time Posted 2025-11-24
Note: The job is a remote job and is open to candidates in USA. Stability AI is seeking passionate Machine Learning Engineers to join their Inference team, focusing on the creative applications of generative AI models. The role involves leading the design and development of customer-facing multi-modal ML inference systems and collaborating with various teams to optimize and deploy cutting-edge models. Responsibilities • Lead efforts to drive the design, development of customer-facing multi modal ML inference systems • Work with the Platform and Inference teams on building inference systems for the next generation of models, where you will work on areas such as optimization, model tuning and deployment • Partner with leading cloud providers to deliver hosted Stability AI inference solutions • Be a strategic thought partner for leaders across the organization on driving business impact through machine learning • Be part of the team to bring new Stability models and pipelines into existence • Prototype and productionize inference platform improvements and new features Skills • 7+ years working on productionizing machine learning systems, including inference pipeline development • Expert level knowledge on writing and running python services at scale • 5+ years working on python scientific stack, pyTorch and at least one high-performance inference framework (e.g. Triton and TensorRT) • Deep understanding of Diffusion Architecture • Experience profiling and optimizing deep neural networks on Nvidia GPUs, using profiling tools such as NVIDIA Nsight • Experience with python-based image manipulation/encoding/decoding frameworks, such as OpenCV • Experience deploying to cloud orchestration systems such as Kubernetes and cloud providers such as AWS, GCP, and Azure • Experience with Docker • Ability to rapidly prototype solutions and iterate on them with tight product deadlines • Strong communication, collaboration, and documentation skills • Experience with the open-source ML ecosystem (HuggingFace, W&B, etc.) Company Overview • Stability AI is an artificial intelligence company focused on developing open-source generative AI models. It was founded in 2019, and is headquartered in London, England, GBR, with a workforce of 51-200 employees. Its website is https://stability.ai. Apply tot his job Apply To this Job

Similar Jobs

Executive Assistant, Digital Media [Remote]

Remote, USA Full-time

Analyst, Research, Insights, and Analytics (Burbank) Burbank, CA, USA

Remote, USA Full-time

Director Product Management, AI

Remote, USA Full-time

Remote Graphic Design Internship

Remote, USA Full-time

Internal Audit Administrator

Remote, USA Full-time

Manager of Global Audit & Assurance Services

Remote, USA Full-time

Risk Analyst at Climate First Bank Florida

Remote, USA Full-time

Beauty Counter Manager - Charlotte Tilbury - Roosevelt Field

Remote, USA Full-time

Remote B2B Campaign Manager

Remote, USA Full-time

Branch Manger Dade Southeast District (Remote - hybrid)

Remote, USA Full-time

Allstate – UM/UIM Represented Casualty Adjuster – Remote – California

Remote, USA Full-time

Experienced Cancellations Processor for Automotive Industry - Remote Opportunity at blithequark

Remote, USA Full-time

Remote Part Time Data Entry Clerk - $1400 weekly

Remote, USA Full-time

Experienced Sales Customer Experience Specialist for Evening and Overnight Shifts - Remote Work Opportunity with blithequark

Remote, USA Full-time

“Turn Words into Income: Remote General Transcription Jobs with Flexible Hours & Great Pay”

Remote, USA Full-time

Require Freelance Online German (DaF) Teacher in Portland, OR

Remote, USA Full-time

Senior DevOps Engineer

Remote, USA Full-time

(Regional Remote) Territory Sales Consultant - HC

Remote, USA Full-time

[Hiring] Nurse RN | North PAT @UF Health

Remote, USA Full-time

Data Analyst Consultant - #1774

Remote, USA Full-time