Home Job Details
N
Information Technology 🏒 Full Time ⭐️ Verified

Senior AI Research Engineer - Generative Models

Nebula AI Systems
San Francisco
Estimated Salary
USD 180.000 – USD 260.000
New
Live Update
30 Juni 2026
Deadline
30 Jun 2027

Job Description

We are building the infrastructure for the next generation of Artificial Intelligence. At Nebula AI Systems, we don't just predict the future; we engineer it. We are seeking a visionary Senior AI Research Engineer to lead our cutting-edge Generative AI initiatives.

In this role, you will push the boundaries of what is possible with Large Language Models (LLMs) and diffusion models. You will work in a high-performance environment, collaborating with world-class researchers and engineers to deploy models that redefine human-computer interaction.

Why Join Us?

  • Work with state-of-the-art hardware (H100 clusters).
  • Competitive equity package and top-tier healthcare.
  • Flexible remote-first culture with annual team retreats.
  • Direct impact on the roadmap of AI products used by millions.

Responsibilities

  • Lead the research and development of novel Generative AI architectures, specifically focusing on scaling laws and efficient fine-tuning techniques.
  • Design and optimize inference pipelines to ensure low-latency, high-throughput model deployment in production environments.
  • Collaborate with cross-functional teams (Product, Engineering, Design) to translate technical research into scalable product features.
  • Mentor junior researchers and provide technical guidance on complex machine learning problems.
  • Publish high-impact research papers and contribute to open-source communities.
  • Evaluate and benchmark new model architectures against state-of-the-art competitors.

Qualifications

  • PhD or Master’s degree in Computer Science, Mathematics, or a related quantitative field.
  • 5+ years of professional experience in Deep Learning, specifically in Natural Language Processing (NLP).
  • Strong proficiency in Python, PyTorch, or TensorFlow.
  • Proven track record of publishing at top-tier conferences (NeurIPS, ICML, ACL, or ICLR).
  • Experience with model quantization, distillation, and deployment on cloud infrastructure (AWS, GCP, or Azure).
  • Deep understanding of transformer architectures and attention mechanisms.

Required Skills

Python PyTorch TensorFlow NLP Transformers LLM Large Language Models Deep Learning CUDA AWS GCP Machine Learning Distributed Systems

Ready to Take This Challenge?

Make sure your resume is ready. Submit your application now before the deadline.

Apply Now

Related Jobs

Similar job recommendations for you

View All