Job Description
Are you ready to define the technological landscape of 2026? Nexus Future Labs is pioneering the next generation of artificial intelligence, and we are seeking a visionary Senior Generative AI Engineer to join our elite team in San Francisco.
In this pivotal role, you will architect and deploy state-of-the-art Large Language Models (LLMs) and multimodal systems. We are not just building AI; we are building the future infrastructure that will power the world's most innovative enterprises. If you are passionate about pushing the boundaries of what is possible with NLP and Deep Learning, we want to hear from you.
Key Highlights:
- Work on cutting-edge LLM fine-tuning and RAG (Retrieval-Augmented Generation) architectures.
- Competitive compensation package including performance-based equity.
- Access to the latest H100 GPU clusters and proprietary datasets.
- Flexible hybrid work model with a premium office environment.
Responsibilities
- Design and implement scalable LLM inference pipelines using PyTorch and TensorFlow.
- Optimize model performance for low-latency, high-throughput production environments.
- Develop and fine-tune open-source models (e.g., Llama 3, Mistral) for specific enterprise use cases.
- Collaborate with cross-functional teams to integrate AI agents into web and mobile applications.
- Conduct rigorous research on emerging AI architectures and assess how they fit the 2026 roadmap.
- Mentor junior engineers and foster a culture of technical excellence.
Qualifications
- PhD or Master's degree in Computer Science, AI, or a related technical field.
- 5+ years of professional experience in Machine Learning and Deep Learning.
- Expert proficiency in Python and C++, and strong experience with distributed systems.
- Strong understanding of Transformer architectures, attention mechanisms, and tokenization.
- Experience with vector databases (Pinecone, Milvus) and LLM orchestration frameworks (LangChain, LlamaIndex).
- Proven track record of deploying models to production with high availability.