Job Description
We are seeking a visionary Lead Generative AI Architect to pioneer the next frontier of artificial intelligence. Based in our San Francisco headquarters, you will be responsible for designing the core infrastructure for our upcoming 2026 generative ecosystem, focusing on scalable Large Language Models (LLMs) and multi-modal agents. This is a high-impact role for an engineer who thrives on ambiguity and wants to shape the future of technology.
Why Join Nexus AI Labs?
As a leader in the AI space, we offer a unique opportunity to work with state-of-the-art hardware and proprietary datasets. You will be at the intersection of research and production, driving innovations that define the next decade of human-computer interaction.
Responsibilities
- Architect and lead the end-to-end development of proprietary generative AI models and infrastructure.
- Optimize model inference latency and cost-efficiency for high-volume production environments.
- Define technical standards for AI safety, alignment, and ethical deployment.
- Collaborate with cross-functional teams to integrate AI capabilities into consumer and enterprise products.
- Conduct research into state-of-the-art architectures, including Transformer variants and Mixture-of-Experts.
- Mentor junior engineers and data scientists to foster a culture of technical excellence.
Qualifications
- Masterβs degree or PhD in Computer Science, Mathematics, or a related technical field.
- 8+ years of software engineering experience, with at least 4 years focused on Machine Learning or AI systems.
- Deep proficiency in Python, PyTorch, TensorFlow, and distributed computing systems (Kubernetes, AWS/GCP).
- Proven experience deploying LLMs at scale, including fine-tuning and RAG (Retrieval-Augmented Generation) implementation.
- Strong understanding of deep learning principles, NLP, and neural network architectures.
- Excellent problem-solving skills and the ability to communicate complex technical concepts to non-technical stakeholders.