Job Description
Shape the Future of Intelligence
Nexus Innovations is pioneering the next generation of Large Language Models and autonomous agents. We are seeking a visionary Senior Generative AI Engineer to join our elite engineering team in San Francisco. If you are passionate about pushing the boundaries of what is possible with AI, we want to hear from you.
In this role, you will architect and deploy cutting-edge AI solutions that power our global products. You will work closely with a team of world-class researchers and engineers to solve complex problems in natural language processing, multimodal learning, and real-time inference.
Responsibilities
- Model Architecture: Design, train, and fine-tune large-scale generative models (LLMs) using Python and PyTorch.
- Optimization: Optimize model inference latency and throughput for production environments, leveraging techniques like quantization and distillation.
- Collaboration: Partner with data scientists and product managers to define AI product requirements and roadmap.
- R&D: Stay at the forefront of AI research, implementing novel architectures and techniques into our production stack.
- MLOps: Build scalable pipelines for data preprocessing, model training, and continuous evaluation.
- Code Quality: Write clean, maintainable code and contribute to our open-source AI libraries.
Qualifications
- Education: Masterβs or PhD in Computer Science, Mathematics, or a related technical field.
- Experience: 5+ years of professional experience in machine learning, deep learning, or NLP.
- Tech Stack: Strong proficiency in Python, PyTorch, TensorFlow, and Hugging Face Transformers.
- LLM Experience: Proven track record of working with LLMs (e.g., GPT, Llama, Claude) and RAG (Retrieval-Augmented Generation) architectures.
- Problem Solving: Demonstrated ability to tackle complex algorithmic challenges and debug high-dimensional data.
- Communication: Excellent written and verbal communication skills for cross-functional collaboration.