Job Description
We are on a mission to engineer the cognitive layer of the digital world for 2026. Aetheria AI is seeking a visionary Senior AI/LLM Architect to lead the next generation of autonomous systems. You will not just be fine-tuning models; you will be architecting the infrastructure for Agentic Workflows and Multimodal Reasoning.
In this role, you will bridge the gap between theoretical machine learning research and production-grade deployment at scale. If you are passionate about pushing the boundaries of what Large Language Models can achieve in complex environments, we want to hear from you.
Responsibilities
- Architect Next-Gen LLMs: Design and implement state-of-the-art foundation models, focusing on reasoning, memory, and multi-step planning capabilities.
- Optimize Inference Pipelines: Engineer high-performance inference engines to reduce latency and cost while maximizing throughput for real-time applications.
- Research & Innovation: Stay at the forefront of AI trends, evaluating and integrating emerging architectures such as MoE (Mixture of Experts) and Hybrid Retrieval-Augmented Generation (RAG).
- System Integration: Collaborate with backend teams to integrate AI agents into complex enterprise ecosystems, ensuring seamless human-AI interaction.
- Mentorship: Guide a team of talented ML engineers and data scientists, fostering a culture of technical excellence and innovation.
Qualifications
- Expertise in Deep Learning: 5+ years of experience building and deploying NLP models at scale.
- Proficiency in Frameworks: Strong command of Python, PyTorch, and TensorFlow, plus hands-on experience with Hugging Face Transformers and LangChain.
- Advanced Mathematical Foundation: Solid understanding of linear algebra, calculus, and probability theory.
- Production Experience: Demonstrable track record of deploying models in high-traffic, low-latency environments on AWS, GCP, or Azure.
- Education: Master's degree or PhD in Computer Science, Artificial Intelligence, or a related field.