Home Job Details
A
Artificial Intelligence 🏒 Full Time ⭐️ Verified

Senior AI/LLM Architect - Shaping 2026 Intelligence

Aetheria AI
Austin
Estimated Salary
USD 180.000 – USD 250.000
Live Update
19 Mei 2026
Deadline
19 Mei 2027

Job Description

We are on a mission to engineer the cognitive layer of the digital world for 2026. Aetheria AI is seeking a visionary Senior AI/LLM Architect to lead the next generation of autonomous systems. You will not just be fine-tuning models; you will be architecting the infrastructure for Agentic Workflows and Multimodal Reasoning.

In this role, you will bridge the gap between theoretical machine learning research and production-grade deployment at scale. If you are passionate about pushing the boundaries of what Large Language Models can achieve in complex environments, we want to hear from you.

Responsibilities

  • Architect Next-Gen LLMs: Design and implement state-of-the-art foundation models, focusing on reasoning, memory, and multi-step planning capabilities.
  • Optimize Inference Pipelines: Engineer high-performance inference engines to reduce latency and cost while maximizing throughput for real-time applications.
  • Research & Innovation: Stay at the forefront of AI trends, evaluating and integrating emerging architectures such as MoE (Mixture of Experts) and Hybrid Retrieval-Augmented Generation (RAG).
  • System Integration: Collaborate with backend teams to integrate AI agents into complex enterprise ecosystems, ensuring seamless human-AI interaction.
  • Mentorship: Guide a team of talented ML engineers and data scientists, fostering a culture of technical excellence and innovation.

Qualifications

  • Expertise in Deep Learning: 5+ years of experience building and deploying NLP models at scale.
  • Proficiency in Frameworks: Strong command of Python, PyTorch, TensorFlow, and experience with Hugging Face Transformers and LangChain.
  • Advanced Mathematical Foundation: Solid understanding of linear algebra, calculus, and probability theory.
  • Production Experience: Demonstrable track record of deploying models in high-traffic, low-latency environments (AWS, GCP, or Azure).
  • Education: Master’s degree or PhD in Computer Science, Artificial Intelligence, or a related field.

Required Skills

Python PyTorch TensorFlow NLP Large Language Models Machine Learning AWS GCP Data Science Deep Learning Transformers CUDA Docker Kubernetes

Ready to Take This Challenge?

Make sure your resume is ready. Submit your application now before the deadline.

Apply Now

Related Jobs

Similar job recommendations for you

View All