Principal Architect, AI/ML

New

Fully remote work environment within the United States.Full-TimePrincipal

Salary not disclosed

Apply NowOpens the employer's application page

Job Details

Master’s degree in Computer Science, Mathematics, Engineering, or a related technical field, or equivalent practical experience.
Extensive experience in senior or principal architecture roles delivering production-grade AI/ML systems at scale.
Strong hands-on experience with at least one major cloud platform (Google Cloud strongly preferred; AWS or Azure acceptable).
Deep expertise in LLM optimization techniques including fine-tuning (e.g., LoRA), quantization, and efficient inference (e.g., vLLM, TensorRT-LLM).
Experience with modern ML frameworks such as PyTorch, JAX, or PyTorch/XLA for training and deployment.
Proven ability to design and implement agentic AI systems using frameworks like LangChain, LangGraph, or similar tools.
Strong understanding of secure, private, and data-sovereign AI architecture patterns.
Familiarity with observability and evaluation tools for LLM systems (e.g., LangSmith, Langfuse, Vertex AI Evaluation).
Excellent communication and stakeholder management skills, with the ability to translate technical depth into business impact.
Passion for mentoring, innovation, and staying at the forefront of AI/ML advancements.

Serve as the senior technical authority on AI/ML architecture, guiding the design and implementation of advanced machine learning and generative AI systems.
Lead pre-sales and solution design efforts, partnering with business development teams to scope opportunities and present technical solutions to clients.
Architect end-to-end AI systems, including model training, fine-tuning, optimization, deployment, and production-scale serving.
Design secure, privacy-compliant, and data-sovereign AI solutions aligned with regulatory requirements such as GDPR.
Guide the selection and application of cloud-native technologies, with emphasis on Google Cloud and large-scale AI infrastructure.
Lead the design of agentic systems and workflows using modern frameworks such as LangGraph, LangChain, and similar tooling.
Optimize model performance and cost efficiency through techniques such as quantization, pruning, and advanced serving strategies.
Collaborate with cloud and data architects to ensure cohesive, scalable end-to-end solutions for enterprise clients.
Provide technical mentorship to internal teams and clients, establishing best practices in high-performance AI engineering.
Act as a thought leader through external contributions such as technical writing, presentations, and industry engagement.

View Full Description & ApplyYou'll be redirected to the employer's site