Senior AI Engineer

Contract/Contract-To-Hire/Remote (W-2 & No Visa Sponsorship or Transfer Available)

We’re looking for a Senior ML Engineer with strong hands-on experience in GenAI agent development and modern AI engineering. This role focuses on building production-grade agents, MCP integrations, and enterprise knowledge systems using our cloud and data stack.

Key Skills

Experience building and deploying GenAI agents (including open-source LLM agents) in production environments.
Strong knowledge of MCP, tool integrations, and agent orchestration.
Ability to design and maintain knowledge bases, vector search, and retrieval systems.
Cloud expertise in Azure (preferred) or AWS.
Strong SQL skills and experience with Snowflake.
Experience working with Databricks for data and ML workflows.
Solid background in traditional ML (classification, clustering, etc.).
Ability to build evals, guardrails, and safety layers for agents at scale.
Experience with MLOps and LLM Observability.
Experience building CI/CD pipelines (Github, DevOps).

What This Person Will Do

Design, build, and maintain production‑grade APIs, microservices, and internal SDKs that support AI models, LLM agents, retrieval pipelines, and tool integrations.
Develop and refine GenAI and agentic systems, including MCP tools, secure system integrations, and scalable agent‑orchestration infrastructure.
Create and maintain knowledge bases, retrieval pipelines, and vector search systems to power RAG and agent workflows across internal platforms.
Implement comprehensive observability for LLM agents, retrieval pipelines, and AI microservices, covering monitoring, logging, tracing, drift detection, hallucination tracking, latency, cost, and quality metrics.
Build automated evaluation and regression‑testing pipelines for agents, prompts, tools, and model updates to ensure reliability and continuous improvement.
Develop frameworks for prompt versioning, experiment tracking, reproducibility, and model governance, ensuring consistent and auditable AI development practices.
Establish and maintain MLOps and LLMOps pipelines, including model training, deployment, CI/CD and environment promotion (dev → QA → prod)
Optimize model serving and inference infrastructure for performance and cost efficiency (batching, caching, quantization, GPU/CPU autoscaling).
Collaborate with data scientists and cloud teams to productionize models, ensure reproducibility, and support scalable AI systems.
Work across Azure, Snowflake, and Databricks to support production AI systems, data pipelines, and model deployments.

Get in Touch