About Indicium AI
Indicium AI is trusted by the world's leading enterprises to deliver AI into production at scale. We are a global, AI-native consultancy with deep expertise across Financial Services, Energy & Utilities, Healthcare & Life Sciences, Retail & CPG, and Manufacturing - guiding organizations from strategy through build to measurable business outcomes.
With 600+ AI experts, 50+ enterprise clients, and five global locations, we work side-by-side with the world's leading AI partners - including Anthropic, Databricks, AWS, OpenAI, and Microsoft - to deliver modern AI with speed, clarity, and lasting impact.
About the Opportunity
We're seeking an experienced AI Engineer to design, build, and deploy production-grade AI systems powered by large language models. This role sits at the intersection of software engineering and AI implementation, focusing on building reliable, scalable applications rather than model training or research.
You'll work with cutting-edge LLM technologies, building advanced AI systems that solve complex real-world problems through multi-agent orchestration, intelligent tool integration, and robust production workflows.
You'll be crafting the orchestration layer that makes these systems production-ready - handling failure modes, optimizing agent collaboration, and ensuring consistent, reliable outputs at scale.
You'll combine strong software engineering fundamentals with deep practical knowledge of LLM capabilities, limitations, and best practices for building non-deterministic systems that users can trust.
Key Responsibilities
- Design and implement production AI systems integrating LLMs, RAG pipelines, vector databases, and agentic frameworks
- Create evaluation frameworks to measure and monitor system performance, accuracy, and reliability
- Build and maintain production-grade AI applications with clean code, appropriate error handling, APIs, and data pipelines
- Implement, maintain, and evaluate retrieval systems (vector/graph databases, ingestion pipelines, chunking strategies, retrieval techniques such as HyDE)
- Implement feedback loops and observability to continuously improve system performance
- Craft effective prompts and optimize for latency, cost, and quality across different model providers and configurations
Preferred Qualifications
- Hands-on experience building applications with LLM APIs and deep understanding of their capabilities, limitations, and failure modes
- Practical experience implementing RAG architectures, vector databases, knowledge graphs, and prompt engineering
- Experience building multi-step LLM workflows and agentic systems using frameworks (e.g. SDK, Strands, Claude Agents SDK, LangGraph) or custom implementations where needed
- Strong proficiency in Python (or another modern programming language), with production API/service development experience and cloud platform knowledge (AWS, GCP, Azure)
- Understanding of distributed systems, CI/CD, testing frameworks, and deployment pipelines
- Solid grounding in the requirements, design, and implementation of production-grade, cloud-native platforms and infrastructure
- Strong data manipulation skills (pandas, SQL) and understanding of evaluation strategies for LLM-based systems
- Ability to work with ambiguity and optimize non-deterministic systems through experimentation and evaluation while balancing latency, cost, and quality tradeoffs
Nice to Haves
- Experience with AI-assisted coding using tools like Claude Code, OpenAI Codex, and GitHub Copilot
- Experience with fine-tuning LLMs for domain-specific applications and knowledge of when fine-tuning is preferable to prompt engineering or RAG
- Experience with real-time streaming, multimodal models, or search technologies like Elasticsearch
- Familiarity with model observability tools (LangSmith, Weights & Biases) and cost optimization strategies
- Experience in specialized verticals (financial services, energy, healthcare, legal, retail) with understanding of compliance, security, and responsible AI practices
- Experience setting up tool-calling agents, handoffs, and guardrails
Why Indicium AI
- Work on AI projects that actually transform the world's largest enterprises
- Use cutting-edge AI tools and technologies every single day
- Collaborate with global teams on high-impact, real-world solutions
- Be backed by a supportive team that's genuinely in your corner
- Benefit from serious investment in your learning and career growth
- Earn competitive compensation and benefits
- Enjoy company events and gatherings that bring the global team together
- Join a fast-growing company where ambitious careers thrive