Sai Harsha Kondaveeti

|

London, UK · Open to AI/LLM Engineering Roles

Building production-grade AI systems with agentic architectures, retrieval-augmented generation, and evaluation-driven ML infrastructure.

Sai Harsha Kondaveeti - AI/ML Engineer

What I Build

Production-grade AI systems with measurable outcomes, structured orchestration, and deployment readiness

Agentic Systems

Designing structured multi-agent workflows with clear execution boundaries and controlled orchestration.

Retrieval-Augmented Generation (RAG)

Modular RAG pipelines with explicit separation of retrieval, orchestration, prompting, and evaluation.

Production ML Pipelines

Scalable ML systems with observability, reliability, and deployment readiness.

Featured Projects

View All

RAG Foundry

Framework for building RAG systems with separated retrieval, ranking, and generation components, plus tools to measure quality across datasets.

Core Idea

Separate retrieval, orchestration, prompting, and evaluation into independent, testable modules

Architecture

Plugin-based retriever system with swappable vector stores and configurable chunking strategies

What You Can Reuse

Evaluation harness with dataset-driven metrics (GitHub template available)

PythonLangChainRAGEvaluation

Agiorcx Lib

Library that provides an execution layer for AI agents, with explicit control flow, error handling, and guardrails for production use.

Core Idea

Explicit control flow for agents with guardrails, rollback mechanisms, and execution logging

Architecture

State machine-based orchestration with pre/post execution hooks and audit trails

What You Can Reuse

Agent coordinator pattern with guardrails (library + examples)

PythonAgent OrchestrationControl FlowLLMs

Evallit

Evaluation toolkit for LLM and RAG systems that runs dataset-based checks, records metrics, and exposes hooks for monitoring and debugging.

Core Idea

Define test datasets upfront, run automated evaluation passes, track metric trends over iterations

Architecture

Pluggable metric system (ROUGE, semantic similarity, custom scorers) with experiment tracking

What You Can Reuse

Evaluation pipeline template with metric collectors and reporting dashboards

PythonLLM EvaluationMetricsObservability

LIA Swarm

Proof-of-concept multi-agent system that applies RAG Foundry design principles to coordinate agents across retrieval, reasoning, and response steps.

Core Idea

Demonstrate coordinated multi-agent workflows with shared context and structured communication protocols

Architecture

Message bus for agent communication with priority queuing and context inheritance

What You Can Reuse

Multi-agent coordination pattern with message schemas (reference implementation)

PythonMulti-AgentRAGCoordination

Recent Writing

View All

Notes and deep dives from building RAG systems, agentic workflows, and production ML tooling.

A Small Design Lesson From Building a RAG Ingestion System

A Small Design Lesson From Building a RAG Ingestion System

Lesson 1 from Building RAG Library by Sai Harsha Kondaveeti...

AI/MLEngineering
Read on Substack
From Predictive to Agentic AI: A practical map of ML modeling types

From Predictive to Agentic AI: A practical map of ML modeling types

ML Modeling Map by Sai Harsha Kondaveeti | AI Engineer...

AI/MLEngineering
Read on Substack
LLMs aren't the only way: The Real Future of AI Nobody’s Talking About

LLMs aren't the only way: The Real Future of AI Nobody’s Talking About

110+ research papers analyzed: What production AI engineers need to know about neuro-symbolic systems, world models, embodied intelligence, and the post-LLM era...

AI/MLEngineering
Read on Substack