Kethan Goparapu

Building production systems at the intersection of software engineering and machine learning at Observe.AI

Backend services, retrieval pipelines, model serving infrastructure, and the tooling to ship ML to production reliably.

200K+
queries/day served
8-GPU
cluster managed
34%
hallucination reduction

Projects

Larger systems I designed, built, and maintain. Code available on request.

Tools I Built

Problems I kept running into at work, turned into reusable tools.

Experience

Observe.AI

ML Engineer

Jun 2024 — Present
San Francisco, CA
  • Rebuilt retrieval from keyword search to hybrid (BGE-large + BM25 + cross-encoder reranking)62% → 84% relevance
  • Fine-tuned Llama-3 8B with QLoRA on 90K transcript pairs — matches GPT-4 on domain evals at ⅓ cost
  • Built and deployed model serving on Kubernetes + Triton with dynamic batchingp99: 3s → 500ms
  • Built nightly eval framework against 2,000 labeled call scenarios; blocks deploys on regression
  • Migrated training from cron jobs to Airflow DAGs with parallel feature computation8h → 90min
  • Set up Grafana monitoring for inference latency, confidence drift, embedding driftcaught 2 silent degradations

Forethought

ML Engineer

Mar 2023 — May 2024
San Francisco, CA (Remote)
  • Built document Q&A system (LangChain + OpenAI embeddings + Pinecone) across 5,000 support docs
  • Integrated GPT-3.5/4 for automated response drafting with human-in-the-loop feedback45% → 68% acceptance
  • Built automated eval pipeline comparing model outputs against human-labeled ground truth weekly
  • Dockerized ML services, deployed on AWS ECS with FastAPI serving layer

Freshworks

Associate Engineer → Software Engineer II

Jan 2020 — Dec 2022
India
  • Built RESTful APIs and backend services in Node.js and Go for CRM platform500K+ DAU, 99.95% uptime
  • Contributed to monolith → containerized microservices migration on AWS ECS
  • Authored Terraform IaC and GitHub Actions CI/CD pipelinesdeploy: 2h → 15min
  • Designed Kafka + Redis event pipeline for user behavior analytics2M+ events/day
  • Built first ML feature: intent classification (TF-IDF → fine-tuned BERT) for ticket auto-triage85% auto-routed

Technical Skills

System DesignMicroservicesREST APIsgRPCEvent-Driven ArchitectureCI/CD PipelinesTesting (Unit, Integration, E2E)Code ReviewDistributed SystemsMessage Queues
Deep LearningNLPNeural NetworksSupervised LearningContrastive LearningPyTorchscikit-learnHugging FaceLangChainLlamaIndexXGBoostspaCyONNXRAG PipelinesFine-tuning (LoRA/QLoRA)Embeddings (BGE, JobBERT, OpenAI)Triton Inference ServerBentoML
KubernetesDockerAWSGCPSageMakerECSLambdaS3TerraformAirflowKafkaRedisPostgreSQLPineconepgvectorFAISSChromaDBMLflowWeights & BiasesGrafana
PythonTypeScriptGoSQLBashNext.jsReactFastAPINode.jsWebSocket
M.S. Information Systems
Robert Morris University — Moon Township, PA
Jan 2023 — May 2024
B.Tech Electronics & Communication Engineering
HITAM — Hyderabad, India
Jul 2017 — May 2021

Let's talk

Interested in software engineering and ML roles — backend systems, retrieval pipelines, model serving, or systems design. Always happy to chat.