Ravi Kumawat

AI Engineer · Melbourne, Australia

I architect, develop, and deploy production-grade AI systems that solve real business problems — from intelligent interview platforms processing thousands of candidate sessions to clinical-grade DL inference tools deployed inside hospitals. I specialize in taking ideas from whiteboard to live, scalable cloud infrastructure. If you have a complex problem that needs a robust, automated software solution, I can build it end to end.

Ravi Kumawat

May 2026 — Melbourne, Australia.

A short version of a long story.

I am an AI Engineer specializing in architecting, developing, and deploying scalable software systems that solve complex business problems and drive end-to-end automation. My core strength lies in bridging the gap between cutting-edge AI research and production-ready engineering — turning prototypes into stable, high-performing systems that real users depend on daily.

In my recent role as a Software Engineer and Founding Engineer at evolvue.ai (KrishAI Technologies Private Limited), developed under the guidance of Krish Naik (YouTube) and Mayank Agarwal (YouTube), I built and shipped several high-impact systems from the ground up — an AI-powered automated interview platform, an intelligent Udemy query response system, and Xenodocs (a document intelligence tool).

My technical focus revolves around applied AI — specifically bringing Agentic AI, Large Language Models (LLMs), and Retrieval-Augmented Generation (RAG) into stable, high-performing production environments. I am highly driven by building practical, automated workflows and am actively open to freelance projects and collaborations where I can translate cutting-edge computational techniques into tangible business value.

Prior to my work in the enterprise tech space, I served as a Research Intern at the Indian Institute of Technology (IIT) Ropar, where I led the initial build of IntelliClinix — a clinical-workflow tool that helps doctors at PGIMER Chandigarh run inference from trained DL models on histopathology and radiology images, and re-annotate poor outputs to drive model fine-tuning.

On the academic front, I am currently pursuing a Master of Data Science at the University of Melbourne, after completing my B.S. in Data Science and Applications from the Indian Institute of Technology (IIT) Madras.

evolvue.ai — what I built.

Founding Engineer · KrishAI Technologies Pvt Ltd · Feb 2025 — Present

At evolvue.ai, I was responsible for building the core technical infrastructure of an AI-powered automated interview platform from scratch — a system that conducts real-time mock interviews with candidates using voice-based AI. This wasn't a chatbot wrapper; it was a full production system handling concurrent interview sessions with audio streaming, intelligent evaluation, and detailed feedback generation.

Core technical work I delivered:

  • LLM Integration & Prompt Engineering: Designed and implemented the complete LLM orchestration layer — structured prompt pipelines for question generation, answer evaluation, and feedback synthesis. Built custom evaluation rubrics that produce consistent, actionable interview feedback.
  • Speech-to-Text & Text-to-Speech Pipelines: Engineered real-time audio processing pipelines — STT for transcribing candidate responses with high accuracy, TTS for generating a natural interviewer voice. Optimized latency to maintain conversational flow during live sessions.
  • AWS Infrastructure (S3, ECS, ECR): Architected the complete cloud deployment — containerized services on ECS with images managed through ECR, audio and document storage on S3, and RDS for persistent data. Designed for horizontal scalability and cost-efficiency.
  • CI/CD Pipeline: Built continuous integration and deployment pipelines using GitHub Actions — automated testing, container builds, and zero-downtime deployments to ECS. Every push to main triggers a complete build-test-deploy cycle.
  • Load Testing with Locust: Wrote comprehensive load testing suites simulating hundreds of concurrent interview sessions. Identified and resolved bottlenecks in the audio pipeline and LLM response chain before they hit production.
  • Monitoring with Sentry: Integrated Sentry for real-time error tracking, performance monitoring, and alerting. Set up custom dashboards to track LLM response times, STT accuracy rates, and session completion metrics across all services.

The platform was built under the guidance of Krish Naik and Mayank Agarwal, both recognized leaders in India's AI and data science community. Working closely with them gave me deep exposure to building AI products that scale — not just technically, but as businesses.

PythonFastAPILangChainOpenAIAmazon BedrockAWS S3AWS ECSAWS ECRGitHub ActionsLocustSentryPostgreSQL

IntelliClinix — DL in the hospital workflow.

Research Intern · IIT Ropar (in collaboration with PGIMER, Chandigarh) · May 2024 — Jan 2025

IntelliClinix is the project I built at IIT Ropar — a research-focused effort developed in close collaboration with clinicians at PGIMER, Chandigarh, one of India's largest referral hospitals with a daily footfall of more than ten thousand patients. Our lab had, over time, developed several deep learning solutions for medical images — histopathology whole-slide images (WSI), fluorescence in-situ hybridization (FISH), and MR/CT scans of organs — and we wanted to pilot these solutions inside the clinical workflows of the partners with whom they were originally developed.

I did the initial end-to-end build of the application myself, and I now guide a team of current students remotely who are continuing development. The resulting software is generic for both 2D and 3D medical images, lets clinicians run inference from trained models inside their normal workflow, and — crucially — lets them flag, correct, and re-annotate poor model outputs so the corrected data flows straight back into a model fine-tuning loop.

What I built and shipped:

  • Local-server desktop application: Designed and built the application as a web app that runs on a local server inside the hospital — important because patient data cannot leave the clinical environment. This required deep coordination between web-app engineering and on-premise deployment constraints.
  • 2D & 3D image support: The viewer and inference pipeline are generic — the same app handles 2D histopathology WSIs and 3D MR/CT volumes, with consistent interactions for slicing, zooming, and overlaying model predictions.
  • MONAI inference engine: Integrated MONAI as the core medical imaging deep learning framework, wiring up lab-trained PyTorch models so clinicians can run inference on incoming studies with a single click.
  • nnU-Net pipeline integration: Plugged the nnU-Net training/inference framework into the workflow so segmentation models for organs and lesions can be retrained on corrected data without rebuilding infra from scratch.
  • CVAT annotation loop: Integrated CVAT as the annotation surface clinicians use to mark up poor predictions — those re-annotations feed directly into the fine-tuning dataset, closing the loop between deployment and model improvement.
  • 3D Slicer & ITK-SNAP interop: Built interop with 3D Slicer and ITK-SNAP so radiologists who already live inside those tools can pull cases, overlay AI predictions, edit, and push corrections back without leaving their existing environment.
  • Clinician feedback → model fine-tuning: Designed the data flow so that every clinician correction becomes a labelled training example. This turns day-to-day clinical use into a continuous, in-situ training signal for the lab's models.
  • Remote mentorship & team handoff: After the initial pilot, I transitioned into a mentorship role — I now guide the current student team on architecture, model integration, and clinical UX while they push the project forward.

The work was the result of several months of iteration, feedback, and changes driven by the clinicians themselves. It required significant expertise in both web application development and understanding clinical workflows — the kind of dual-track engineering where a misplaced UI control or a slow inference call directly impacts whether a radiologist will actually use the tool. Once fully rolled out, IntelliClinix is expected to help deploy several of our lab's DL solutions at scale and benefit thousands of patients every month at PGIMER Chandigarh.

PyTorchMONAInnU-NetCVAT3D SlicerITK-SNAPFlaskDICOMNIfTIDockerLocal Server Deploy

Other experience.

Teaching Assistant|Dec 2023 — Aug 2025

IIT Madras — Modern Application Development

Facilitated deep-dive technical sessions on Modern Application Development (CS2003P/CS2006P) for diverse student cohorts. Conducted rigorous code reviews focused on scalable MVC architectures, RESTful API design, and clean coding principles using Flask.

Consultant|Oct 2024

Biofield Power — DevOps & Cloud Consultant

Consulted leadership to transition rapid-prototype codebases into production-hardened, globally deployable assets. Architected dual-cloud deployment strategies across GCP & AWS with Nginx reverse proxies and automated CI/CD delivery pipelines.

What I can build for you.

I take projects from initial concept to production deployment. Whether you need an AI-powered product, a data pipeline, a cloud migration, or a full-stack web application — I handle the architecture, development, testing, deployment, and monitoring. Here's what I ship:

AI-Powered Products

Interview platforms, chatbots, recommendation engines, document processors. End-to-end LLM integration with structured outputs, RAG pipelines, and agentic workflows. Production-tested with real users.

Cloud Infrastructure & DevOps

AWS architecture (S3, RDS, EC2, ECS, Lambda, Bedrock), containerized deployments, CI/CD pipelines with GitHub Actions, load testing, monitoring with Sentry, and zero-downtime deployment strategies.

Full-Stack Web Applications

Next.js, React, Flask, Django. Scalable APIs, database design, authentication, background processing with Celery, caching with Redis. Mobile-responsive and production-grade.

Data Science & Computer Vision

PyTorch, TensorFlow, Scikit-learn. Deep learning model training, image classification and segmentation pipelines, statistical analysis, and data-driven decision systems.

Things I've shipped.

Project 01 · IIT Madras Coursework

GenAI Enterprise Learning Management System

A group project for the Software Engineering course at IIT Madras — an LMS infused with Generative AI that recommends personalized learning paths and answers learner questions directly from course content, grounded in the curriculum rather than hallucinated.

  • RAG conversational agent: Designed a Retrieval-Augmented Generation pipeline over course materials, PDFs, and internal docs, so a learner's question is answered from their own curriculum.
  • Local LLM with Llama 3.1 + Ollama: Ran Llama 3.1 locally via Ollama — no third-party calls and fast contextual querying on self-hosted hardware.
  • Tailored learning pathways: Built a recommendation layer that combines progress signals with topic embeddings to suggest the next module per learner, instead of a static curriculum.
  • React + Flask + MongoDB stack: React.js frontend for the learner experience, Flask REST API as the orchestration layer, and MongoDB for the course catalogue, embeddings, and progress state.
React.jsFlaskLlama 3.1OllamaRAGMongoDBVector Search

Project 02 · IIT Madras Coursework

Urban Waste Classification via Computer Vision

My Deep Learning for Computer Vision course project at IIT Madras — a computer vision pipeline for urban waste classification and segmentation across high-variance, real-world datasets, the kind of noisy, inconsistent imagery that breaks demo-grade models. Built from data ingest through to evaluation reports.

  • CNN architecture & training: Designed and trained convolutional networks in PyTorch for multi-class classification across waste categories (plastic, organic, e-waste, metals, paper, etc.).
  • Segmentation pipeline: Extended the system with a segmentation head so individual waste items can be isolated even when overlapping in cluttered street-level photos.
  • Iterative data-quality loop: Built a loop of error analysis → targeted relabelling → retraining that progressively lifted macro-accuracy and precision-recall on the hardest underrepresented categories.
  • OpenCV preprocessing: Wrote OpenCV preprocessing routines (lighting normalisation, augmentations, region proposals) tuned for the failure modes the model kept hitting in the wild.
PyTorchOpenCVCNNSegmentationNumPyMatplotlib

Project 03 · IIT Madras Coursework

High-Concurrency Ticket Booking System

My Modern Application Development course project at IIT Madras — a full-stack ticketing platform I built solo, then deployed live on AWS EC2. I shared it on LinkedIn, had real users try it for ~15 days, then tore the instance down. A complete build-deploy-validate-retire cycle on real infrastructure.

  • Flask + Vue.js architecture: Built the backend as focused Flask services behind a Vue.js frontend, separating the booking, payment, and notification flows.
  • Redis caching & locks: Used Redis to cache hot reads (seat maps, schedules) and back distributed locks during the reservation step, keeping database pressure low under concurrent demand.
  • Celery + Gmail SMTP: Moved emails, receipts, and audit logs onto Celery workers — booking confirmations went out via Gmail SMTP without blocking the user-visible path.
  • Live AWS EC2 deployment: Deployed the full stack on an AWS EC2 VM, opened it to real users for two weeks of hands-on testing, then decommissioned the instance after validating the system end to end.
  • Concurrency hardening: Added transactional seat-lock semantics so concurrent requests can never produce a double-booking.
FlaskVue.jsRedisCeleryGmail SMTPAWS EC2SQLite

Project 04 · UniMelb Coursework

Real-Time Social Media Sentiment Analysis — Solar Batteries & Subsidy Schemes (Australia)

A group project for the Cluster & Cloud Computing course at the University of Melbourne — a cloud-native system analyzing Australian public opinion on solar batteries and energy-subsidy policy. We were allocated 6 VMs (9 GB RAM each) on the Melbourne Research Cloud and built the entire stack on a self-provisioned Kubernetes cluster.

  • Kubernetes cluster on MRC: Provisioned and configured a Kubernetes cluster across the six research-cloud VMs as the foundation for every service in the pipeline.
  • Fission serverless harvesters: Built Mastodon and Bluesky data fetchers and deployed them as Fission timer functions running every 30 minutes to pull the latest posts.
  • Elasticsearch + Kibana: Streamed harvested posts into an Elasticsearch cluster for storage and search, with Kibana set up for exploration and dashboards.
  • LLM-based text processing: Deployed Gemma 2B locally on the cluster, and used build.nvidia.com and Groq APIs to run larger LLMs that extract sentiment, complaints, and structured signals from post text.
  • Analytics + government data: Built analytics over sentiment and complaints, and overlaid official government datasets to add geographical breakdowns of public opinion.
KubernetesFissionElasticsearchKibanaGemma 2BGroqNVIDIA NIMMelbourne Research Cloud

Where I studied.

Current|Feb 2025 — Present (Expected 2027)

Master of Data Science

University of Melbourne

Focused on advanced machine learning algorithms, scalable cloud computing, large-scale data engineering, and statistical learning theory.

Completed|Jan 2021 — May 2025

BS, Data Science & Applications

Indian Institute of Technology (IIT) Madras

Comprehensive specialization in mathematics, machine learning, big-data architecture, and software engineering. Built strong foundations in statistics, probability theory, and computational methods.

Skills & stack.

Languages
Python, JavaScript, TypeScript, SQL, C
AI / ML
LLMs, RAG, Agentic AI, LangChain, LangGraph, Amazon Bedrock, OpenAI, PyTorch, TensorFlow, Scikit-learn, MONAI, nnU-Net
Backend
Flask, Django, FastAPI, REST APIs, Celery, Redis, PostgreSQL, MongoDB
Frontend
Next.js, React, Vue.js, Tailwind CSS
Cloud & DevOps
AWS (S3, RDS, EC2, ECS, ECR, Lambda), GCP, Docker, Kubernetes, Nginx, GitHub Actions CI/CD
Medical Imaging
MONAI, nnU-Net, CVAT, 3D Slicer, ITK-SNAP, DICOM, NIfTI
Testing & Monitoring
Locust (load testing), Sentry (error tracking & performance), Pytest
Tools
Git, CVAT, Postman, Linux, VS Code

Recent essays.

Let's work together.

I'm actively open to freelance engagements, contract work, and collaborations. Whether you need a production AI system, a cloud migration, or a full-stack application built from scratch — I deliver end-to-end, on time, and built to last.

Currently based in Melbourne, available for remote work globally.