Vikram Guntuka

About

My Journey

From engineering fundamentals to production AI systems — here's how my career has evolved.

2024 – Present

AI Engineer at myOnsite Healthcare

Designing production-grade RAG pipelines, agentic AI systems, and HIPAA-compliant AI workflows for enterprise healthcare.

2023 – 2024

AI/ML Engineer — Agentic & RAG Systems

Built multi-agent architectures, LLM fine-tuning pipelines, and enterprise knowledge assistants using LangChain, LangGraph, and vector databases.

2021 – 2025

B.Tech CSE — Parul University

Graduated with a focus on Machine Learning, Deep Learning, and Natural Language Processing.

Skills

Technical Expertise

A comprehensive toolkit across the AI stack — from model fine-tuning to production deployment.

Programming

PythonTypeScriptSQLBash

Machine Learning

ClassificationRegressionClusteringFeature EngineeringScikit-learnTensorFlowNeural NetworksBERT

Generative AI

LLMsPrompt EngineeringRAGGraphRAGAgentic AIMulti-Agent SystemsLangChainLangGraphLlamaIndexFine-Tuning (LoRA/QLoRA/PEFT)

LLMs

Llama 2/3MistralClaudeOpenAI GPTGeminiDeepSeekGLMQwenMixtralPhi

Vector Databases

pgvectorFAISSPineconeElasticsearch

Backend & APIs

FastAPIFlaskREST APIsActiveMQApache Kafka

Cloud & Infra

AWS EC2DockerLinuxNginxGit/GitLab

DevOps

DockerCI/CDELK StackMicroservicesMonitoring

Automation

n8nAirflowPython Scripting

Compliance

HIPAAPHIAI GovernanceResponsible AI

Experience

Where I've Worked

Building production-grade AI systems for enterprise healthcare and beyond.

Projects

Featured Work

Production-grade AI systems built for enterprise — from RAG pipelines to multi-agent platforms.

Problem

Enterprise HR teams lacked a unified AI-powered platform to automate the complete recruitment lifecycle from job creation to employee onboarding.

Solution

Designed a multi-tenant SaaS platform with AI-powered ATS, resume parsing, candidate ranking, and an intelligent interview engine with adaptive question generation.

Architecture

FastAPI microservices + LangChain orchestration + PostgreSQL multi-tenant DB + React frontend. n8n handles workflow automation. Claude and OpenAI power LLM features.

Impact

Automated 80% of recruitment screening workflows, reducing time-to-hire by 60% for enterprise clients.

Features

AI-powered Applicant Tracking System

Resume Parsing & Candidate Ranking

LLM-based Resume Screening

AI Interview Engine with Voice & Text Evaluation

Employee Learning Platform

Career Portal Builder

RBAC & Multi-Tenant Architecture

HR Analytics Dashboard

Tech Stack

PythonFastAPILangChainPostgreSQLDockerReactn8nClaudeOpenAI

Source Live Demo

Problem

Traditional ATS platforms required extensive manual effort for screening, scheduling, and candidate follow-up.

Solution

Built an Agentic AI recruitment platform where intelligent AI agents autonomously manage the entire ATS workflow.

Architecture

LangGraph-based multi-agent architecture with LangChain orchestration, FastAPI backend, PostgreSQL, and Docker deployment.

Impact

Reduced recruiter workload by 70% through full automation of screening, scheduling, and follow-up workflows.

Features

AI Resume Screening & Candidate Ranking

Job Matching & Resume Parsing

AI Recruiter Assistant

Automated Interview Scheduling

Candidate Follow-up Automation

Email Automation & Recruiter Dashboard

AI Decision Support

Recruitment Analytics

Tech Stack

LangGraphLangChainFastAPIPostgreSQLDockerOpenAIClauden8n

Source Live Demo

Problem

Healthcare professionals needed instant access to domain-specific knowledge buried across thousands of clinical documents.

Solution

Built a production-grade RAG platform with intelligent document ingestion, hybrid search, and LLM-powered question answering.

Architecture

LangChain + Elasticsearch + pgvector for hybrid retrieval. FastAPI microservices with Docker deployment. Llama 3 for inference.

Impact

Reduced clinical document search time by 85%, enabling instant knowledge retrieval across 50K+ documents.

Features

PDF & DOCX Ingestion

Intelligent Chunking & Embedding Generation

Semantic Search & Hybrid Retrieval

Top-K Retrieval & Vector Search

Context Engineering & AI Question Answering

Tech Stack

LangChainElasticsearchpgvectorFastAPIDockerLlama 3

Source Live Demo

Problem

Off-the-shelf LLMs lacked domain-specific accuracy for specialized healthcare and HR use cases.

Solution

Fine-tuned Llama 2, Llama 3, and Mistral using LoRA, QLoRA, and PEFT for domain-specific performance.

Architecture

Hugging Face Transformers + BitsAndBytes quantization + LoRA/QLoRA adapters. GPU-optimized training pipeline.

Impact

Achieved 15-25% improvement in domain-specific task accuracy compared to base models.

Features

Domain-specific Fine-Tuning

Parameter-Efficient Training (PEFT)

4-bit & 8-bit Quantization

GPU Memory Optimization

Tech Stack

Hugging Face TransformersBitsAndBytesLoRAQLoRAPEFT

Source Live Demo

Problem

Enterprise document processing required manual extraction and classification, leading to slow turnaround and errors.

Solution

Built an end-to-end document intelligence solution combining OCR, NLP, and semantic search for automated document understanding.

Architecture

OpenCV + Tesseract for OCR, LangChain for document parsing, pgvector for semantic search, FastAPI backend.

Impact

Processed 10K+ documents monthly with 95% extraction accuracy, reducing manual effort by 90%.

Features

OCR & Document Parsing

Information Extraction

Semantic Search

AI Document Chat

Tech Stack

OpenCVTesseractLangChainpgvectorFastAPI

Source Live Demo

Problem

Logistics operations lacked real-time ETA prediction integrated with visual intelligence for package handling.

Solution

Developed a multi-agent architecture combining ETA prediction with Llama 3.2 Vision for multimodal intelligence.

Architecture

LangGraph multi-agent system with Llama 3.2 Vision for image understanding. FastAPI + Docker deployment.

Impact

Improved ETA prediction accuracy by 30% through multi-modal context integration.

Features

Multi-Agent Architecture

Vision Question Answering

Image Captioning

Structured Data Extraction

Intelligent Reasoning

Tech Stack

LangGraphLangChainLlama 3.2 VisionFastAPIDocker

Source Live Demo

Problem

Database teams spent hours manually optimizing slow SQL queries in production analytics workloads.

Solution

Built an AI-assisted SQL optimization platform that analyzes, rewrites, and optimizes queries automatically.

Architecture

LangChain + OpenAI for query analysis, Python for SQL parsing, n8n for workflow automation and feedback loops.

Impact

Reduced average query execution time by 40% across analytics workloads.

Features

AI Query Rewriting & SQL Optimization

Performance Analysis

Workflow Automation

Intelligent Feedback Loop

Tech Stack

LangChainOpenAIPythonn8n

Source Live Demo

Problem

Social media platforms needed automated content moderation to detect and filter offensive language at scale.

Solution

Fine-tuned a BERT-based NLP model for multi-class offensive language detection with automated filtering.

Architecture

Hugging Face Transformers + BERT fine-tuning pipeline. Flask API for real-time inference.

Impact

Achieved 94% F1-score on offensive language detection, deployed in production moderation pipelines.

Features

Offensive Language Detection

Content Moderation

Automatic Filtering

Dashboard Reporting

Tech Stack

Hugging Face TransformersBERTScikit-learnFlask

Source Live Demo

Certifications

Credentials

Deep Learning Specialization

DeepLearning.AI

LangChain for LLM Application Development

DeepLearning.AI

AWS Cloud Practitioner Essentials

Amazon Web Services

Blog

Writing

Thoughts on AI engineering, RAG systems, LLM fine-tuning, and production best practices.

Jan 20258 min read

Get in Touch

Have a project in mind or just want to chat about AI? I'd love to hear from you.

Email

vikramnetha27@gmail.com

Phone

+91 87121 63880

Location

Ahmedabad, Gujarat, India

My Journey

AI Engineer at myOnsite Healthcare

AI/ML Engineer — Agentic & RAG Systems

B.Tech CSE — Parul University

Technical Expertise

Programming

Machine Learning

Generative AI

LLMs

Vector Databases

Backend & APIs

Cloud & Infra

DevOps

Automation

Compliance

Where I've Worked

AI Engineer

AI/ML Engineer

Featured Work

AI-Powered End-to-End HR SaaS Platform

Problem

Solution

Architecture

Impact

Features

Tech Stack

Agentic AI ATS Platform

Problem

Solution

Architecture

Impact

Features

Tech Stack

Enterprise RAG Knowledge Assistant

Problem

Solution

Architecture

Impact

Features

Tech Stack

LLM Fine-Tuning Platform

Problem

Solution

Architecture

Impact

Features

Tech Stack

AI Document Intelligence Platform

Problem

Solution

Architecture

Impact

Features

Tech Stack

Multi-Agent ETA Prediction & Vision Intelligence

Problem

Solution

Architecture

Impact

Features

Tech Stack

AI-Powered Query Optimization Platform

Problem

Solution

Architecture

Impact

Features

Tech Stack

AI-Powered Profanity Detection

Problem

Solution

Architecture

Impact

Features

Tech Stack

Credentials

Deep Learning Specialization

LangChain for LLM Application Development

AWS Cloud Practitioner Essentials