Vikram.G
Available for opportunities

Vikram Guntuka

Building production-grade AI systems with agentic AI, RAG, and LLM fine-tuning.

1.5+

Years Experience

10+

AI Projects

5+

Production Systems

100+

Automation Workflows

Vikram Guntuka

About

My Journey

From engineering fundamentals to production AI systems — here's how my career has evolved.

2024 – Present

AI Engineer at myOnsite Healthcare

Designing production-grade RAG pipelines, agentic AI systems, and HIPAA-compliant AI workflows for enterprise healthcare.

2023 – 2024

AI/ML Engineer — Agentic & RAG Systems

Built multi-agent architectures, LLM fine-tuning pipelines, and enterprise knowledge assistants using LangChain, LangGraph, and vector databases.

2021 – 2025

B.Tech CSE — Parul University

Graduated with a focus on Machine Learning, Deep Learning, and Natural Language Processing.

Skills

Technical Expertise

A comprehensive toolkit across the AI stack — from model fine-tuning to production deployment.

Programming

PythonTypeScriptSQLBash

Machine Learning

ClassificationRegressionClusteringFeature EngineeringScikit-learnTensorFlowNeural NetworksBERT

Generative AI

LLMsPrompt EngineeringRAGGraphRAGAgentic AIMulti-Agent SystemsLangChainLangGraphLlamaIndexFine-Tuning (LoRA/QLoRA/PEFT)

LLMs

Llama 2/3MistralClaudeOpenAI GPTGeminiDeepSeekGLMQwenMixtralPhi

Vector Databases

pgvectorFAISSPineconeElasticsearch

Backend & APIs

FastAPIFlaskREST APIsActiveMQApache Kafka

Cloud & Infra

AWS EC2DockerLinuxNginxGit/GitLab

DevOps

DockerCI/CDELK StackMicroservicesMonitoring

Automation

n8nAirflowPython Scripting

Compliance

HIPAAPHIAI GovernanceResponsible AI

Experience

Where I've Worked

Building production-grade AI systems for enterprise healthcare and beyond.

Projects

Featured Work

Production-grade AI systems built for enterprise — from RAG pipelines to multi-agent platforms.

Problem

Enterprise HR teams lacked a unified AI-powered platform to automate the complete recruitment lifecycle from job creation to employee onboarding.

Solution

Designed a multi-tenant SaaS platform with AI-powered ATS, resume parsing, candidate ranking, and an intelligent interview engine with adaptive question generation.

Architecture

FastAPI microservices + LangChain orchestration + PostgreSQL multi-tenant DB + React frontend. n8n handles workflow automation. Claude and OpenAI power LLM features.

Impact

Automated 80% of recruitment screening workflows, reducing time-to-hire by 60% for enterprise clients.

Features

AI-powered Applicant Tracking System
Resume Parsing & Candidate Ranking
LLM-based Resume Screening
AI Interview Engine with Voice & Text Evaluation
Employee Learning Platform
Career Portal Builder
RBAC & Multi-Tenant Architecture
HR Analytics Dashboard

Tech Stack

PythonFastAPILangChainPostgreSQLDockerReactn8nClaudeOpenAI

Problem

Traditional ATS platforms required extensive manual effort for screening, scheduling, and candidate follow-up.

Solution

Built an Agentic AI recruitment platform where intelligent AI agents autonomously manage the entire ATS workflow.

Architecture

LangGraph-based multi-agent architecture with LangChain orchestration, FastAPI backend, PostgreSQL, and Docker deployment.

Impact

Reduced recruiter workload by 70% through full automation of screening, scheduling, and follow-up workflows.

Features

AI Resume Screening & Candidate Ranking
Job Matching & Resume Parsing
AI Recruiter Assistant
Automated Interview Scheduling
Candidate Follow-up Automation
Email Automation & Recruiter Dashboard
AI Decision Support
Recruitment Analytics

Tech Stack

LangGraphLangChainFastAPIPostgreSQLDockerOpenAIClauden8n

Problem

Healthcare professionals needed instant access to domain-specific knowledge buried across thousands of clinical documents.

Solution

Built a production-grade RAG platform with intelligent document ingestion, hybrid search, and LLM-powered question answering.

Architecture

LangChain + Elasticsearch + pgvector for hybrid retrieval. FastAPI microservices with Docker deployment. Llama 3 for inference.

Impact

Reduced clinical document search time by 85%, enabling instant knowledge retrieval across 50K+ documents.

Features

PDF & DOCX Ingestion
Intelligent Chunking & Embedding Generation
Semantic Search & Hybrid Retrieval
Top-K Retrieval & Vector Search
Context Engineering & AI Question Answering

Tech Stack

LangChainElasticsearchpgvectorFastAPIDockerLlama 3

Problem

Off-the-shelf LLMs lacked domain-specific accuracy for specialized healthcare and HR use cases.

Solution

Fine-tuned Llama 2, Llama 3, and Mistral using LoRA, QLoRA, and PEFT for domain-specific performance.

Architecture

Hugging Face Transformers + BitsAndBytes quantization + LoRA/QLoRA adapters. GPU-optimized training pipeline.

Impact

Achieved 15-25% improvement in domain-specific task accuracy compared to base models.

Features

Domain-specific Fine-Tuning
Parameter-Efficient Training (PEFT)
4-bit & 8-bit Quantization
GPU Memory Optimization

Tech Stack

Hugging Face TransformersBitsAndBytesLoRAQLoRAPEFT

Problem

Enterprise document processing required manual extraction and classification, leading to slow turnaround and errors.

Solution

Built an end-to-end document intelligence solution combining OCR, NLP, and semantic search for automated document understanding.

Architecture

OpenCV + Tesseract for OCR, LangChain for document parsing, pgvector for semantic search, FastAPI backend.

Impact

Processed 10K+ documents monthly with 95% extraction accuracy, reducing manual effort by 90%.

Features

OCR & Document Parsing
Information Extraction
Semantic Search
AI Document Chat

Tech Stack

OpenCVTesseractLangChainpgvectorFastAPI

Problem

Logistics operations lacked real-time ETA prediction integrated with visual intelligence for package handling.

Solution

Developed a multi-agent architecture combining ETA prediction with Llama 3.2 Vision for multimodal intelligence.

Architecture

LangGraph multi-agent system with Llama 3.2 Vision for image understanding. FastAPI + Docker deployment.

Impact

Improved ETA prediction accuracy by 30% through multi-modal context integration.

Features

Multi-Agent Architecture
Vision Question Answering
Image Captioning
Structured Data Extraction
Intelligent Reasoning

Tech Stack

LangGraphLangChainLlama 3.2 VisionFastAPIDocker

Problem

Database teams spent hours manually optimizing slow SQL queries in production analytics workloads.

Solution

Built an AI-assisted SQL optimization platform that analyzes, rewrites, and optimizes queries automatically.

Architecture

LangChain + OpenAI for query analysis, Python for SQL parsing, n8n for workflow automation and feedback loops.

Impact

Reduced average query execution time by 40% across analytics workloads.

Features

AI Query Rewriting & SQL Optimization
Performance Analysis
Workflow Automation
Intelligent Feedback Loop

Tech Stack

LangChainOpenAIPythonn8n

Problem

Social media platforms needed automated content moderation to detect and filter offensive language at scale.

Solution

Fine-tuned a BERT-based NLP model for multi-class offensive language detection with automated filtering.

Architecture

Hugging Face Transformers + BERT fine-tuning pipeline. Flask API for real-time inference.

Impact

Achieved 94% F1-score on offensive language detection, deployed in production moderation pipelines.

Features

Offensive Language Detection
Content Moderation
Automatic Filtering
Dashboard Reporting

Tech Stack

Hugging Face TransformersBERTScikit-learnFlask

Certifications

Credentials

Deep Learning Specialization

DeepLearning.AI

LangChain for LLM Application Development

DeepLearning.AI

AWS Cloud Practitioner Essentials

Amazon Web Services

Blog

Writing

Thoughts on AI engineering, RAG systems, LLM fine-tuning, and production best practices.

Contact

Get in Touch

Have a project in mind or just want to chat about AI? I'd love to hear from you.

Phone

+91 87121 63880

Location

Ahmedabad, Gujarat, India