Lead Data Scientist · HP Inc.

Building production-grade AI for the world's largest enterprises.

I'm Kundan Singh Sorout — a Lead Data Scientist and Gen AI engineer with 11 years of engineering experience and 7 years deep in Data Science. I build multi-agent systems, RAG pipelines, diffusion models and MLOps platforms that actually ship — not demos.

11yrs
total experience
7yrs
data science
5
engineers led
M.Tech
AI/ML · NIT Warangal
About

A decade of shipping ML across healthcare, retail, manufacturing and ecommerce.

Kundan Singh Sorout

Lead Data Scientist · HP Inc. · Gurugram

I started as a PHP developer in 2015 and progressively moved deeper into the ML stack — from web scrapers and Flask microservices, to CNN-based image categorisation, to chatbots, to today's frontier of multi-agent LLM systems. Every layer left a fingerprint on how I build.

Today I lead a team of 5 engineers at HP Inc. building NEO, a multi-agent AI chatbot that sits between Sales & Marketing teams and HP's data warehouses. I'm a TensorFlow- and PyTorch-certified deep learning practitioner with an M.Tech in AI/ML from NIT Warangal.

0+
years of engineering
0+
years in data science
0
companies shipped at
3×
cloud platforms (GCP/AWS/Azure)
Career

Eleven years. Six companies. One direction — deeper into the stack.

  1. Jan 2024 — Present Current

    Lead Data Scientist

    HP Inc. · Gurugram

    Leading a team of 5 engineers building NEO — a multi-agent AI chatbot that surfaces Sales & Marketing insights from internal data warehouses. Also shipping the External Signals Multi-Agent Sales Intelligence Platform that reads B2B client signals from D&B Hoovers and turns them into actionable recommendations for the field.

    • LangChain
    • Qwen3
    • Ollama
    • ChromaDB
    • Streamlit
    • Selenium
  2. Jan 2022 — Feb 2024

    Senior Data Scientist

    R-Systems (a Blackstone company) · Noida

    Built an Avatar Video Chatbot end-to-end — Stable Diffusion XL + GFP-GAN for face generation, BARK for voice, and Llama 2 + LangChain + PEFT/LoRA for the conversational core. Shipped SkullCandy's Tunein voice assistant on the Skull-IQ device (Rasa NLU + Dialogflow + Flask + Docker on GKE). Designed a custom Diffusion image-generation pipeline (UNet2D + DDPM) and led a Dataiku CI/CD migration that lifted team productivity by 20%.

    • Stable Diffusion
    • Llama 2
    • LoRA
    • Rasa
    • GKE
    • Dataiku
  3. Mar 2020 — Jan 2022

    Data Scientist

    Big Oh Tech / Craterzone · Noida

    Owned product categorisation models for an ecommerce catalog — hybrid CNN + RNN and VGG16 + BiLSTM architectures on millions of SKUs. Trained YOLOv5 detectors for fine-grained product part detection on manufacturing floors.

    • VGG16
    • BiLSTM
    • YOLOv5
    • PyTorch
  4. Jan 2019 — Mar 2020

    Python & Data Science Engineer

    Clavax Technologies · Gurgaon

    Built Flask microservices, a Rasa-powered chatbot integrated with Telegram, and an IMAP-based email-text classification system that triaged inbound mail at scale.

    • Flask
    • Rasa
    • NLP
    • IMAP
  5. Jun 2015 — Dec 2018

    PHP & Python Developer

    PhpYouth · Delhi NCR

    Where it started. Django REST APIs, web scraping with BeautifulSoup & Selenium, and PHP backends — the foundational engineering muscle that still informs how I build ML systems today.

    • Django
    • REST
    • PHP
    • BeautifulSoup
    • Selenium
Education

M.Tech, Artificial Intelligence & Machine Learning

NIT Warangal · Sep 2019 — Aug 2021

Education

B.Tech, Computer Science

MDU Rohtak · Jul 2011 — Jun 2015

Certified

TensorFlow & PyTorch

Deep Learning practitioner

Selected projects

What I've actually shipped — not slides, not POCs.

A short list of systems running in production today, or that ran in production at the time. Quietly, reliably, in front of real users.

HP Inc.

External Signals Platform

A B2B sales-intelligence platform that pulls D&B Hoovers signals on enterprise accounts, vectorises them, and serves a multi-agent reasoning layer that proposes outreach plays.

  • Streamlit
  • Selenium
  • Qwen3-Embedding
  • ChromaDB
  • Ollama
R-Systems / Blackstone

Avatar Video Chatbot

A talking-head conversational avatar — Stable Diffusion XL for visuals, GFP-GAN for face restoration, BARK for voice, Llama 2 + LangChain for dialogue, PEFT/LoRA for personality fine-tunes.

  • SDXL
  • GFP-GAN
  • BARK
  • Llama 2
  • LoRA
SkullCandy

Tunein on Skull-IQ

Voice assistant deployed on SkullCandy's Skull-IQ headphone platform. Rasa NLU + Dialogflow handled intents, Flask served inference, GKE handled scale.

  • Rasa NLU
  • Dialogflow
  • Flask
  • Docker
  • GKE
Computer Vision

Custom Diffusion Image Generation

A from-scratch UNet2D + DDPM diffusion model trained on a custom domain dataset — built to internalise how diffusion really works rather than relying on a hosted API.

  • UNet2D
  • DDPM
  • PyTorch
IoT · Azure ML

Smart Refrigerator Vision

An on-device CV pipeline for a connected refrigerator — inventory tracking, freshness estimation, and anomaly alerts, deployed on Azure ML.

  • Azure ML
  • CNN
  • IoT
Toolkit

The full stack — from raw data to production agents.

Machine Learning

  • Linear & Logistic Regression
  • Random Forest, XGBoost
  • KNN, SVM, K-Means
  • PCA & dimensionality reduction
  • Time-series forecasting

Deep Learning

  • CNN, RNN, LSTM, GRU
  • Transfer Learning
  • YOLO, R-CNN, VGG16
  • BERT & Transformers
  • Diffusion Models (UNet2D, DDPM)

Gen AI & LLMs

  • Multi-Agent Systems
  • RAG & Vector DBs
  • LangChain & orchestration
  • Fine-tuning (PEFT / LoRA)
  • Local inference (Ollama, vLLM)

Languages & Data

  • Python, PHP
  • MySQL, PostgreSQL, MariaDB
  • MongoDB, BigQuery
  • Kafka streaming
  • Pandas, NumPy, scikit-learn

Cloud & MLOps

  • GCP, AWS, Azure
  • S3, Redshift
  • Docker, GKE
  • Dataiku CI/CD
  • Azure ML

NLP & Voice

  • Rasa NLU, Dialogflow
  • Text classification
  • Speech synthesis (BARK)
  • Prompt engineering & evals
  • Embeddings & retrieval
Code

Open work on GitHub.

A few of the public repos I keep on github.com/kundan121 — a mix of original projects, learning experiments, and forks I'm actively studying.

Plus active forks I'm reading from — NeMo-Agent-Toolkit (NVIDIA), open-notebook (open-source NotebookLM), MiroFish (swarm intelligence), neuralforecast, and more on the profile.

Also — I teach

I run a small 1-on-1 mentorship
for serious students.

Three private classes a week. Industry-grade Data Science, LLMs, Multi-Agent Systems and Quant — taught the way I'd have wanted to learn it. This isn't about the money — it's about the kick of changing one career at a time.

Visit the mentorship page
₹10,000 /student /month (Duo)
3 classes / week
1 : 1 or batch of 2
Get in touch

Have a project, a role, or a question?

I read every message myself. The fastest way to reach me is email or LinkedIn — I usually reply within 48 hours.