Sanket Deshpande
MS Computer Science, Georgia State University • 2023-2025
BS Computer Science, Vellore Institute of Technology • 2016-2020

Hi, I'm Sanket Deshpande

I build and ship AI products end-to-end — from LLMs and multimodal systems to distributed training and real-time inference. My edge: speed, ownership, and turning research into production.

Shipped AI Products: Cre8able (multimodal video-editing, 70% workload reduction) & Whisker (LLM pet-care assistant, sub-second streaming)
Applied ML Systems: 3D CNN-Transformer hybrids (+15% performance), multi-GPU training (7× faster epochs)
Multi-Agent & LLM Systems: Built debate frameworks with LLM-as-Judge evaluation, adversarial dialogue systems, and RAG-powered recommendation engines
Stack: PyTorch, Transformers, FastAPI, Docker, Postgres, AWS/GCP, real-time inference pipelines
Experience

Professional Journey

Shipping ML products, research, and embedded C++ across startups, research labs, and automotive

Founding ML Engineer

Digital Studio Labs • San Francisco, CA

Jul 2025 - Present
  • Cre8able: Built and launched a multimodal video-editing platform from idea to live beta in 6 weeks; cut creator editing workload by 70% using Gemini + FFmpeg on FastAPI/S3.
  • Whisker: Shipped an LLM pet-care assistant in 2 weeks with a containerized RAG backend (Postgres + FastAPI) and sub‑second response streaming; scaled to 50+ weekly active beta users.
  • Designed context-aware recommendations combining planning context with chat history, increasing acceptance of suggested edits by 25%.
FastAPI FFmpeg Postgres LLM

Machine Learning Researcher

TReNDS • Atlanta, GA

May 2024 - Aug 2025
  • Built 3D ML pipelines for cognition prediction on a 12k+ subject MRI dataset, turning neuroimaging into predictive insights.
  • Designed a custom 3D CNN with Multi‑Head Self‑Attention + Squeeze‑and‑Excitation, achieving 0.34 correlation (+15% vs. CNN baselines).
  • Optimized distributed training with PyTorch DDP on a 4× GPU HPC cluster (SLURM), achieving 7× faster epochs with robust logging/checkpointing.
  • Published at IEEE ISBI 2025 and EMBS 2025, strengthening explainability of cerebellum‑related cognition models.
PyTorch DDP SLURM Neuroimaging

Computer Vision Engineer

MORSE Studio • Atlanta, GA

Aug 2023 - Apr 2024
  • Designed a mmWave radar perception pipeline using Range/Doppler FFT analysis to characterize material signatures; released open‑source experiments.
  • Built a C‑based UDP socket tool for TI mmWave radar (IWR6843 + DCA1000) enabling faster capture and multithreaded stream handling.
  • Prototyped event‑driven imaging with Luxonis neuromorphic sensors; decoded high‑frequency LED signals using Lomb‑Scargle and Python concurrency.
mmWave C/C++ Signal Processing

C++ Engineer I

Harman International • Bangalore, India

Oct 2020 - Jun 2023
  • Delivered production‑grade C++ navigation algorithms for premium OEMs (Mercedes, BMW, Audi), impacting real‑time routing at scale.
  • Engineered optimized A* navigation (C++14), resolving 350+ critical bugs and delivering 10+ major features under Agile/Scrum.
  • Reverse‑engineered map APIs to classify ambiguous road links; improved routing accuracy in dense urban networks.
  • Implemented cyclic node elimination to reduce search frontier and ECU usage; applied RAII, multithreading, and smart pointers.
C++14 Algorithms Embedded

Computer Vision Intern

KPIT • Pune, India

May 2018 - Jul 2018
  • Built a real‑time sensor‑fusion pipeline (Raspberry Pi, camera, Arduino, CAN) to stream multi‑modal data for CV training, saving 5 hrs/week.
  • Developed an offline tracking tool using YOLOv2 + KCF/Kalman, eliminating manual labeling and saving 2 hrs per dataset.
OpenCV YOLOv2 Python
Projects

Featured Work

A selection of projects that showcase my technical expertise

Cre8able — Multimodal Video Editing

Built and launched from idea to live beta in 6 weeks. Cut creator editing workload by 70% with Gemini + FFmpeg on FastAPI/S3. Context-aware recommendations improved edit acceptance by 25%.

Gemini FFmpeg FastAPI S3

Whisker — LLM Pet-Care Assistant

Shipped in 2 weeks. Containerized RAG backend (Postgres + FastAPI) with sub-second response streaming. Scaled to 50+ weekly active beta users.

RAG LLM FastAPI Postgres

Multi-Agent Debate for Movie Consensus

Multi-agent framework with LLM agents engaging in adversarial dialogue to refine reasoning. LLM-as-Judge evaluation improved coherence over single-agent baselines. RAG with ChromaDB + Gemini 1.5 Flash for personalized recommendations.

LLM RAG ChromaDB Gemini

Visual Py-SLAM Toolkit

End-to-end modular SLAM pipeline from raw video to trajectory visualization. Implemented ORB feature detection/matching, essential matrix decomposition, RANSAC outlier rejection, and scale initialization for robust camera trajectory estimation.

SLAM Computer Vision Python ORB

Statistical ML for Fronto-Cerebellar Circuitry

Preprocessed sMRI/dMRI from the ABCD dataset (N=10K) using SPM12, DARTEL, and FSL. Built Bayesian Ridge, SVR, and neural network models for cognitive prediction. Achieved r² of 0.065 ± 0.009 with nested 5x5 cross-validation, outperforming fronto-parietal models.

Neuroimaging ML Bayesian SPM12

Detection of Milli-sized Objects with mmWave Radar

Detected millimeter-sized objects with a 60 GHz IWR6843ISK FMCW mmWave radar, including penny-sized objects at a distance of 2 m. Analyzed Range/Doppler FFTs across 6 materials and 5 distances using MATLAB Radar Toolbox and Python.

mmWave Radar MATLAB Signal Processing

Pose-Assisted TrackFormer

Enhanced TrackFormer (TUM/Facebook) by integrating pose estimation for improved identity consistency. Developed a Pose Encoding Module that uses Keypoint R-CNN to extract 17 x 3 pose features, which are fused with track embeddings via a learnable linear layer.

Object Tracking Transformer PyTorch MOT
Publications

Articles & Research

Technical articles, blog posts, and research contributions

Fronto‑Thalamo‑Cerebellar Circuitry in Predicting Cognition and Behavior of ABCD Adolescents

2025

IEEE ISBI 2025 (1‑page abstract), EMBS 2025 (4‑page paper)

Validated the cerebellum’s role in cognition and advanced explainability for neuroimaging‑based prediction models.

Skills

Technical Expertise

Technologies and tools I work with regularly

ML / AI

PyTorch
Transformers
CLIP
Multimodal Pipelines
Computer Vision
Scikit‑learn
CUDA

LLMs & RAG

Hugging Face
LangChain
Pinecone / FAISS
Custom RAG Pipelines
Streaming APIs
Evaluation

Production

FastAPI
Docker
Postgres
Async/Streaming APIs
Git

Scaling & Infra

DDP
SLURM
AWS / GCP
Multi‑GPU Training
Real‑time Inference
Certifications

Professional Certifications

Industry-recognized certifications and credentials

AWS Certified Solutions Architect

Amazon Web Services

Issued: March 2023 • Valid until: March 2026

Google Cloud Professional Developer

Google Cloud

Issued: January 2023 • Valid until: January 2025

Certified Kubernetes Administrator (CKA)

Cloud Native Computing Foundation

Issued: November 2022 • Valid until: November 2025

Let's Connect

Feel free to reach out through any of these channels