Welcome to my portfolio

Hi, I’m
Sushanth!

AI/ML Engineer
📍 Dallas, Texas & Brooklyn, New York
🪼
CS & ML since 2017
🐡
Messy Data → Real Systems
🐬
Shipped ML Systems for Impact
🪸
Data Storytelling

I focus on building AI-powered systems and ML pipelines at scale.

I've architected recommendation engines for 30,000+ retail stores, optimized supply chains for Coca-Cola bottlers, engineered edge-optimized LLMs at Carnegie Mellon, And now, I advance business-critical analytical workflows using RAG systems.

I’ve dedicated my career to integrating AI into the product lifecycle and building the foundation that allows models to scale, adapt, and drive measurable business impact.

At my core, I enjoy building solutions that people rely on every day. If you’re working on something ambitious where my experience could help, I’d be glad to connect!

Machine Learning & Deep Learning
Large Language Models
MLOps & Model Deployment
Time Series & Forecasting
A/B Testing & Causal Inference
Data Storytelling & Visualization

Featured Projects

blah blah blah

LLM Inference Optimization using Speculative Decoding

Graduate AI Research Showcase | CMU

Optimized inference pipelines for LLaMA2 and GPT-2 using speculative decoding, pruning, and quantization in PyTorch to reduce latency and memory usage. Built modular benchmarking workflows to evaluate accuracy–performance tradeoffs across CPU and GPU environments, targeting real-time and edge deployment scenarios.

PyTorchLarge Language ModelsHugging FaceArtificial Intelligence
Latency1.2x speedup with significant memory gains
Accuracy94% tokens retained

Real-Time Traffic Forecasting for Efficient Demand Routing

Operationalizing AI Project | CMU

Built an end-to-end ML system for traffic resource planning using near real-time data, focusing on reliable production use with MLOps. Engineered data pipelines, model evaluation, monitoring, and feedback loops required to sustain performance under changing real-world conditions. The resulting system maintained sub-5% forecast error while enabling continuous validation to support operational decision-making.

PythonMLOpsDockerMLFlow
Reliability99.9% Up-time
Error (MAPE)<5%

Driver Safety System for Real-Time Hazard Detection

AI Capstone Project | CMU x TCS

Designed a lightweight computer vision–based hazard detection system using deep learning in PyTorch, improving detection accuracy by 18% in critical regions. Optimized end-to-end inference and cloud-edge integration to achieve latency reduction, enabling deployment on edge devices for autonomous driving applications

PythonPyTorchNeural NetworksComputer Vision
Latency↓22%
Accuracy↑<18% in critical regions

Loan Default Prediction & Return Optimization

ML Project | CMU

Developed an end-to-end machine learning pipeline to evaluate consumer loan risk and optimize investment returns using historical LendingClub data. The project combined default prediction with return forecasting to move beyond binary risk assessment and toward decision-aware lending strategies. Using ensemble models and neural networks, the system achieved strong predictive performance (AUC 0.91) and informed a return-based investment strategy that increased profitability by 18% while reducing exposure to high-risk loans.

PythonEnsemble ModelsNeural Networks
Throughput10K/min
Latency<100ms

Bookwormy (In Progress)

A novel way for kids to rediscover reading

Building an AI-powered literacy platform that uses Large Language Models to turn reading into an interactive, gamified experience. Currently architecting the multi-tenant backend and implementing retrieval-augmented generation (RAG) to provide safe, age-appropriate content dynamically.

Generative AIVectorDBRAG SystemsEducation Technology
StatusData Enrichment & Research Phase
ArchitectureRAG-based

Experience

Industry, research, and applied ML work

Applied ML Engineer

Bluebird Technologies
Dallas, TX & Brooklyn, NY2025 - Present
  • Built a RAG pipeline with document ingestion, chunking, embeddings, vector retrieval, and LLM inference, optimized for low-latency and retrieval relevance in decision-support workflows.
  • Defined and monitored retrieval and generation quality (recall@k, context relevance, response stability), with safeguards for hallucination and stale context as data evolved.
  • Developed pipelines processing 10M+ events/month, transforming transactional and behavioral data into model-ready features.
  • Implemented automated monitoring for data quality, drift, and forecast performance (MAPE, bias, stability) with alerting for production regressions.
RAGMLMLOpsWorkflow Optimization

Analyst Intern

Bluebird Technologies
Dallas, TX & Brooklyn, NY2025 - Present
  • Developed baseline time-series and regression models to quantify seasonality, trend, and variance, forming the reference layer for downstream anomaly detection.
  • Designed diagnostics and drift checks, identifying instability and false positives before rollout and increasing stakeholder confidence in production analytics.
PythonMLSQLMLOps

Graduate Research Assistant

Carnegie Mellon University
Pittsburgh, PASummer'23
  • Built a time-series forecasting model to predict adoption levels for various cryptocurrency tokens in underserved markets
  • Analyzed 100K+ records of behaviour and token engagement research data to segment customer bases
  • Ranked various proposals based on our model, and delivered results to Heinz College stakeholders
Machine LearningResearchTeachingEconometrics

Graduate Teaching Assistant

Carnegie Mellon University
Pittsburgh, PASpring'23 & Fall'23
  • Applied Econometrics : Causal Inference & Hypothesis Testing using R
  • Supported the Professor by conducting office hours, recitations, and grading for 80 graduate students
Machine LearningResearchTeachingEconometrics

Machine Learning Engineer

Quantiphi
Mumbai, India2021 – 2023
    • Large-Scale Production Engineering: Engineered and deployed recommendation and out-of-stock detection systems for 30,000+ retail stores using Python and PySpark, generating $1.2M in incremental revenue through proactive inventory management.
    • Cloud Migration & MLOps: Led the migration of legacy ML pipelines to a scalable architecture on Azure Databricks. Automated end-to-end workflows using Airflow and MLflow for experiment tracking, reducing deployment latency and ensuring stable production environments.
    • Data Reliability Infrastructure: Built automated data-quality validation and anomaly-detection pipelines (incorporating XGBoost and HMM/GMM experiments) to ensure the integrity of pricing and sales systems, improving data accuracy by 35%.
    • Inference & Consumption: Exposed model outputs via REST APIs and interactive dashboards, enabling real-time inference consumption by downstream applications and optimizing the Space-to-Sales ratio for field operations.
    • Strategic Research & Optimization: Conducted cross-functional research on pricing anomalies and labor demand forecasting, translating complex analytical findings into actionable workforce allocation strategies for executive stakeholders.
    Machine LearningResearchTeachingEconometrics

Education

Formal training and core coursework

Carnegie Mellon University
Pittsburgh, PA
2023 – 2025
M.S. in Information Systems & Management
Data Science & ML
Graduate Program AI Systems Analytics
Relevant coursework
  • Machine Learning
  • Graduate AI
  • Operationalizing AI
  • Unstructured Analytics
  • Computer Vision
  • Database Systems
  • Advanced Business Analytics
  • Applied Econometrics
  • A/B Testing
Manipal Institute of Technology
Karnataka, India
2017 – 2021
B.Tech in Computer Science
Specialization in Intelligent Systems
CS Foundations Systems AI/ML
Relevant coursework
  • Software Engineering
  • Data Structures
  • Algorithms
  • Object-Oriented Programming
  • Cloud Computing
  • Parallel Programming
  • NLP
  • Computer Vision
  • Advanced Mathematics

Creative

Art + Writing outside of work

Art

Happy
Acrylic • 2024
Friend
Acrylic • 2024
Light
Digital • 2025

Writing

Manipal The Talk Network

Sub-Head of Writing

Led the creative direction of our university's largest media ord

Read
Head of Content

Cultural Fest'19

Head of Content for MIT's cultural fest hosting over 20k students

Read
The Editorial Board, Manipal

Writer

Wrote 4 Articles for the Editorial Board

Read

Connect

Reach out for collaborations, roles, or a quick chat.

Send me a message

This uses your email app :)