cv

Basics

Name Kuber Shahi
Email kshahi@ucsd.edu
Url https://kubershahi.github.io/
Summary Machine Learning Engineer and Data Scientist with 3+ years of experience architecting scalable AI systems, production data pipelines, and NLP solutions. Currently researching uncertainty quantification for medical image registration at UCSD's Biomedical Image Analysis Group.

Education

  • Sep 2024 - Jun 2026

    La Jolla, California

    Master's
    University of California San Diego
    Computer Science (AI Specialization)
    • Statistical NLP, Computer Vision I, AI Agents, ML Systems, Software Engineering, Algorithm Design and Analysis, Machine Learning for Robotics, Medical Image Computing
  • Jun 2021 - May 2022

    Sonipat, India

    Postgraduate Diploma
    Ashoka University
    Computer Science and Physics
    • Capstone Project in PPML, Advanced Machine Learning, Advanced Algorithms, Database Management Systems, Blockchain and Cryptocurrencies
  • Jun 2018 - May 2021

    Sonipat, India

    Bachelor's
    Ashoka University
    Computer Science
    • Intro to Machine Learning, Advanced Programming, Algorithm Design and Analysis, Distributed Systems, Operating Systems, Computer Networks, Computer Security and Privacy, Theory of Computation

Work

  • Aug 2025 - Dec 2025
    Machine Learning Intern
    Melio
    • Architected a modular GCP Vertex AI training pipeline to migrate legacy notebook code into decoupled, containerized modules, accelerating experimentation and cross-team feedback loops by 20%.
    • Engineered a model-agnostic architecture to enable multi-model support for rapid development, leveraging Vertex AI Experiments for automated performance tracking and versioning.
  • Jun 2022 - May 2024
    Data Scientist
    Vayana Network
    • Optimized distributed data pipelines with PySpark on AWS EMR/S3 to process millions of invoices, reducing ETL latency by 20% and improving analytics reliability.
    • Developed a Business Network Analysis platform using NetworkX and community detection algorithms, identifying high-value customers and contributing to 15% business growth.
    • Designed and deployed a Management Information System (MIS) with Apache Superset for real-time reporting, improving cross-functional decision-making by 15%.
    • Spearheaded an Entity Resolution system using NLP-based vectorization and blocking techniques to deduplicate company records, improving data accuracy by 30% through scalable string-matching algorithms.
    • Developed rapid proofs-of-concept (PoCs) to evaluate emerging ML technologies and architected ad-hoc data solutions, driving operational agility and data-informed strategic decision-making.
  • Nov 2021 - Mar 2022
    Data Science Intern
    Ageless Partners
    • Led the design and prototyping of a wearable-based fitness recommendation system, coordinating a team of 3 interns to develop a personalized engine that improved user engagement and retention by 20%
    • Applied causal inference and statistical modeling using Python and Plotly to identify key drivers of user health outcomes, enhancing de-aging research insights by 15%

Research

  • Jan 2026 - Present
    Graduate Student Researcher
    University of California San Diego
    • Researching uncertainty quantification for medical image registration; investigating predictive uncertainty vs.registration error via supervised error regression using TorchIO synthetic deformation.
  • Aug 2021 - Dec 2021
    Capstone Project in PPML
    Ashoka University
    • Advisors: Professor Mahavir Jhawar and Professor Debayan Gupta
    • Researched secure multi-party computation (MPC) techniques to enable privacy-preserving neural network training across distributed systems.
    • Implemented SecureNN protocols in C++ and analyzed their effectiveness in ensuring data privacy and computational efficiency for real-world, data-sensitive applications.
  • May 2021 - Aug 2021
    Research Intern
    Mphasis Lab, Ashoka University
    • Advisor: Professor Mahavir Jhawar
    • Researched and implemented Privacy-Preserving Machine Learning (PPML) protocols such as SecureML and BLAZE in C++, evaluating each protocol’s efficiency and security for sensitive datasets.
    • Co-led the development of a secure, optimized PPML protocol for business-specific needs under Prof. Mahavir Jhawar’s guidance, improving the protocol’s reliability and applicability for real-world solutions.
  • Jan 2021 - May 2021
    Independent Study Module in Applied Cryptography
    Ashoka University
    • Advisor: Professor Mahavir Jhawar
    • Examined and evaluated the security vulnerabilities in email clients that support the two primary forms of end-to-end email encryption: OpenPGP and S/MIME.
    • Illustrated the attacks on various email clients outlined in the Mailto: Me Your Secrets paper and suggested countermeasures against them.
  • Jan 2021 - May 2021
    Independent Study Module in Secure Machine Learning
    Ashoka University
    • Advisor: Professor Debayan Gupta
    • Studied the impact of adversarial attacks such as Data Poisoning and Model Evasion on the performance and reliability of machine learning (ML) models.
    • Demonstrated the attacks in the Subpopulation Data Poisoning Attacks paper to understand the impact of poisoning attacks on real-world machine learning models.

Teaching

  • Sep 2025 - Jun 2026
    Teaching Assistant
    Pol Sci Department, UC San Diego
    • Theories of Technology and National Security (Poli 145), Spring 2026, Prof. Michael F. Joseph, Class Size: 180. Tasks: hold office hours, grade assignments and exams, and assist students with technological concepts (both old and emerging) and their impact on state security.
    • Political Inquiry (Poli 30D), Fall 2025, Prof. Scott Desposato, Class Size: 210. Tasks: conduct weekly discussion sections, hold office hours, grade assignments and exams, and assist students with statistical concepts and STATA.
  • Aug 2020 - May 2021
    Teaching Assistant
    CS Department, Ashoka University
    • Discrete Mathematics (CS-1104), Spring 2021, Prof. Subash Bhalla, Class Size: 100. Tasks: holding weekly office hours, setting and grading assignments and test papers, and facilitating online lectures.
    • Probability & Statistics (CS-1208), Monsoon 2020, Prof. Mahavir Jhawar, Class Size: 80. Tasks: holding weekly office hours, grading assignments and test papers, and facilitating online lectures.
  • Jun 2019 - Jul 2019
    STEM Facilitator
    Maa Anandmayee Memorial School
    • Partnered with MakersBox Delhi to establish a STEM lab at the school by leading equipment selection, procurement, and setup while teaching students hands-on projects in cutting-edge technologies like Robotics, IoT, VR, and Lego.

Skills

Programming Languages Python, C/C++, JavaScript, SQL (NoSQL), R, Shell Scripting (Bash), HTML/CSS, MATLAB
Machine Learning & AI PyTorch, TensorFlow, LLMs, Hugging Face, LangChain, Scikit-learn, Pandas, NumPy, NLP, Computer Vision, Causal Inference
Data & Distributed Systems Apache Spark, Hive/Hadoop, ETL, GCP (Cloud Storage, Vertex AI), AWS (S3, EC2, Lambda), MySQL, PostgreSQL, MongoDB
Software & Infrastructure React.js, Node.js, Django, CI/CD, Git, Kubernetes, Docker, Linux, Selenium
Analytics & Visualization Apache Superset, Power BI, Plotly, Dash, Statistical Data Analysis
Professional Skills Project Management, Technical Communication, Problem Solving, Analytical Thinking, Team Leadership, Cross-functional Collaboration

Certificates

Machine Learning
Stanford University Oct 2021
HTML, CSS, and JavaScript for Web Developers
Stanford University Jul 2020

Awards

  • Jun 2022
    Bronze Medal
    CS Awards, Ashoka University
    Awarded for being ranked third in my batch during my PG Diploma degree.
  • Jun 2021
    Silver Medal
    CS Awards, Ashoka University
    Awarded for being ranked second in my batch during my undergraduate degree.
  • May 2022
    Dean's List
    Office of Academic Affairs, Ashoka University
    Appeared on the Dean's List: Monsoon'18, Spring'19, Monsoon'19, Spring'20, Monsoon'20, Spring'21, Monsoon'21, and Spring'22.

Leadership and Activities

  • Aug 2021 - May 2022
    Resident Assistant
    Ashoka University Resident Assistants
    • Managed a residence floor of 70 students, supporting their academic and personal needs while fostering community through team-building and wellness events.
  • Jan 2021 - May 2021
    Peer Mentor
    Office of Learning Support (OLS), Ashoka University
    • Mentored an undergraduate student on academic and career decisions, providing guidance on course selection, research opportunities, and professional development.
  • Sep 2018 - Feb 2020
    Team Member
    Ashoka University Volleyball Team
    • Represented Ashoka University in inter-collegiate volleyball tournaments, competing as part of a team across multiple university-level competitions.
  • Sep 2018 - Feb 2020
    Events Team and Member
    Ashoka University International Student Association (AUISA)
    • Co-organized Ubuntu, an inter-collegiate cultural festival, for two consecutive years, alongside various other cultural and educational events on campus, bringing together international and domestic students to foster cross-cultural exchange and community building.

Languages

English
Bilingual Proficiency
Nepali
Native Proficiency
Maithili
Native Proficiency
Hindi
Full Professional Proficiency