Twitterazzi
POS tagging, NER (Spacy), Sentiment Analysis
Developed a web app for influencer Twitter activity analysis, employing POS tagging, NER, sentiment analysis, and wordcloud generation.
I am a Machine Learning Engineer with 4 years of experience in Generative AI, NLP, and Large-Scale ML Systems. Currently, I am completing my Master’s in Data Science at UC San Diego and actively seeking full-time opportunities where I can apply AI to solve impactful real-world problems.
Beyond industry work, I conduct AI-driven healthcare research at Dr. Lee’s lab (UCSD), where I develop multi-modal models to detect early signs of schizophrenia using speech-based biomarkers.
2024-2025
2020-23
Psychiatry Res 2024
Developed multi-modal models on speech data, integrating acoustic and linguistic features to detect social isolation.
IZA 2021
Collaborated with Dr. Kanika Mahajan to investigate how implicit gender preferences in job ads impact labor market outcomes. Employed NLP techniques such as tf-idf, topic modeling, and word clouds to analyze disparities between gender-preferred roles.
Assisted Dr. Ashwini Deshpande in studying the effects of district splits on program effectiveness. Compiled a dataset encompassing 800 districts and 7000 blocks through web scraping and census data. Analyzed district splits in India between 2011-20 and examined their influence on NREGA implementation.
AAEA 2021
Collaborated with Dr. Kanika Mahajan to assess the impact of Covid-19 on supply chain disruptions. Responsible for data cleaning and computing distances between production zones and retail centers using Google Maps API.
POS tagging, NER (Spacy), Sentiment Analysis
Developed a web app for influencer Twitter activity analysis, employing POS tagging, NER, sentiment analysis, and wordcloud generation.
Edge Detection, Hough Transformation, Google Vision API
Designed a web app that recognizes books on a bookshelf image. Segmented images using Canny edge detection and Hough transformation to identify book spines, followed by digitization and text extraction to identify books.
Convolutional Autoencoders, Image Restoration
Trained Convolutional Autoencoders on 13,000 images to rectify three types of obfuscations (blur, pixelation, and speckle noise) in facial images.
Data Synthesis, Population Modeling, IPF
Collaborated with Dr. Debayan Gupta to study Covid-19 spread. Employed DataSynthesizer model using NSS data to upscale the dataset 15 times, generating around 15 million observations. Presented an Iterative Proportional Fitting (IPF) approach for integrating census datasets to produce Synthetic Population.
CNN, Image Classification,
Trained a Convolutional Neural Network (CNN) to identify hand gestures. Achieved a ∼98% classification accuracy across 11 gesture classes using the VPLU dataset of 1100 images.
Web Scraping, UI Development, Cloud Deployment
Developed a stock analysis web app with over 20 years of financial data. The app used to serve over 100 monthly active users and includes a built-in DCF calculator for quick intrinsic value calculations. Hosted on Heroku.