
Data-Driven Decision Making
I hold a Master's in Data Science from NYU and am passionate about uncovering patterns, telling stories with code, and building tools that connect data to real-world decisions. Whether designing predictive models, analyzing behavior, or crafting intuitive interfaces, I love solving meaningful problems at the intersection of statistics, technology, and human experience.
Outside of work, you'll find me exploring new cities and cultures, water diving, — or getting unreasonably excited about lifting a heavy weight.
A/B Testing, Exploratory Data Analysis, Segmentation & Cohort Analysis, Funnel Analysis, Business Metrics, Behavioral Analysis, Causal Inference, Decision Modeling, Data Visualization (Tableau, Power BI)
Python (Pandas, NumPy, Scikit-learn, Dash, Plotly), SQL, R
ETL & Data Integration, Data Cleaning & Reconciliation, Distributed Computing (Spark, Hadoop), Cloud Platforms (AWS), Version Control (Git)
Predictive Modeling, Regression & Classification, Time Series Forecasting, Feature Engineering, Ensemble Methods (XGBoost, LightGBM, CatBoost), NLP & LLMs, Explainable AI (SHAP, LIME), Model Validation
ETH Tech
New York, NY
July 2025 – September 2025
Probability of Default Modeling – Applied ML in Finance Project
New York, NY
October 2024 – December 2024
Guotai Junan Securities
Shanghai, China
June 2024 – August 2024
ZADS Fund
Shenzhen, China
February 2023 - April 2023
China Everbright Bank
Beijing, China
June 2022 – August 2022

Built a probability of default prediction pipeline on 1M+ loan records—cleaned data, engineered 14 financial ratios as features and applied machine learning to model credit risk.

Built a bond yield prediction pipeline using decision tree models for Guotai Junan Securities, and developed stepwise regression models for fund duration estimation.

Processed 200K+ customer records from 8 relational tables; used WOE and IV to reduce 700+ features to 50, building binary classification models to predict asset change.

Engineered 16 high-frequency alpha factors using market microstructure data (Level-2/order book), inspired by academic literature and proprietary research. Built a machine learning pipeline integrating 10 low-correlation factors to predict T+1 stock returns.

This data analysis project explores movie ratings from over 1,000 participants across 400 films. I used statistical testing to investigate patterns in preferences—like how gender or birth order might influence opinions on Shrek or The Lion King. I also built regression models using Ridge, LASSO, and Elastic Net to predict ratings based on viewer demographics.
Passionate about strength training and pushing physical limits. Love the discipline and progress that comes with consistent training. Whether it's powerlifting, progressive overload, or just getting unreasonably excited about lifting heavy weights.
During my undergraduate years, I competed in a 15-event all-around fitness competition — including powerlifting's big three (squat, bench press, deadlift), sprints, standing long jump, endurance runs, and more. I won 9 individual events and took home the overall championship title.

Cross-country adventure, covering 4000 miles across 14 states, from New York City to San Diego, California. Learned to deal with the unexpected alone and to be ready to react and change plans as needed.

