
Data Analyst

Location:
San Francisco Bay Area, CA
Posted:
December 16, 2024


Resume:

Jingyi (Roxie) Zhao

786-***-**** • **********@********.*** • https://www.linkedin.com/in/jingyi-z • Open to relocate

EDUCATION

University of California, Berkeley Berkeley, CA

Master of Analytics, Department of Industrial Engineering & Operations Research Aug 2023 - Aug 2024

Related Courses: Financial Engineering, Machine Learning, Deep Learning, Database Management, Data Analytics

University of Miami Miami, FL

Bachelor of Science in Business Administration, Business Analytics Major, Accounting Minor Aug 2019 - May 2023

• GPA: 3.85/4.0; Provost’s Honor Roll & Dean’s List 2020-2022

SKILLS

• Programming Languages: Python (Pandas, NumPy, Scikit-Learn, TensorFlow, gurobipy), SQL (MySQL, PostgreSQL), R, MATLAB, C++, Simio

• Tools: Tableau, Power BI, Airflow, Bloomberg, Advanced Excel (Pivot Tables, Functions, Macros, etc.), Microsoft Office

• Machine Learning & Analytics: Regression, Decision Trees, Random Forest, SVM, KNN, Neural Networks, GANs, NLP, Causal Inference, Statistical Modeling, Time Series Forecasting, A/B Testing, Association Rules, Clustering, etc.

PROFESSIONAL EXPERIENCE

CarbonSustain Startup Berkeley, CA

Full-stack AI & Data Analyst Intern May 2024 – Sep 2024

• Improved dataset completeness, increasing data volume by 25% through web scraping via APIs, and developed complex SQL queries for deep-dive analysis of financial and GHG emission data.

• Created a prediction model for the marketing group to attract new clients, with a 20% increase in new users. The model used XGBoost and Random Forest to forecast the company's five-year carbon emissions, with a 10% boost in model accuracy.

• Provided stakeholders with data insights for sustainability initiatives by creating SHAP value charts in Python to enhance model interpretability.

• Developed comprehensive technical documentation for the emission prediction model, maintained model accuracy through periodic validation, and ensured clear communication of methodologies and changes to stakeholders.

China Galaxy Securities (leading securities company in China) Remote

Data Analyst Intern Aug 2022 – Oct 2022

• Leveraged data mining and ETL processes to ingest and transform large-scale structured datasets (food and beverage sector) from diverse sources (Bloomberg, Wind, etc.), and extracted over 10,000 rows of livestream data to support the team’s analysis

• Conducted predictive financial analysis of food and beverage companies using an RNN model with Python and Bloomberg, leading to a 5% improvement in portfolio performance through data-driven portfolio optimization with gurobipy

• Performed anomaly detection on market trend metrics using Python and created interactive dashboards in Tableau to enable proactive risk mitigation and inform portfolio management decisions

• Improved stock price prediction accuracy, achieving an RMSE of 2.9 with a TensorFlow LSTM-based neural network model used to identify high-alpha stocks, and incorporated the results into an industry report to provide investment insights

Haitong Securities (leading securities company in China) Shanghai, China

Bond Market Research Analyst Intern Sep 2021 – Dec 2021

• Executed end-to-end bond market analysis and underwriting for RMB 5.5 billion in bonds, including conducting due diligence, preparing documentation, crafting roadshow slides, and coordinating with stakeholders, resulting in 4 successful high-profile project bids

• Developed 10+ key metrics to evaluate macroeconomic conditions and the bond market through market research analysis

• Built dynamic Tableau dashboards for real-time performance tracking, serving 15+ stakeholders, improving cross-functional insights, and boosting efficiency by 20%

• Migrated and streamlined the DCF valuation model and sensitivity analysis for 7 issuers from Excel to a Python script, resulting in a 15% improvement in evaluation efficiency for the DCM team

• Created a forecasting model (XGBoost) in Python to assess bond credit risks, leveraging macroeconomic indicators and issuer financial data, and achieved 69.13% accuracy through advanced feature engineering and model optimization

SELECTED PROJECTS

Personalized Movie Recommendation System (https://github.com/RoxieZhao/MovieRecommendation.git) Jan 2024 – Apr 2024

• Scraped a dataset of over 100,000 movies from IMDb and built a hybrid recommendation model combining content-based and collaborative filtering techniques in Python, boosting user engagement and movie views

• Implemented statistical techniques such as TF-IDF, cosine similarity, and SVD for algorithmic efficiency, coupled with grid search and cross-validation for parameter optimization, achieving 73% recall after fine-tuning

• Enhanced user experience by developing a custom Streamlit web interface with custom CSS, leading to a 30% increase in user interaction and a more intuitive platform for personalized recommendations

JD.com User Purchase Power Prediction Project (https://github.com/RoxieZhao/UserPurchasePowerPrediction.git) Aug 2023 – Dec 2023

• Streamlined an ETL pipeline developed in AWS for 2.5 million user data records, increasing processing speed by 20% and optimizing data handling

• Conducted e-commerce customer segmentation to inform sales strategies using Random Forest, CART, and boosting algorithms in Python, achieving a 79% improvement in identifying users’ purchase tendency and driving a 10% increase in sales campaign performance across various channels

• Extracted important features, performed EDA, developed a rule-based propensity score, and suggested roll-out plans for customized digital marketing strategies


