Hongfei Lyu
New York, NY 646-***-**** ad6sz4@r.postjobfree.com
EDUCATION
Columbia University New York, NY
Master of Science in Applied Analytics (GPA: 4.0) Sep 2023 – Dec 2024
Coursework: Storytelling With Data, Data Analytics Using SQL and Relational Database, Python for Data Analysis, Machine Learning: Concepts and Applications, Data Modeling, Applied Analytics Framework and Method
University of California, San Diego San Diego, CA
Bachelor of Science in Mathematics-Computer Science Sep 2019 – Jun 2022
Honors: Provost Honors
Coursework: Data Science in Practice, Supervised Machine Learning Algorithms, Data Analysis and Inference, Intro to Mathematical Statistics, Advanced Data Structures, Business Analytics
TECHNICAL SKILLS
Languages: SQL, Python3, R, Java, C++, C#, MATLAB
ML Toolkits: Scikit-Learn, TensorFlow, Keras, PyTorch...
Data Visualization: JMP, Matplotlib, Seaborn, Tableau, Power BI, Plotly, ggplot2
MS Office: MS Excel, MS PowerPoint
Database: PostgreSQL, ER/Studio, MongoDB, pgAdmin 4, Spark, Neo4j, MySQL, MS Access
Website: HTML, CSS, Flask
Analytics: Adobe Analytics, Statistical Modeling, A/B Testing, NLP, Time Series Analysis, Cluster Analysis, Survival Analysis
PROFESSIONAL EXPERIENCE
HUICHUN Real Estate Development Company Shanxi, China
Data Analyst Intern Jun 2022 – Sep 2022
Built an OLS regression model in Python to predict residential housing prices within the city; conducted industry research and competitor analysis to develop list-price recommendations, presenting them to senior management for strategic planning
Collected and analyzed news, industry reports, and government statistics to design the company’s marketing pitch deck
Conducted sentiment analysis on customer reviews and feedback using NLTK and proposed CRM operations changes to improve offerings and address negative sentiment, enhancing customer satisfaction by 13%
iFLYTEK (AI Solution and IT Company with $3.5B market cap) Qingdao, China
Project Manager Intern - Operation Research and Data Team Aug 2021 – Nov 2021
Cleaned and analyzed 12k+ records of user behavior data via SQL to optimize the user experience on the official website of the company’s Global Chinese Learning Platform
Built logistic regression, K-NN, SVM, and Random Forest models in Python to evaluate the product’s interactive performance, achieving 89% average accuracy; provided recommendations on website design that increased the conversion rate by 20%
Collected consumption attitude data via questionnaires and interviews, and created target customer profiles and a Tableau dashboard
Acquired and onboarded the initial 6 higher education institutions for the newly launched digital learning course management system
Haier (Fortune Global 500) Qingdao, China
Product Manager Intern - Supply Chain Analytics Team Dec 2020 – Jan 2021
Researched and estimated service prices for 100+ partner suppliers on Haier’s digital supply chain network platform COSMOPlat
Designed and updated the platform database in PostgreSQL, including suppliers’ ERP, CRM, and other supporting systems data
Segmented suppliers’ firmographic data using SQL; conducted A/B testing on operating systems and evaluated order-to-delivery times, production efficiency, and product customization to identify target suppliers for consumer groups
Collaborated with cross-functional teams and stakeholders to ensure the feasibility of the operations plan and platform funding
PROJECT EXPERIENCE
NYC Noise Complaint Searching Engine Jan 2024 – May 2021
Extracted and processed non-relational and relational noise complaint data through web scraping and APIs.
Stored and retrieved data with PostgreSQL, MongoDB, and Neo4j to fulfill real-time complex data requests.
Visualized results through an interactive webpage using HTML and CSS, allowing users to check specific noise situations.
Predictive Modelling for Used Car Pricing Sep 2023 – Dec 2023
Executed data cleaning and preprocessing on 40k+ used car listings, including parsing, feature extraction and restructuring in R.
Designed an XGBoost model with tuned hyperparameters to predict sale prices of used cars from their features and condition.
Achieved 19th place in a field of over 500 participants in the Kaggle competition.
Uber CarShare Market Analysis Sep 2023 – Dec 2023
Collected and cleaned data on the performance of the general car rental market, industry counterparts, and benchmark products.
Analyzed data to extract actionable insights for Uber’s strategic planning, employing Tableau for effective data visualization.
Created an interactive poster in Canva and a slide deck in PowerPoint to communicate the findings.
Eco-Friendly Packaging Market Research Design Sep 2023 – Dec 2023
Designed a randomized controlled experiment to analyze customer purchase decisions influenced by eco-friendly packaging.
Ran preliminary simulations to estimate potential outcomes and determine the necessary effect size for large-scale implementation.
Demonstrated the experiment and gathered additional data through surveys with binary-choice and Likert-scale questions.
Performed statistical analysis using chi-square and ANOVA tests to interpret survey results and extract meaningful insights.