Post Job Free
Sign in

Machine Learning Data Analysis

Location:
Jersey City, NJ
Posted:
June 27, 2024

Contact this candidate

Resume:

Hongfei Lyu

New York, NY 646-***-**** ad6sz4@r.postjobfree.com

EDUCATION

Columbia University New York, NY

Master of Science in Applied Analytics (GPA: 4.0) Sep 2023 – Dec 2024

Coursework: Storytelling With Data, Data Analytics Using SQL and Relational Database, Python for Data Analysis, Machine Learning: Concepts and Applications, Data Modeling, Applied Analytics Framework and Method

University of California, San Diego San Diego, CA

Bachelor of Science in Mathematics-Computer Science Sep 2019 - Jun 2022

Honors: Provost Honors Coursework: Data Science in Practice, Supervised Machine Learning Algorithms, Data Analysis and Inference, Intro to Mathematical Statistics, Advanced Data Structures, Business Analytics

TECHNICAL SKILLS

Languages: SQL, Python3, R, Java, C++, C#, MATLAB ML Toolkits: Scikit-Learn, TensorFlow, Keras, Pytorch...

Data Visualization: JMP, Matplotlib, Seaborn, Tableau, Power BI, Plotly, ggplot2 MS Office: MS Excel, MS PPT

Database: PostgreSQL, ER studio, MongoDB, PgAdmin4, Spark, Neo4j, MySQL, MS Access Website: HTML, CSS, Flask

Analytics: Adobe Analytics, Statistical Modeling, A/B Testing, NLP, Time Series Analysis, Cluster Analysis, Survival Analysis

PROFESSIONAL EXPERIENCE

HUICHUN Real Estate Development Company Shanxi, China

Data Analyst Intern Jun 2022 – Sep 2022

Built O.L.S. regression to predict residential housing prices within the city using Python; Conducted industry research and competitor analysis to develop recommendations on list prices, presenting to senior management for strategic planning

Collected and analyzed news, industry reports, and government statistics to design the company’s marketing pitch deck

Conducted sentiment analysis on customer reviews and feedback using nltk and proposed CRM operations suggestions to improve offerings and address negative sentiments, enhancing customer satisfaction by 13%

iFLYTEK (AI Solution and IT Company with $3.5B market cap) Qingdao, China

Project Manager Intern - Operation Research and Data Team Aug 2021 – Nov 2021

Cleaned and analyzed 12k+ records of user behavior data via SQL to optimize user experience for company’s Global Chinese Learning Platform official website

Built logistic regression, K-NN, SVM and Random Forest models using Python to evaluate product’s interactive performance and obtained 89% average accuracy, providing recommendations on improving website design and increasing conversion rate by 20%

Collected consumption attitude data via questionnaires & interviews and created target customer profiles and Tableau dashboard

Acquired and onboarded the initial 6 higher education institutions for newly launched digital learning course management system

Haier (Fortune Global 500) Qingdao, China

Product Manager Intern - Supply Chain Analytics Team Dec 2020 - Jan 2021

Researched and estimated service prices for 100+ partner suppliers on Haier’s digital supply chain network platform COSMOPlat

Designed and Updated platform database with PostgreSQL including suppliers’ ERP, CRM, and other supporting systems data

Segmented suppliers’ firmographic data using SQL; Conducted A/B testing on operating systems and evaluated order-to-delivery times, production efficiency, and product customization to identify target suppliers for consumer groups

Collaborated with cross-functional teams and stakeholders to ensure feasibility of operations plan and platform fundings

PROJECT EXPERIENCE

NYC Noice Complaint Searching Engine Jan 2024 – May 2021

Extracted and processed non-relational and relational noise complaint data through web scraping and APIs.

Stored and retrieved data with PostgreSQL, MongoDB, and Neo4j to fulfill real-time complex data requests.

Visualized results through an interactive webpage using HTML and CSS, allowing users to check specific noise situations

Predictive Modelling for Used Car Pricing Sep 2023 – Dec 2023

Executed data cleaning and preprocessing on 40k+ used car listings, including parsing, feature extraction and restructuring in R.

Designed XGBoost model with best-tuned parameters to predict sale prices of used cars, leveraging their features and condition.

Achieved 19th place in a field of over 500 participants in the Kaggle competition.

Uber CarShare Market Analysis Sep 2023 – Dec 2023

Collected and cleaned data on performance of general car rental market, industry counterpart and benchmark product.

Analyzed data to extract actionable insights for Uber’s strategic planning, employing Tableau for effective data visualization.

Utilized Canva and PowerPoint to create interactive poster and PowerPoint, facilitating impactful communication of findings.

Eco-Friendly Packaging Market Research Design Sep 2023 – Dec 2023

Designed a randomized controlled experiment to analyze customer purchase decisions influenced by eco-friendly packaging.

Run preliminary simulations to estimate potential outcomes and determine necessary effect size for large-scale implementation.

Demonstrated the experiment and gather additional data through surveys including binary choices and Likert scale.

Performed statistical analysis using chi-square and ANOVA tests to interpret survey results and extract meaningful insights.



Contact this candidate