
Data Engineer Business Process

Location:
San Ramon, CA
Posted:
October 21, 2024


Resume:

Mahathi Lanka

Data Engineer

Phone: 980-***-**** Email: ad9lk8@r.postjobfree.com LinkedIn

SUMMARY

•Experienced Data Engineer with over 3 years of expertise in statistical analysis, predictive modeling, and large-scale data project management.

•Proficient in Python libraries (NumPy, Pandas) and R for data wrangling, analysis, text analytics, and data mining.

•Skilled in using Apache Kafka, Hadoop, NiFi, and Airflow for efficient ETL processes and data workflows.

•Expertise in data warehousing with Amazon Redshift, Snowflake, and Google BigQuery.

•Proficient in creating dashboards and visualizations using Tableau, Power BI, Matplotlib, and Plotly, enhancing data-driven insights for business decision-making.

•Strong background in advanced statistical techniques, including hypothesis testing, A/B testing, and time-series analysis.

•Specialized in developing business process models and workflows, with expertise in gathering and analyzing Business Requirements Documents (BRD) and functional specifications for business process owners and stakeholders in top financial institutions.

•Used Confluence to create and maintain detailed documentation, including project plans, technical specs, and business process workflows, allowing cross-functional teams to access and collaborate in real time.

•Experienced in AWS services (EC2, S3, RDS, Lambda) for cloud-based data storage, processing, and analysis.

•Demonstrated ability to design and structure UAT and production environments, streamline processes, and enhance newly implemented applications.

•Configured and managed cloud storage solutions (e.g., Cloud Storage, BigQuery) to efficiently store and query large volumes of structured and unstructured data.


•Skilled in designing and deploying comprehensive real-time data processing pipelines using Azure technologies such as Azure Stream Analytics, Azure Event Hubs, and Azure Databricks. This expertise resulted in scalable, high-performance solutions that provided timely insights and effectively supported critical business operations with low latency.

•Strong collaborative skills, working with business and technical stakeholders to align designs and process changes with business strategies.

•Collaborated with data scientists and engineers to optimize data pipelines and improve data processing efficiency, reducing latency and enhancing overall system performance.

•Proficient in advanced MS Excel, PowerPoint, and spreadsheet functionalities for comprehensive data analysis and reporting.

WORK EXPERIENCE

Epsilon Jul 2023 - Current

Data Analyst

•Designed and implemented comprehensive data validation processes to ensure the accuracy and integrity of data sets used for analysis.

•Designed SQL table schemas and implemented stored procedures and complex SQL statements with common table expressions (CTEs) to extract and store customer purchase and session data, improving data retrieval efficiency and reducing query response times by 30%.

•Managed and optimized large-scale data projects using Apache Kafka, Hadoop, and NiFi, ensuring efficient data handling and analysis.

•Implemented AWS Step Functions to orchestrate multiple AWS services into serverless workflows for efficient and reliable execution.

•Collaborated with data engineers and operations teams to implement ETL processes, writing and optimizing SQL queries for data extraction and analysis, leading to a 70% improvement in operational efficiency with Snowflake.

•Utilized AWS services such as EC2, S3, and Lambda to ensure seamless deployment of analytical models and tools, enhancing data operations and processing.

•Handled ETL development, including the design, development, and integration of components within the Snowflake ETL platform.

•Implemented data governance policies and procedures to ensure compliance with regulatory requirements and data security standards.

•Worked alongside data engineers to design and enhance data warehouse architectures, boosting scalability and performance.

•Conducted data cleaning and preprocessing activities to ensure the accuracy and quality of data used for analysis.

•Handled the management and optimization of SQL and NoSQL databases, including MySQL, PostgreSQL, and MongoDB, to ensure efficient data storage and retrieval.

•Conducted root cause analysis on data discrepancies and anomalies, implementing corrective actions to enhance data accuracy.

•Participated in training sessions on big data tools and cloud technologies, gaining foundational knowledge in data engineering.

•Enhanced business operations by designing and implementing robust PL/SQL solutions for Procure-to-Pay (P2P) and Order-to-Cash (O2C) processes, optimizing data models and incident management and improving system performance.

•Designed rich data visualizations to model data into human-readable form using Tableau and Matplotlib.

•Participated in weekly and daily Scrum meetings with LOB and Data Ops teams to discuss issues and progress.

Infosys Ltd, India Dec 2020 - Jul 2022

Data Analyst

•Analyzed complex data sets using statistical and data analysis techniques to identify trends, patterns, and insights.

•Crafted Python scripts to streamline data collection, cleaning, and preprocessing, significantly reducing manual labor and enhancing accuracy.

•Created advanced visualizations and dashboards in Tableau and Power BI, facilitating better understanding and quicker decision-making by developing various chart types including Pie Charts, Bar Charts, Tree Maps, Circle Views, Line Charts, Area Charts, and Scatter Plots.

•Conducted market research and competitive analysis using external data sources to provide strategic recommendations to stakeholders.

•Used AWS infrastructure services, including EC2, S3, Lambda, Redshift, and QuickSight, contributing to a 20% improvement in data processing efficiency as well as cost reduction, process automation, faster analytics, and data-driven decision-making.

•Leveraged Databricks, optimized SQL queries, and applied indexing strategies to enhance data retrieval, leading to improved data processing efficiency, exploration, and actionable insights.

•Conducted trend analysis using historical data and provided forecasting insights to aid in strategic planning and decision-making processes.

•Leveraged advanced Excel functionalities including pivot tables, VLOOKUP, and macros for detailed data analysis and reporting.

•Developed and maintained business intelligence reports using tools like Power BI, delivering real-time insights to stakeholders.

•Developed and refined predictive models using Scikit-learn and Random Forest, increasing accuracy for customer behavior forecasting and market trend analysis.

•Performed in-depth exploratory data analysis to uncover novel possibilities and furnish suggestions for improving operational effectiveness and business outcomes.

•Engaged in collaborative efforts with subject matter experts to grasp business requirements and transform them into actionable data analysis tasks, providing data-driven insights and supporting decision-making processes.

•Conducted in-depth root cause analysis for production issues, successfully addressing over 300 defects and enhancing issue resolution efficiency by approximately 20%.

•Developed and deployed data-driven insights into production environments, integrating with operational systems for real-time decision support.

PROJECTS

Disease Prevention Dashboard using Tableau

•Developed an interactive Tableau dashboard for nutritionists and market analysts to explore foods based on compound levels and understand their disease prevention potential, aiding in targeted marketing campaigns.

•Implemented various data visualizations, such as bubble charts, treemaps, and pie charts, to provide comprehensive insights into food compounds and their associated health benefits.

Trading Webpage

•Engineered a high-performance trading website using Node.js, Express, and MongoDB, implementing MVC architecture, user authentication, session management, and scalable RESTful APIs for efficient communication.

•Designed responsive UIs with HTML, CSS, and JavaScript, integrating real-time flash messaging for prompt user feedback and ensuring seamless frontend-backend communication via Mongoose for data consistency and validation.

Dynamic Sales Performance Dashboard for E-commerce Retailer

•Developed a dynamic, interactive sales performance dashboard in Power BI for an e-commerce retailer, integrating data from SQL Server and Amazon S3, and creating various visualizations to enhance user interactivity and decision-making.

•Collaborated with stakeholders to define and track key performance indicators (KPIs), improving sales forecasting accuracy by 25% and reducing report generation time by 40%.

SKILLS

•Languages: Python, R, C, Java, SQL, PL/SQL, NoSQL, Shell Scripting

•Cloud Technologies: AWS S3, IAM, EC2, Amazon QuickSight, Athena, AWS Glue, Amazon Redshift, Amazon SageMaker, Azure

•Data Visualization Tools: Tableau, Power BI, Excel

•Databases: PostgreSQL, SQL Server, Oracle, MongoDB, DynamoDB

•Machine Learning: k-NN, SVM, Random Forest, Decision Trees, Regression, Classification and Clustering

•Libraries: NumPy, Pandas, Seaborn, Matplotlib, Plotly, Scikit-Learn

•Other Skills and Tools: JIRA, Data Structures, Microsoft Office, Git, Snowflake, PySpark, ETL, Data Mining, Data Cleansing, Eclipse, Agile methodologies, Salesforce, GCP, PowerPoint, Adobe Analytics, Alteryx

EDUCATION

University of North Carolina at Charlotte (UNCC)

Master of Science in Computer Science

Velagapudi Ramakrishna Siddhartha Engineering College, India

Bachelor of Technology in Computer Science

CERTIFICATION

• AWS Certified Developer – Associate


