Post Job Free
Sign in

Data Engineer

Location:
Jersey City, NJ
Salary:
80000
Posted:
July 15, 2024

Contact this candidate

Resume:

PRANITHA VARSHINI S

DATA ENGINEER

Jersey City, NJ 929-***-**** ad69fr@r.postjobfree.com

SUMMARY

Master's graduate in Applied Artificial Intelligence with industrial experience as a Data Engineer in design, development, deploying, and large-scale supporting large-scale distributed systems. Proficient in complete Software Development Life Cycle (SDLC) for projects using methodologies like Agile and Waterfall. Working experience in Amazon Web Services (AWS) Cloud services such as EC2, S3, Redshift, Autoscaling, and Cloud Formation. Experience in creating and maintaining data warehouses, data marts, and data lakes, utilizing SSIS for building and populating dimensions, facts, and aggregates. Proficient in designing, developing, and maintaining Apache Spark applications for large-scale data processing and analytics. Exceptional communication skills, translating technical concepts into accessible language for non-technical stakeholders, ensuring clear alignment between data engineering efforts and business goals.

EDUCATION

Stevens Institute of Technology Hoboken, NJ GPA: 3.71 May 2023

Master of Engineering in Applied Artificial Intelligence (Minor: Electrical Engineering)

B V Raju Institute of Technology Medak, India GPA:3.67 May 2021

Bachelor of Technology in Electronics and Communication Engineering (Minor: Robotics)

SKILLS

SQL, Python, Apache Airflow, Apache Spark, SparkSQL, HiveSQL, MapReduce, Hadoop, Databricks, AWS S3, Amazon Redshift, AWS Glue, Tableau, PowerBI, SSRS, SSIS, HDFS, Docker, C, HTML, ML Models, Google Colab, Jupyter Notebook, Data Modeling, Shell, Machine Learning Models, EDA, NLP, NLTK.

EXPERIENCE

Akamai Technologies, USA May 2022 – Present

Data Engineer

Spearheaded the migration of on-premises data infrastructure to AWS cloud, resulting in reduction in operational costs and improved scalability for handling growing data volumes.

Managed data pipelines and ETL processes to ensure smooth and efficient data flow utilizing the AWS cloud environment, processing over 10TB of data daily.

Applied Python libraries such as Pandas and NumPy to manipulate and clean data, reducing data cleaning time by 40% and enhancing data quality by 20%.

Collaborated with cross-functional teams including Data Analysts & Scientists and Software Engineers to understand data requirements and deliver solutions that meet business needs.

Designed and maintained data models in Tableau using techniques such as star schemas and data normalization, ensuring optimal performance and responsiveness of dashboards even with large datasets.

Engineered and optimized data models to support advanced analytics and reporting needs, resulting in a 40% improvement in query performance and enabling timely business insights.

Collaborated with external vendors and partners to integrate third-party data sources into the company's data ecosystem, expanding data insights and enhancing analytics capabilities.

Trigent Software Inc, India Jan 2020 – Aug 2021

Data Engineer

Designed and implemented a robust ETL pipeline using Apache Airflow, reducing data processing time by 30% and ensuring timely availability of critical business insights.

Deployed and managed distributed computing frameworks like Apache Spark for processing and analyzing large-scale datasets, improving processing speed.

Optimized and fine-tuned Python code for data transformation processes, resulting in a 25% discount in processing time and allowing for near-real-time data availability for analytics teams.

Created interactive Power BI dashboards to convey data relationships, trends & patterns, and to provide insights into key performance indicators (KPIs) communicating insights to non-technical stakeholders.

Delivered regular reports and presentations to management on project progress, resource allocation, and strategic recommendations based on data analysis.

CERTIFICATIONS

Cognizant - Agile Methodology Virtual Experience Program

Cambridge - Business English Certification (B2 Vantage)

Google - Data Analytics Professional Certification

Databricks - Databricks Lakehouse Fundamentals V2 Accreditation



Contact this candidate