PRANITHA VARSHINI S
DATA ENGINEER
Jersey City, NJ 929-***-**** ad69fr@r.postjobfree.com
SUMMARY
Master's graduate in Applied Artificial Intelligence with industrial experience as a Data Engineer in design, development, deploying, and large-scale supporting large-scale distributed systems. Proficient in complete Software Development Life Cycle (SDLC) for projects using methodologies like Agile and Waterfall. Working experience in Amazon Web Services (AWS) Cloud services such as EC2, S3, Redshift, Autoscaling, and Cloud Formation. Experience in creating and maintaining data warehouses, data marts, and data lakes, utilizing SSIS for building and populating dimensions, facts, and aggregates. Proficient in designing, developing, and maintaining Apache Spark applications for large-scale data processing and analytics. Exceptional communication skills, translating technical concepts into accessible language for non-technical stakeholders, ensuring clear alignment between data engineering efforts and business goals.
EDUCATION
Stevens Institute of Technology Hoboken, NJ GPA: 3.71 May 2023
Master of Engineering in Applied Artificial Intelligence (Minor: Electrical Engineering)
B V Raju Institute of Technology Medak, India GPA:3.67 May 2021
Bachelor of Technology in Electronics and Communication Engineering (Minor: Robotics)
SKILLS
SQL, Python, Apache Airflow, Apache Spark, SparkSQL, HiveSQL, MapReduce, Hadoop, Databricks, AWS S3, Amazon Redshift, AWS Glue, Tableau, PowerBI, SSRS, SSIS, HDFS, Docker, C, HTML, ML Models, Google Colab, Jupyter Notebook, Data Modeling, Shell, Machine Learning Models, EDA, NLP, NLTK.
EXPERIENCE
Akamai Technologies, USA May 2022 – Present
Data Engineer
Spearheaded the migration of on-premises data infrastructure to AWS cloud, resulting in reduction in operational costs and improved scalability for handling growing data volumes.
Managed data pipelines and ETL processes to ensure smooth and efficient data flow utilizing the AWS cloud environment, processing over 10TB of data daily.
Applied Python libraries such as Pandas and NumPy to manipulate and clean data, reducing data cleaning time by 40% and enhancing data quality by 20%.
Collaborated with cross-functional teams including Data Analysts & Scientists and Software Engineers to understand data requirements and deliver solutions that meet business needs.
Designed and maintained data models in Tableau using techniques such as star schemas and data normalization, ensuring optimal performance and responsiveness of dashboards even with large datasets.
Engineered and optimized data models to support advanced analytics and reporting needs, resulting in a 40% improvement in query performance and enabling timely business insights.
Collaborated with external vendors and partners to integrate third-party data sources into the company's data ecosystem, expanding data insights and enhancing analytics capabilities.
Trigent Software Inc, India Jan 2020 – Aug 2021
Data Engineer
Designed and implemented a robust ETL pipeline using Apache Airflow, reducing data processing time by 30% and ensuring timely availability of critical business insights.
Deployed and managed distributed computing frameworks like Apache Spark for processing and analyzing large-scale datasets, improving processing speed.
Optimized and fine-tuned Python code for data transformation processes, resulting in a 25% discount in processing time and allowing for near-real-time data availability for analytics teams.
Created interactive Power BI dashboards to convey data relationships, trends & patterns, and to provide insights into key performance indicators (KPIs) communicating insights to non-technical stakeholders.
Delivered regular reports and presentations to management on project progress, resource allocation, and strategic recommendations based on data analysis.
CERTIFICATIONS
Cognizant - Agile Methodology Virtual Experience Program
Cambridge - Business English Certification (B2 Vantage)
Google - Data Analytics Professional Certification
Databricks - Databricks Lakehouse Fundamentals V2 Accreditation