Data Governance Engineer

Location:

Seattle, WA

Salary:

95,000

Posted:

March 27, 2025

Contact this candidate

Resume:

Pardhu Kancharla

Seattle, WA *******************@*****.*** 203-***-**** www.linkedin.com/in/pardhukancharla

Professional Summary

Results-driven Data Engineer with 2 years of full-time experience, specializing in data pipelines, cloud-based ETL solutions, and big data analytics. Proficient in SQL, Google Big Query, AWS Redshift, and Snowflake, with expertise in ETL tools, Apache Airflow, and data governance. Adept at building scalable data workflows, optimizing performance, and ensuring data integrity in cloud environments such as AWS, Google Cloud, and Azure.

Technical Skills

Programming Languages: Python, SQL, Shell Scripting

Databases & Big Data: SQL, Google Big Query, AWS Redshift, Snowflake

Data Pipelines & ETL: Google Dataflow, AWS Glue, Google Cloud Data Fusion

Workflow Orchestration: Apache Airflow, Prefect, Kubernetes

Cloud Platforms: AWS, Google Cloud Platform (GCP), Microsoft Azure

Data Visualization: Tableau, Power BI, Google Data Studio, Looker

Data Governance & Security: IAM, Compliance & Data Governance, GDPR, HIPAA

Databases & Big Data: SQL, Google Big Query, AWS Redshift, Snowflake

Data Pipelines & ETL: Google Dataflow, AWS Glue, Google Cloud Data Fusion

Workflow Orchestration: Apache Airflow, Prefect

Cloud Platforms: AWS, Google Cloud Platform (GCP), Microsoft Azure

Data Visualization: Tableau, Power BI, Google Data Studio

Data Governance & Security: IAM, Compliance & Data Governance

Professional Experience

Data Engineer

Client: Infotech Hyderabad, India Jan 2021 to July 2022

Developed and maintained ETL data pipelines using Google Dataflow, AWS Glue, and Google Cloud Data Fusion, ensuring efficient data movement and transformation.

Optimized Big Query, Redshift, and Snowflake queries, improving performance and reducing data processing time by 30%.

Designed and automated workflows with Apache Airflow and Prefect, reducing manual intervention in data pipelines by 40%.

Designed and implemented scalable data pipelines on Azure, leveraging Data Modeling, ETL (Extract, Transform, Load) processes, and Data Warehousing to efficiently process and store large datasets. Utilized Scala and C# to develop optimized backend services that integrate with M365 and RESTful APIs for seamless data flow.

Implemented data governance policies, ensuring compliance with cloud IAM and security best practices.

Developed interactive dashboards in Tableau, Power BI, and Google Data Studio, providing key insights to stakeholders.

Led AWS and GCP cloud migration initiatives, improving scalability and reducing operational costs.

Applied analytical skills and mathematical modeling to enhance predictive analytics in business intelligence solutions. Integrated Microsoft 365 services for seamless collaboration while ensuring secure authentication and authorization in cloud computing environments.

Integrated multiple data sources into cloud data warehouses using AWS Glue, Dataflow, and Big Query, streamlining data ingestion and transformation.

Built real-time data streaming solutions using Apache Kafka and AWS Kinesis, improving data availability and decision-making speed.

Data Engineering

Net soft Mate, India Aug 2020 to Jan 2021

Assisted in building and monitoring ETL pipelines for structured and semi-structured data.

Analysed large datasets using Big Query, AWS Redshift, and Snowflake to support data-driven decision-making.

Created automated workflows in Apache Airflow, improving the reliability of data ingestion processes.

Worked with data governance frameworks to enhance compliance and security in cloud environments.

Developed data visualizations in Power BI and Tableau for reporting and analysis.

Supported AWS and GCP cloud-based data warehouse optimization, enhancing query performance and storage efficiency.

Assisted in implementing CI/CD pipelines for automated deployment of ETL workflows, reducing deployment time by 50%.

Conducted cloud security assessments, ensuring compliance with industry standards for AWS and GCP environments.

Education

Master of Science in Computer/Information Technology Services and Management

Campbellsville University - Kentucky [ August 2024]

Bachelor of Science in Computer Science

Dhanakula Institute of Technology – Andhra Pradesh, India [ July 2020]

Projects

Automated Data Quality Monitoring System

Designed and implemented a data quality framework using Python and Apache Airflow.

Automated anomaly detection and alerting for data integrity issues across multiple pipelines.

Reduced data inconsistencies by 35% through proactive issue identification.

Real-Time Data Pipeline for E-Commerce Analytics

Built a real-time data pipeline using Apache Airflow and Google Dataflow to process streaming data from an e-commerce platform.

Stored processed data in Google Big Query and created interactive dashboards in Tableau for sales and customer behaviour analysis.

Cloud-Based Data Lake on AWS

Designed and implemented a data lake architecture on AWS using S3, Glue, and Redshift.

Automated data ingestion and transformation workflows using Python and AWS Lambda.

Data Governance Framework Implementation

Developed and implemented a data governance framework for a healthcare organization, ensuring compliance with HIPAA regulations.

Configured cloud IAM roles and policies to enforce role-based access control (RBAC).

Real-Time Data Pipeline for E-Commerce Analytics

Built a real-time data pipeline using Apache Airflow and Google Dataflow to process streaming data from an e-commerce platform.

Stored processed data in Google Big Query and created interactive dashboards in Tableau for sales and customer behaviour analysis.

Cloud-Based Data Lake on AWS

Designed and implemented a data lake architecture on AWS using S3, Glue, and Redshift.

Automated data ingestion and transformation workflows using Python and AWS Lambda.

Data Governance Framework Implementation

Developed and implemented a data governance framework for a healthcare organization, ensuring compliance with HIPAA regulations.

Configured cloud IAM roles and policies to enforce role-based access control (RBAC).

Achievements

Reduced data pipeline processing time by 25% through query optimization and parallel processing techniques.

Recognized as “Top Performer” during internship for contributions to data pipeline automation and documentation.

Developed a real-time monitoring system that improved data reliability by 40%.

Successfully led a team initiative to migrate legacy ETL workflows to a cloud-native architecture, enhancing scalability and cost-efficiency.

Delivered a data visualization solution that improved decision-making efficiency for business stakeholders by 20%.

Reduced data pipeline processing time by 25% through query optimization and parallel processing techniques.

Recognized as “Top Performer” during internship for contributions to data pipeline automation and documentation.

Delivered a data visualization solution that improved decision-making efficiency for business stakeholders by 20%.

Contact this candidate