Asad Saeed Chohan
Location: Fairfax, VA, USA Phone: 571-***-**** Email: ***********@*****.*** Status: Green Card
Expert in data architecture & strategy, data engineering, data platform, business intelligence, and machine learning.
Professional Experience
Solutions by STC, KSA. Jul 2021 - Jan 2025
Senior Data Architect
Served public and government sector clients as lead data platform and enterprise data architect within professional services. Fulfilled client requirements for data mesh, real-time analytics, data warehouse, Data & AI strategy, and data governance.
Designed a telecom data mesh on Azure, decentralizing data ownership for customers, network, products, and fraud domains, integrating source systems like CRM, billing, OSS, BSS, network logs, call records, and fraud detection platform.
Developed and delivered ETL/ELT pipelines with Azure Data Factory and Event Hubs to ingest diverse data into Azure Synapse.
Built data transformation ETL workflows using Databricks and PySpark (Python) for initial and incremental processing.
Used Medallion Architecture to streamline data ingestion, transformation, and analytics across Bronze, Silver, and Gold layers.
Planned and implemented a phased migration of a retail client's on-premises SQL Server to Azure Synapse, integrating data from POS systems, inventory management, customer loyalty programs, and supply chain databases for enhanced analytics.
Reverse-engineered SSIS data pipelines to remap them onto Data Factory and Databricks, enabling the required transformations. Data was structured into Bronze and Silver layers, with the Gold layer stored in Azure Synapse for analytics and reporting.
Designed real-time analytics integrating Bosch IoT Suite (interfacing traffic systems, energy grids, and environmental data) with Azure IoT Hub and Azure Event Grid. Utilized Azure Event Hubs for real-time event streaming, with Azure Functions handling real-time trigger processing.
Implemented data pipelines using Azure Data Factory to transform data, storing it in Azure Data Lake Storage before ingesting into Azure Synapse Analytics for analytics.
Initiated data governance practices based on the TOGAF framework using Informatica Axon and IDQ.
Supported the internal enterprise team in building conceptual, logical, and physical data model blueprints as an IT integrator company. Conducted an assessment of data warehouse operations, identified gaps, and delivered a roadmap to address the deficiencies.
Evaluated, extended, and documented the Data & AI strategy for alignment with Saudi Vision 2030 and the group. Presented the CTO with an updated Data & AI strategy, future roadmap, and key initiatives for 2025.
Saudi Aramco Rowan, KSA. Feb 2020 - Jun 2021
Senior Data Specialist
Led the migration of the offshore data platform to the Azure Databricks platform, ensuring a seamless transition and integrating data from RimDrill, Synergy Life, RigSense, and other BSS systems for reporting and analytics.
Engaged stakeholders to gather and validate mapping documentation created for the legacy Cloudera data platform.
Designed the compute resources, ensuring cost-efficient performance for high-volume data processing and analytics workloads.
Orchestrated Azure Data Factory pipelines to publish data to the raw storage area of the Databricks platform for further transformations.
Implemented Databricks workflows to detect changes and trigger transformations in stored procedures. Merged transformed data from load-ready schemas into target warehouse tables using SCD merge operations.
Enabled liquid clustering in Databricks to enhance query performance and optimize storage utilization.
Initiated data strategy & policies, ensuring compliance with data industry standards.
Mobily Telecom, KSA. Nov 2018 - Jan 2020
Big Data Platform ETL Professional
Identified flaws in the big data implementation and trained the team to leverage the data lake together with the Teradata warehouse.
Initiated data lakehouse architecture to align with future analytics vision, supporting data-driven decision making and BI.
Orchestrated a scalable ETL framework on NiFi, ensuring metadata capture and documented integration structures. Utilized HVR, Kafka, and NiFi for data sourcing by integrating network traffic into Cloudera Hadoop for analytics.
Successfully ingested CDR data for DPI analysis, enabling the development of targeted marketing campaigns. Addressed Kafka event sequence disruption by developing a windowing method in PySpark to streamline event sequencing.
Enabled real-time triggers for CMS on billing dunning of post-paid customers utilizing HVR, Kafka, and Apache NiFi processors.
Enriched data with related information utilizing Spark and integrated results with Hive tables. Extended Spark capabilities to integrate data into Hive, supporting major and minor column changes in SCD scenarios.
Built aggregations of B2C and B2B customer usage trends on Hive using Spark, enhancing data analysis and reporting capabilities.
Devised Python scripts to convert JSON, XML, and other non-structured formats to CSV for data warehouse consumption.
Teradata Corporation, KSA. Jan 2015 - Oct 2018
Data Scientist
Wrangled ADS datasets, performed feature engineering, integrated multiple sets into one dataset for machine learning. Used K-Means, Z-score, and K-NN to isolate outliers, improving data quality and analysis outcomes.
Implemented SVM through Scikit-learn to improve customer churn prediction by 7%, optimizing retention efforts.
Conducted network ROI analysis by implementing CapEx strategies; visualized predicted traffic and costs on Power BI. Helped decision-makers analyze current and predicted traffic, coverage, and investment opportunities effectively.
Enhanced credit fraud prediction accuracy by 9% using Social Network Analysis (SNA). Leveraged graph techniques to identify fraud patterns, improving the accuracy of credit fraud detection models.
Collaborated with sales to boost revenue by promoting bundled packages using advanced upselling models. Applied market basket analysis on 1M+ transactions, increasing sales by 4% through optimized offers.
Teradata Corporation, Pakistan. Nov 2011 - Dec 2014
Data Engineer
Developed Teradata utility scripts (FastLoad, BTEQ, MultiLoad, TPT) for data extraction, loading, and transformation in the ETL framework.
Established ETL pipelines for operational systems, integrated data sources such as CRMs, ERPs, ATMs, Online Banking, and campaign solutions into the data warehouse logical data model.
Implemented data pipelines in IBM DataStage using transformations such as Source Qualifier, Lookup, Joiner, and more.
Developed stored procedures, functions, views, and triggers for intermediate data transformations. Facilitated incremental, full and leftover processing within ETL jobs and sequences.
Integrated data from the raw and transformed layers into the target LDM data warehouse schemas according to the required SCD technique.
Makkays Hi-Tech Systems, Pakistan. May 2010 - Oct 2011
Software Engineer
Analyzed ERP enhancement requirements for the upgrade from both business and technology perspectives. Used ASP.NET and Crystal Reports to develop new ERP modules.
Ora-Tech Systems, Islamabad, Pakistan. Dec 2009 - Apr 2010
Software Engineer Intern
Redesigned the ERP development activities of the Pakistan Telecommunication Authority.
Certifications
Azure Databricks Platform Architect
Databricks, USA.
Certified AI Specialist
Arcitura, Canada.
Certified Scrum Fundamentals
6SigmaStudy, USA.
Certified Data Scientist
Arcitura, Canada.
Certified Six Sigma Yellow Belt
6SigmaStudy, USA.
Certified Big Data Professional
Arcitura, Canada.
Certified Teradata Professional
Teradata Corporation, USA.
Education
Master of Software Engineering
Bahria University, Islamabad, 2011-13.
Bachelor of Software Engineering
Bahria University, Islamabad, 2005-09.
Technologies
Data Platforms:
Azure Synapse, Teradata, Hive, Cloudera, Impala, Databricks, SQL Server, Oracle.
Data Engineering:
Azure Data Factory (ADF), Apache NiFi, Informatica PowerCenter, DataStage, Spark, SSIS.
Artificial Intelligence APIs:
OpenAI GPT-4 Turbo, DeepSeek-R1.
Stream processing:
Apache Flink, Confluent Kafka, PySpark, HVR.
Machine Learning:
Anaconda, Scikit-learn, Teradata Miner.
Data Governance:
Informatica Axon, Data Quality, and Data Catalog.
Architecture & Design:
Erwin Designer, Microsoft Visio.
Visualization:
Power BI, Seaborn.
Languages:
SQL, Python, Java.
Courses & Trainings
Azure data platform architect
Microsoft.
Microsoft data fabric
Microsoft.
Azure Synapse Analytics
Microsoft.
Snowflake Concepts & Architecture
Snowflake.
Project Management
Learning by LinkedIn.
ITIL Certification
Learning by LinkedIn.
Big data analytics badge
IBM.
Tableau 10 A-Z hands on for data science
Udemy.
Machine Learning A-Z hands on Python & R
Udemy.
Data science methodology
IBM.
Management Skills
Team Management:
Led the team and developed its skill set to achieve goals.
Vendor Management:
Managed vendor contracts and quality delivery.
Project Management:
Delivered projects on time and within given resources.
Strategic Thinking:
Executed long-term growth strategies for data initiatives.
Change Management:
Implemented an effective data transition strategy.
Requirements Management:
Defined and managed stakeholder needs.
Resource Management:
Optimized resources to enhance efficiency.