Career
Data Scientist
WPP
· June 2023 - Present
- Conceived, architected, and led the development of WPP's taxonomy mapping platform — a semantic knowledge system using embedding-based similarity and weighted graph algorithms, generating 513,000+ mappings across 719 audience segments, 90 markets, and 9 ad-tech platforms including Meta, Google, Amazon, TikTok, and LinkedIn.
- Define and drive the end-to-end technical roadmap in collaboration with product and engineering stakeholders, prioritising model accuracy, pipeline automation, and feature development.
- Productionized the taxonomy mapping system as a Python library powering the Audience Translator web application — adopted by 5,000+ users across 90 markets, with configurable embedding model weighting, automated KPI generation, and modular evaluation pipelines.
- Led the design and integration of automated mapping pipelines with Airflow, reducing turnaround from weeks to under one week (60% reduction) and saving ~60 hours of manual effort per mapping cycle.
- Designed and implemented a rigorous evaluation framework using text corruption strategies and ground-truth validation, establishing a 70% precision baseline that guided subsequent model improvements.
- Integrated fine-tuned BERT-based models and developed a proprietary weighted similarity algorithm, improving mapping accuracy from 70% to 86% through systematic experimentation across 7+ model configurations.
- Architected LLM-powered features (GPT-4, Gemini 1.5 Pro) for taxonomy enrichment and lightweight RAG, increasing taxonomy utilisation in mappings by 16% without dependency on external databases.
- Designed an LLM supervision layer using GPT-4/4o to pre-validate mappings before human review, reducing manual validation effort by 60% and cutting review turnaround from days to hours.
- Led investigation into geo-targeted advertising data quality across 590K+ device IDs in Snowflake, improving device and email matching accuracy by 12%; designed a QA framework for partner data validation and onboarding.
- Evaluated Habu clean room technology for secure cross-organizational data collaboration, providing strategic technical recommendations from a data science perspective.
- Architected a LangGraph-based multi-agent system for automated ad-tech platform research, generating validated technical reports with API specifications, audience taxonomies, and reach estimates.
- Developed MCP server infrastructure to expose the taxonomy mapping system's capabilities to AI agents, enabling programmatic access across the organization.
Junior Data Scientist
Choreograph / WPP
· March 2022 - June 2023
- Won WPP Data Challenge #5; participated in Data Challenge #4.
- Core contributor to the Audience Knowledge Graph (AKG) project: designed and implemented taxonomy mapping algorithms, data normalization pipelines, and entity relationship modelling to support downstream analytics and activation; built Looker Studio dashboards for pipeline monitoring.
- Developed the initial semantic similarity algorithms using pre-trained BERT models from Hugging Face, establishing the technical foundations for the production mapping system.
- Designed and applied entity resolution techniques to map and deduplicate entities across datasets, optimizing algorithms for performance and scalability.
WPP NextGen Leader
WPP
· June 2022 - August 2022
10-week internship providing cross-agency exposure to WPP's creative, media, and campaign operations.
Data Intern
Choreograph / WPP
· November 2021 - March 2022
- Identified a critical gap in the team's data fusion workflow and independently proposed an NLP-based solution for automated feature matching between datasets.
- Designed and prototyped the initial taxonomy mapper — the foundational project that evolved into WPP's production taxonomy mapping platform.
- Deployed compute-intensive model training and evaluation workloads on Kubeflow for scalable processing.
Internship Program
Pagoda Projects
· February 2021 - March 2021
Main skills covered:
- Digital Competency
- Employability Skills
- Intercultural Fluency
- Workplace Basics for Graduates
Junior Data Analyst & Second Line Technical Support
TalkPool AG
· June 2019 - September 2019
Interned on a remote monitoring system in Haiti, covering data science and second-line technical support. Main duties included monitoring and analyzing communication tower data fetched from a cloud system.
Community Volunteer
Rizq
· May 2019
Volunteered for a total of 15 hours with Rizq, a people-powered movement united to end hunger whose success depends on the commitment of its volunteers.
Community Volunteer
Sultan Qaboos University
· September 2017
Attended a clinical attachment program at the Emergency Medicine department of Sultan Qaboos University Hospital.
Master's in Applied Data Science
University of Lancashire
· September 2020 - September 2021
Thesis: Performance of Optical Flow using Spiking Neural Networks. Major courses: Artificial Intelligence & Machine Learning, Internet of Things, and Big Data Technologies.
Besides working with numerous data technologies on related projects, I learned Linux in order to configure the ESP32 for IoT applications.
Bachelor's in Electrical Engineering
National University of Sciences and Technology
· October 2016 - May 2020
Participated in the National Engineering Robotic Competition (NERC) for two consecutive years in the line-following category, working mainly on the H-bridge for the robot's motors. Was a member of the horse riding club for most of my sophomore year. Joined the photography club in my freshman year and worked as an associate photographer at the EME Olympiad 2016.
Major elective courses: Artificial Intelligence, Computer Vision, Digital System Designs, and Electric Drives. Final-year project: Micro-Inverter with Lithium-Ion Battery.
Advanced Levels
Pakistan School Muscat
· May 2014 - May 2016
Ordinary Levels
Pakistan School Muscat
· May 2012 - May 2014
Primary Education
Pakistan School Muscat