About Me
I'm Nancy - a Senior Data Engineer who turns messy data into business gold. With 6+ years of experience from fullstack dev to AI-powered analytics, I build pipelines, strategies and solutions that make data work harder for your business.
I am fluent in PySpark, SQL, Python, dbt, cloud data platforms and data governance. I know how to pair that technical expertise with AI and LLM-driven insights to unlock business growth.
I live in the intersection of engineering, insights and innovation - where ideas turn into impact and complex problems find elegant, scalable solutions.
Core Expertise
PySpark
SQL
Python
dbt
AWS Cloud
Apache Iceberg
Apache Spark
Palantir Foundry
Docker
Agentic AI
RAG Systems
Data Governance
Professional Experience
Senior Data Engineer
@ Ringier South Africa.
—
Senior Data Engineer
@ Ringier South Africa.
—
Property Data Infrastructure
- Designed and engineered scalable data marts, reducing reporting turnaround time by 40% and enabling richer insights across property listings, customer interactions, and marketplace trends.
Customer & Listing Behavior Modeling
- Built advanced data models to capture high-value property interactions (new listings, reactivations, conversions), directly supporting lead generation tracking and revenue growth.
Data Quality & Governance
- Led investigations into root causes of broken listing hierarchies, mismatched agent-property relationships, and pricing inconsistencies, restoring 98% data integrity across millions of records.
- Implemented automated validation checks and alerting systems, cutting data issue resolution time from days to hours and enhancing trust in marketplace analytics.
Data Engineer
@ Dalberg Data Insights.
—
Data Engineer
@ Dalberg Data Insights.
—
- Engineered and automated data pipelines processing millions of call detail records (CDRs) and mobile money logs, enabling gender-based insights at scale to support social and economic development initiatives.
- Designed and deployed scalable, dockerized pipelines for telecom clients, processing up to 20 TB of call records daily and reducing processing time by 50%.
- Developed automated workflows to measure the impact of development bank interventions on farmer livelihoods in Uganda, improving program evaluation accuracy and reducing manual reporting effort.
- Created a COVID-19 lockdown preparedness index for Uganda’s Ministry of Health, powering real-time dashboards that guided national pandemic response and resource allocation.
- Built and deployed ML models leveraging mobile phone data to estimate gender inequalities in Ghana’s labor markets, combining automation with advanced analytics to inform evidence-based policymaking.
Data Scientist
@ Tabiri Analytics.
—
Data Scientist
@ Tabiri Analytics.
—
- Designed and implemented statistical threat hunting models that analyzed millions of security logs to proactively detect suspicious activity, reducing the log review workload for cybersecurity engineers by 75% and accelerating incident response.
- Developed an interactive dashboard to visualize threat model outputs and enabled cybersecurity engineers to provide feedback on false positives and false negatives, improving model accuracy and operational efficiency
Software Engineer
@ Kemri Faces
—
Software Engineer
@ Kemri Faces
—
- Developed a web-based point of care Health Register that combined the five Maternal and Child Health registers. This improved efficiency of the nurses, reducing long queues and increased the number of expectant mothers who completed all the antenatal visits.
Education
- Master of Science in Information Technology - Carnegie Mellon University
- Bachelor of Computer Science - Masinde Muliro University