ETL &
Beyond.

// a collection of data solutions that empower business decisions

Transforming raw data into strategic intelligence — from pipeline architecture to executive dashboards, built for the decisions that matter.

Stack Architecture End-to-End
🗄️
Source Systems
ERP · Excel · Operational DBs
L1
⚙️
ETL / Integration
SSIS · Python · Staging Tables
L2
🏛️
Data Warehouse
SQL Server · PostgreSQL · Star Schema
L3
🔷
Transform Layer
T-SQL · dbt · Dimensional Models
L4
📊
BI & Reporting
Power BI · Executive Dashboards
L5

Data Engineering
with Purpose

Specialising in end-to-end BI solutions — from raw data ingestion to polished dashboards that enable leadership teams to act with confidence.

Every project in this portfolio represents a real business challenge solved through rigorous data modelling, clean SQL, and thoughtful visualisation design.

Projects span both real-world client work and independently developed concept builds — each labelled accordingly as Client Engagement or Conceptual.

T-SQL dbt Python SSIS PostgreSQL SQL Server Power BI ETL Pipelines Dimensional Modelling
5
Projects Delivered
15+
Reports Built
−40%
Reporting Time Cut
4
Industries Served

Featured Projects

■ Project #01 — Revenue & Customer Analytics
⬡ Conceptual
Revenue & Customer
Analytics

Without a proper BI layer, revenue sat in spreadsheets, churn was identified too late, and product data was too raw to use—leaving leadership without clear insight. Thus leading to the centralisation of key metrics (MRR, churn, feature adoption, growth) into a BI solution modelled for efficiency.

SaaS App Data Python ETL BigQuery dbt Models Power BI
Primary Metric
MRR
Schema Design
Star ★
Dashboards
4 Views
Reports Built
  • Executive Overview
  • Revenue Analytics
  • Customer Retention
  • Product Usage
■ Project #02 — Operations BI
⬡ Client Engagement
Operational Reporting
Automation

An operations team producing weekly reports from disconnected departmental Excel files faced a 3–4 day reporting cycle riddled with data inconsistencies. The solution centralised all operational data into a warehouse and replaced static spreadsheets with live Power BI dashboards — cutting reporting time by 40% and surfacing real-time KPIs.

Supply Chain Procurement Inventory SQL + SSIS ETL Data Warehouse Power BI
Reporting Time
−40%
Data Sources
4 Systems
Dashboards
4 Views

This query ranks every supplier by their average delivery delay, giving the operations team an instant view of which vendors are consistently late and by how much — the backbone of the Supplier Delays dashboard.

-- Vendor performance: avg delay days by supplier
SELECT
    v.vendor_name,
    AVG(DATEDIFF(day, po.expected_date, po.received_date)) AS avg_delay_days,
    COUNT(po.order_id)                                        AS total_orders
FROM  purchase_orders  po
JOIN  vendors          v  ON po.vendor_id = v.vendor_id
WHERE po.received_date IS NOT NULL
GROUP BY v.vendor_name
ORDER BY avg_delay_days DESC;
SQL Power BI SSIS Excel Data Warehouse Operational BI ETL Pipelines
Business Impact
  • Reporting cycle cut from 3–4 days to same-day
  • Identified procurement bottlenecks via heatmaps
  • Data accuracy improved through centralised model
  • Weekly reports replaced by real-time dashboards
Dashboards Built
  • Procurement Performance
  • Inventory Turnover
  • Supplier Delays
  • Fulfilment KPIs
Data Sources
  • Supply Chain System
  • Procurement System
  • Inventory System
  • Excel Department Reports
■ Project #03 — Manufacturing Safety
⬡ Conceptual
Workplace Safety Compliance
Reporting

Across multiple sites, OSHA compliance and incident data lived in a patchwork of departmental Excel files — each formatted differently, updated inconsistently, and emailed to a central coordinator for manual consolidation. By the time monthly safety reports were ready, the data was weeks old and risks had gone undetected. We replaced this fragmented process with a centralised PostgreSQL + dbt pipeline feeding automated Power BI safety dashboards — enabling real-time compliance monitoring and proactive risk identification.

Incident Reports Inspection Logs Training Records PostgreSQL dbt Models Power BI
Reporting Time
Automated
Data Sources
4 Systems
Dashboards
3 Views

This query calculates the OSHA-standard incident frequency rate per 200,000 work hours for each plant location — the universal safety benchmark that lets management compare risk levels across sites and flag which facilities need immediate attention.

-- Incident frequency rate per 200,000 work hours
SELECT
    plant_location,
    COUNT(incident_id)                                        AS total_incidents,
    SUM(hours_worked)                                          AS total_hours,
    ROUND(COUNT(incident_id) * 200000.0 / SUM(hours_worked), 2) AS incident_rate
FROM  fact_incidents   fi
JOIN  dim_plants        dp  ON fi.plant_id = dp.plant_id
GROUP BY plant_location
ORDER BY incident_rate DESC;
PostgreSQL dbt Power BI Excel Star Schema OSHA Compliance Safety Analytics
Business Impact
  • Reporting reduced from 2 days to fully automated
  • Standardised KPI calculations across all departments
  • Real-time tracking of OSHA compliance scores
  • Faster identification of high-risk operational areas
Dashboards Built
  • Safety Compliance Overview
  • Operational Safety Analytics
  • Executive Safety Summary
Key KPIs Tracked
  • OSHA Compliance Score
  • Training Completion Rate
  • Incident Frequency Rate
  • Near-Miss Reporting Rate
■ Project #04 — Legacy Modernisation
⬡ Client Engagement
Refactoring a 5,000+ Line
Legacy SQL Stored Procedure

A single monolithic SQL Server stored procedure exceeding 5,000 lines handled everything — extraction, transformation, business logic, and report aggregation. Execution took 45–60 minutes and any change risked breaking downstream reports. The procedure was decomposed into a modular SSIS ETL architecture backed by a dimensional data warehouse, cutting run times to 10–15 minutes and dramatically improving maintainability.

Operational DBs SSIS Extraction Staging Tables Transform Layer Star Schema DWH BI Reports
Execution Time
−75%
Procedure Lines
5,000+
Architecture
Modular

This is the core fact table load that replaced the 5,000-line monolith — aggregating sales revenue and order counts by date, product, and region into a clean, modular step that all downstream BI reports now depend on.

-- Before: monolithic CASE logic buried in 5,000+ line proc
-- After: isolated, testable transformation step
INSERT INTO fact_sales_summary (date_key, product_key, region_key, total_revenue, order_count)
SELECT
    dk.date_key,
    pk.product_key,
    rk.region_key,
    SUM(s.amount)      AS total_revenue,
    COUNT(s.order_id)  AS order_count
FROM  staging_sales   s
JOIN  dim_date        dk  ON CAST(s.order_date AS DATE) = dk.full_date
JOIN  dim_product     pk  ON s.product_id  = pk.source_product_id
JOIN  dim_region      rk  ON s.region_code = rk.region_code
GROUP BY dk.date_key, pk.product_key, rk.region_key;
SQL Server T-SQL SSIS Star Schema Data Warehouse ETL Pipelines Legacy Refactoring
Performance Gains
  • Execution time reduced from 45–60 min → 10–15 min
  • 5,000+ line monolith split into isolated ETL packages
  • Star schema design optimised query plans for BI tools
  • New metrics added without touching existing logic
Refactoring Steps
  • Logic decomposition
  • Staging layer build-out
  • Modular transformation layer
  • Dimensional DWH loading
Skills Demonstrated
  • Legacy system refactoring
  • SQL optimisation
  • Dimensional modelling
  • BI pipeline performance tuning
■ Project #05 — Procurement Analytics
⬡ Client Engagement
SQL Server Procurement
Analytics Platform

Procurement reports were manually assembled from ERP exports every month — a 5–7 day cycle that left management unable to answer basic spend questions. A centralised SQL Server data warehouse, SSIS ETL pipeline, and Power BI dashboard suite replaced the process entirely, delivering real-time spend visibility, contract compliance monitoring, and strategic supplier analysis from a single source of truth.

ERP Purchasing Accounts Payable Supplier Master SSIS ETL SQL Server DWH Power BI
Reporting Cycle
5–7 Days
Top 10 Suppliers
62% Spend
Dashboards
4 Views

This query surfaces the top 20 suppliers by total purchase order value for the current fiscal year, including each supplier's percentage share of overall spend — directly powering the spend concentration view that revealed just 10 suppliers account for 62% of all procurement costs.

-- Supplier spend concentration: top 20 by total PO value
SELECT
    ds.supplier_name,
    ds.category,
    SUM(fpo.line_value)                                               AS total_spend,
    ROUND(SUM(fpo.line_value) * 100.0 / SUM(SUM(fpo.line_value)) OVER(), 2) AS spend_pct
FROM  fact_po_lines   fpo
JOIN  dim_supplier    ds   ON fpo.supplier_key  = ds.supplier_key
JOIN  dim_time        dt   ON fpo.date_key      = dt.date_key
WHERE dt.fiscal_year = YEAR(GETDATE())
GROUP BY ds.supplier_name, ds.category
ORDER BY total_spend DESC
OFFSET 0 ROWS FETCH NEXT 20 ROWS ONLY;
SQL Server T-SQL SSIS Power BI Star Schema Procurement Analytics Spend Analysis
Strategic Impact
  • Eliminated 5–7 day manual monthly reporting cycle
  • Packaging materials identified as 38% of total spend
  • Plastic packaging costs flagged at +12% year-over-year
  • Enabled data-driven supplier negotiations and contract compliance monitoring
Dashboards Built
  • Executive Procurement Overview
  • Supplier Spend Analysis
  • Category Spend Analysis
  • Procurement Efficiency
Fact & Dim Tables
  • fact_purchase_orders
  • fact_supplier_invoices
  • dim_supplier / dim_category
  • dim_contract / dim_time
Let's Build Together

Have a data challenge?

Whether it's a reporting overhaul, a new analytics pipeline, or a dashboard redesign — let's talk.

Start a Conversation →