John Paul Castro
Senior Data Platform & BI Architect | Modern Data Stack · Azure · dbt · Airflow · Kimball
Featured Project
JDE Data Platform
Live — Railway Cloud
Professional Summary
Data platform architect with 20+ years building enterprise-grade data systems in aerospace, entertainment, and telecommunications. I design the infrastructure that turns ERP chaos into clean, governed, decision-ready data — from legacy ETL pipelines to cloud-native Lakehouse architectures on Azure Databricks. Deep expertise in Kimball dimensional modeling, medallion architecture (Bronze/Silver/Gold), open-source data engineering (dbt Core, Apache Airflow, PostgreSQL), and end-to-end ERP integration (JD Edwards, GEAC, tcmIS). Dual master's degrees (MBA & MS Computer Science). Hands-on engineer who builds it, owns it, and delivers it. Bilingual: English and Spanish.
Core Competencies
Cloud & Data Platforms
Data Architecture
Data Engineering
Databases & Query Languages
BI & Reporting
Software Development
ERP & Integration
Experience
Senior Data Architect — Independent / Consulting
Self-Employed
- → Designed and built full medallion architecture platform for JDE Edwards data — Bronze extraction, Silver transformation, Gold aggregation
- → Built Node.js extractors pulling from JDE SQL Server into PostgreSQL via Apache Airflow orchestration
- → Implemented dbt Core models with proper naming standards, surrogate keys, and Kimball dimensional modeling
- → Built an MDM (Master Data Management) layer that unifies customer records across 5 separate ERP systems using Splink probabilistic record linkage — resolving 82 duplicates from 333 source records into 251 golden customer entities
- → Designed the full MDM pipeline: Node.js extraction → PostgreSQL Bronze → dbt Silver normalization → Python/Splink matching → golden record output with cross-reference mapping and consolidated sales visibility
- → Deployed full stack to Railway cloud: Fastify API + Next.js dashboard + PostgreSQL
Senior Data Warehouse & BI Architect
Incora (formerly Wesco Aircraft) · Valencia, CA
- → Served as Chief Data Architect for a global aerospace distributor with 3 ERP systems (JD Edwards, tcmIS, GEAC) across multiple continents — unifying all systems into a cloud-native Azure Databricks Lakehouse with Delta Lake and Unity Catalog, supporting Finance, SIOP, and compliance reporting at enterprise scale
- → Re-engineered a legacy Cognos ETL pipeline running 8+ hours daily into an optimized SSIS solution completing in under 30 minutes (94% reduction) — then migrated to Azure Databricks for real-time, scalable processing serving thousands of daily transactions
- → Led a full Kimball dimensional model redesign during the 2014 Haas merger migration from Cognos to QlikView; established snake_case naming standards and a layered data architecture (gdl, jdl, tdl) that remains the backbone of Finance and SIOP reporting more than a decade later
- → Recovered millions of zeroed F4211 pricing records in approximately two hours by reconstructing correct values from F42199 history — restoring critical data integrity under high-pressure conditions where all prior recovery attempts had failed
- → Designed and built a mission-critical .NET C# / JavaScript web application for Boeing's 787 Dreamliner tooling program, replacing disparate booth interfaces with a unified JDE DB2-integrated system supporting tool checkout, check-in, and billing across multiple manufacturing sites
- → Engineered a real-time warehouse StatusBoard application — still deployed across multiple distribution facilities — using .NET C#, JavaScript, and a Windows service polling JDE; delivers live visibility into picker performance, aisle status, and priority orders
- → Established enterprise data governance standards and authored complex T-SQL, DB2, Oracle PL/SQL, and PySpark/Spark SQL transformations processing billions of rows in high-transaction, real-time workloads
- → Built and maintained bidirectional JDE integrations for post-acquisition systems; developed custom compliance-reporting automation for Bombardier and a multi-customer web portal delivering real-time inventory and consignment visibility
- → Served as primary SQL expert across MS SQL, DB2, and Oracle; lead data resource for PwC and external audit/consulting engagements; owned JDE PY and DEV environment refreshes
Data & Application Consultant
The Walt Disney Company
- → Contributed to a $50M+ enterprise PowerBuilder/Sybase-to-Java/DB2 migration; responsible for data transformation logic using T-SQL and ADO.NET, and development of views and stored procedures
Software Developer — Consultant
Jaguar Consulting
- → Built a multi-tenant Rights Management platform serving NBA, WNBA, MLB, Hallmark, National Geographic, NBC Enterprises, MGM, Lionsgate, and others — handling inventory, royalties, and accounting via a robust 3-tier VB.NET/ASP/Sybase/T-SQL architecture
Information Technology Instructor
Learning Tree University · Chatsworth, CA
- → Taught Microsoft Visual C++, C Programming (Basics and Advanced), and software engineering principles at the collegiate level
IT Manager
800 Direct, Inc. / CyberRep.com
- → Managed IT operations across two California telecenters (400+ workstations, team of 12), overseeing mission-critical applications for 30+ clients and modernizing legacy Clipper systems to Visual Basic 6.0, SQL Server, and web-based architectures
Education
Master of Business Administration (MBA)
California State University, Northridge
Master of Science, Computer Science
California State University, Northridge
Bachelor of Science, Biology
California State University, Northridge
Certifications & Training
- → dbt Fundamentals Certification — dbt Labs
- → IBM Cognos Data Manager Certification
- → Qlik Data Modeling for Qlik Sense — 2021
- → Cognos Report Studio & Data Manager Training — 2013–2015
Interested in working together?
Contact JP — johnpaulcastro@gmail.com