John Paul Castro

Senior Data Platform & BI Architect | Modern Data Stack · Azure · dbt · Airflow · Kimball

📞 (818) 943-5159✉ johnpaulcastro@gmail.com🔗 linkedin.com/in/johnpaul-castro📍 Cleveland, TN · Open to remote (US)

Featured Project

JDE Data Platform

Live — Railway Cloud

Professional Summary

Data platform architect with 20+ years building enterprise-grade data systems in aerospace, entertainment, and telecommunications. I design the infrastructure that turns ERP chaos into clean, governed, decision-ready data — from legacy ETL pipelines to cloud-native Lakehouse architectures on Azure Databricks. Deep expertise in Kimball dimensional modeling, medallion architecture (Bronze/Silver/Gold), open-source data engineering (dbt Core, Apache Airflow, PostgreSQL), and end-to-end ERP integration (JD Edwards, GEAC, tcmIS). Dual master's degrees (MBA & MS Computer Science). Hands-on engineer who builds it, owns it, and delivers it. Bilingual: English and Spanish.

Core Competencies

Cloud & Data Platforms

Azure DatabricksDelta LakeUnity CatalogDLTAzure Data FactoryAzure Synapse AnalyticsAzure App Services

Data Architecture

Kimball Dimensional ModelingMedallion ArchitectureLakehouse DesignData GovernanceData Mart & Warehouse Design

Data Engineering

dbt CoreApache AirflowSSIS 2016/2022Databricks WorkflowsQlik ReplicateCognos Data Manager

Databases & Query Languages

PostgreSQLMS SQL ServerDB2Oracle PL/SQLT-SQLSpark SQLPySpark

BI & Reporting

Qlik SenseQlikViewPower BI (Embedded)Cognos

Software Development

PythonJavaScriptNode.jsReactNext.jsClerkStripeC#.NETHTML/CSS

ERP & Integration

JD Edwards (EnterpriseOne)GEACtcmIS

Experience

Senior Data Architect — Independent / Consulting

Self-Employed

2026 — Present
  • → Designed and built full medallion architecture platform for JDE Edwards data — Bronze extraction, Silver transformation, Gold aggregation
  • → Built Node.js extractors pulling from JDE SQL Server into PostgreSQL via Apache Airflow orchestration
  • → Implemented dbt Core models with proper naming standards, surrogate keys, and Kimball dimensional modeling
  • → Built an MDM (Master Data Management) layer that unifies customer records across 5 separate ERP systems using Splink probabilistic record linkage — resolving 82 duplicates from 333 source records into 251 golden customer entities
  • → Designed the full MDM pipeline: Node.js extraction → PostgreSQL Bronze → dbt Silver normalization → Python/Splink matching → golden record output with cross-reference mapping and consolidated sales visibility
  • → Deployed full stack to Railway cloud: Fastify API + Next.js dashboard + PostgreSQL

Senior Data Warehouse & BI Architect

Incora (formerly Wesco Aircraft) · Valencia, CA

August 2006 — March 2026
  • → Served as Chief Data Architect for a global aerospace distributor with 3 ERP systems (JD Edwards, tcmIS, GEAC) across multiple continents — unifying all systems into a cloud-native Azure Databricks Lakehouse with Delta Lake and Unity Catalog, supporting Finance, SIOP, and compliance reporting at enterprise scale
  • → Re-engineered a legacy Cognos ETL pipeline running 8+ hours daily into an optimized SSIS solution completing in under 30 minutes (94% reduction) — then migrated to Azure Databricks for real-time, scalable processing serving thousands of daily transactions
  • → Led a full Kimball dimensional model redesign during the 2014 Haas merger migration from Cognos to QlikView; established snake_case naming standards and a layered data architecture (gdl, jdl, tdl) that remains the backbone of Finance and SIOP reporting more than a decade later
  • → Recovered millions of zeroed F4211 pricing records in approximately two hours by reconstructing correct values from F42199 history — restoring critical data integrity under high-pressure conditions where all prior recovery attempts had failed
  • → Designed and built a mission-critical .NET C# / JavaScript web application for Boeing's 787 Dreamliner tooling program, replacing disparate booth interfaces with a unified JDE DB2-integrated system supporting tool checkout, check-in, and billing across multiple manufacturing sites
  • → Engineered a real-time warehouse StatusBoard application — still deployed across multiple distribution facilities — using .NET C#, JavaScript, and a Windows service polling JDE; delivers live visibility into picker performance, aisle status, and priority orders
  • → Established enterprise data governance standards and authored complex T-SQL, DB2, Oracle PL/SQL, and PySpark/Spark SQL transformations processing billions of rows in high-transaction, real-time workloads
  • → Built and maintained bidirectional JDE integrations for post-acquisition systems; developed custom compliance-reporting automation for Bombardier and a multi-customer web portal delivering real-time inventory and consignment visibility
  • → Served as primary SQL expert across MS SQL, DB2, and Oracle; lead data resource for PwC and external audit/consulting engagements; owned JDE PY and DEV environment refreshes

Data & Application Consultant

The Walt Disney Company

2005 — 2006
  • → Contributed to a $50M+ enterprise PowerBuilder/Sybase-to-Java/DB2 migration; responsible for data transformation logic using T-SQL and ADO.NET, and development of views and stored procedures

Software Developer — Consultant

Jaguar Consulting

2002 — 2005
  • → Built a multi-tenant Rights Management platform serving NBA, WNBA, MLB, Hallmark, National Geographic, NBC Enterprises, MGM, Lionsgate, and others — handling inventory, royalties, and accounting via a robust 3-tier VB.NET/ASP/Sybase/T-SQL architecture

Information Technology Instructor

Learning Tree University · Chatsworth, CA

2000 — 2002
  • → Taught Microsoft Visual C++, C Programming (Basics and Advanced), and software engineering principles at the collegiate level

IT Manager

800 Direct, Inc. / CyberRep.com

1999 — 2002
  • → Managed IT operations across two California telecenters (400+ workstations, team of 12), overseeing mission-critical applications for 30+ clients and modernizing legacy Clipper systems to Visual Basic 6.0, SQL Server, and web-based architectures

Education

Master of Business Administration (MBA)

California State University, Northridge

Master of Science, Computer Science

California State University, Northridge

Bachelor of Science, Biology

California State University, Northridge

Certifications & Training

  • dbt Fundamentals Certification — dbt Labs
  • IBM Cognos Data Manager Certification
  • Qlik Data Modeling for Qlik Sense — 2021
  • Cognos Report Studio & Data Manager Training — 2013–2015

Interested in working together?

Contact JP — johnpaulcastro@gmail.com