Data Engineering & Governance

Turning raw data into reliable infrastructure

Senior Data Engineer with 10+ years of experience designing, orchestrating, and supporting data pipelines and analytical data models. Proven track record transforming complex, fragmented data into scalable, reusable datasets using modern ETL tools across multi-cloud environments.

10+
Years of Experience
3
Microsoft Certifications
Multi
Cloud Platforms
01

About

I'm Janesyn Abit, a Data Engineering and Governance Associate Manager with a decade-long career building the backbone of modern data platforms. I specialise in designing end-to-end data pipelines that are resilient, observable, and built to scale.

My work sits at the intersection of engineering and governance — not just moving data, but ensuring it is trusted, documented, and reusable across the organisation. I've delivered solutions for enterprise clients at both Accenture and Telstra, operating across AWS, Azure, and GCP environments.

I'm passionate about data quality, lineage, and the invisible engineering that makes analytics reliable for everyone downstream.

Pipeline Architecture
Designing scalable batch and streaming pipelines using Spark, Airflow, and dbt across multi-cloud environments.
Data Governance
Implementing data catalogues, lineage tracking, and quality frameworks that build trust in enterprise data assets.
Analytical Modelling
Transforming fragmented source data into clean, reusable dimensional models and Snowflake data warehouses.
Cloud Engineering
Deploying and optimising data infrastructure on AWS, Azure, and GCP — selecting the right tool for the workload.
02

Experience

Accenture
Senior Role
Business Specialist / Data Engineer

Designed and maintained enterprise-grade data pipelines for large-scale clients, delivering end-to-end ETL solutions across cloud platforms. Collaborated with business stakeholders to translate complex requirements into scalable data models and governance frameworks. Led data quality initiatives and contributed to the implementation of data cataloguing and lineage tooling.

Python SQL Spark Azure Databricks dbt
Telstra
Senior Role
Business Specialist / Data Engineer

Supported and extended Telstra's core data platform, building reliable pipelines that served analytical and operational consumers. Worked across data ingestion, transformation, and serving layers — ensuring SLA compliance, pipeline observability, and data integrity at enterprise scale. Contributed to data governance policies and metadata management practices.

Snowflake Airflow AWS GCP Python SQL
03

Skills & Tech Stack

Languages
  • Python
  • SQL
  • Bash / Shell
  • YAML / JSON
Data Processing
  • Apache Spark
  • Databricks
  • dbt (data build tool)
  • Apache Airflow
Cloud Platforms
  • AWS (S3, Glue, Redshift)
  • Azure (ADF, Synapse, ADLS)
  • GCP (BigQuery, Dataflow)
Data Warehousing
  • Snowflake
  • Azure Synapse
  • Amazon Redshift
  • Google BigQuery
Governance & Quality
  • Data Cataloguing
  • Lineage Tracking
  • Data Quality Frameworks
  • Metadata Management
Visualisation & BI
  • Power BI
  • Microsoft SQL Server
  • Azure Database Admin
04

Projects

Phase 1 — Open Source

NOAA Weather data pipeline built end-to-end on free-tier accounts.

Live Project
Project 01
NOAA Weather Pipeline — Databricks

End-to-end data pipeline ingesting live weather forecast data from the NOAA public API across four US cities. Built on Databricks Serverless Compute using managed Delta tables throughout — Bronze raw ingest, Silver parse and cleanse, Gold daily aggregations, and a SQL serving layer with a live dashboard. Deliberately avoids legacy DBFS patterns in favour of Unity Catalog managed tables.

Databricks Delta Lake PySpark Spark SQL Python
View on GitHub
Live Project
Project 02
NOAA Weather Pipeline — Snowflake

End-to-end data pipeline in Snowflake processing NOAA weather forecast data across four US cities. Built a RAW → CLEAN → SERVING medallion architecture using internal stages, VARIANT JSON parsing, and typed transformations. Includes a published Snowsight dashboard with temperature trends, rain risk, and detail views. Demonstrates cross-platform data movement from Databricks to Snowflake.

Snowflake SQL JSON Snowsight Databricks
View on GitHub
Phase 2 — Cloud Vendors

Same NOAA pipeline rebuilt on each major cloud platform's native services.

Coming Soon
Project 03
NOAA Weather Pipeline — GCP

Cloud-native pipeline using Cloud Storage, Dataflow, BigQuery, and Looker Studio.

GCP BigQuery Dataflow Python
Coming Soon
Project 04
NOAA Weather Pipeline — Azure

Cloud-native pipeline using Data Factory, ADLS Gen2, Synapse Analytics, and Power BI.

Azure Synapse Data Factory Power BI
Coming Soon
Project 05
NOAA Weather Pipeline — AWS

Cloud-native pipeline using S3, Glue, Redshift, and QuickSight.

AWS Glue Redshift QuickSight
05

Certifications

Microsoft
MCSA: SQL 2016 Database Development
Achieved 2021
Microsoft
Power BI Data Analyst Associate
Achieved 2022
Microsoft
Azure Database Administrator Associate
Achieved 2023
06

Contact

Let's build something great together.

I'm open to senior data engineering roles, consulting engagements, and collaborative projects — particularly where data governance, pipeline reliability, or cloud architecture are at the core.

Reach out via email or connect with me on LinkedIn and GitHub.