Guilherme Xavier
Senior Data Engineer | Building Scalable GCP & Snowflake Platforms | Reducing Compute Costs & Automating Workflows | Python, dbt, GenAI
15%
Compute Cost Reduction
65%
Query Performance Increase
80+
Manual Hours Removed Weekly
3M+
Events Processed Daily
June 2024 - Present
Migrated legacy Dataproc jobs to Airflow and dbt, reducing compute costs by 15% and improving query performance by 65%.
Automated onboarding for 15+ debt settlement partners into a centralized data platform supporting $700K+ in monthly revenue.
Built Snowflake streaming and Pub/Sub integrations across 30+ pipelines to enable real-time decisioning and operational analytics.
January 2022 - April 2024
Built and scaled 100+ Airflow pipelines and real-time ingestion services handling 3M+ events per day.
Designed an in-house communications orchestrator processing 70K+ daily messages, reducing third-party costs by 65%.
Managed a 5TB OpenSearch cluster and delivered 50+ dashboards for operational monitoring and analytics.
June 2021 - January 2022
Led migration of 50 on-premises sources (10TB) to Azure and improved performance by 40% with Delta Lake architecture.
Developed 20 pipelines in Azure Data Factory, Synapse, and Databricks integrating SQL Server, REST APIs, and SharePoint.
Established CI/CD practices across 5 repositories, reducing deployment time by 10% and improving release quality.