0 viewsjobseeker
Michael P. — Senior Data Engineer from United States

Michael P.

Senior Data Engineer

United States 6+ years
Open to offersNew to Platform
Languages
English
Video Introduction
No video introduction yet
The candidate has not added a video.
Contact information and social networks are private. Connect to unlock.
Hidden

About

Michael P. is an accomplished Staff Data Engineer with over nine years of expertise in designing and operating scalable data platforms, particularly within the e-commerce, marketplace, and enterprise sectors. His deep proficiency in technologies such as Snowflake, SQL, and Python has enabled him to deliver high-impact analytics solutions and optimize performance and cost structures effectively. At Chewy, Michael led key data engineering projects like the “Customer 360 & Order Intelligence” platform, crafting end-to-end ELT pipelines and fostering collaborations across Product, Marketing, and Supply Chain divisions. His innovative solutions in streaming and near-real-time ingestion with Apache Kafka have significantly reduced data latency. Prior to Chewy, Michael advanced Uber's data capabilities through efficient pipeline architectures that supported complex analytics and reporting tasks. He began his career at General Mills where he honed his skills in managing enterprise ETL processes and developing robust data models. Michael also holds a Master of Science in Applied Mathematics from the University of Lynchburg.

Experience

  • Staff Data Engineer

    Chewy · 2020 — Present
    Led the development of the 'Customer 360 & Order Intelligence' data platform, creating end-to-end ELT pipelines in Snowflake for comprehensive customer, order, and fulfillment analytics across various business functions. Architected an analytics layer in Snowflake and dbt, focused on transformation models with SQL-first development and enforced CI/CD practices to enhance efficiency. Managed domain-based data pipelines that sourced data from MSSQL systems, SaaS tools, and APIs using Fivetran, ensuring schema evolution without disruption. Implemented Databricks SQL notebooks and PySpark jobs for large-scale transformation, exploratory analysis, and validation tasks, orchestrated through Airflow. Introduced data quality checks and anomaly detection systems, utilizing dbt tests and Python utilities for improved pipeline issue detection. Enhanced Snowflake performance and cost efficiencies via clustering and query refactoring. Developed near-real-time ingestion pipelines with Kafka, significantly decreasing data latency. Supported Reverse ETL workflows by preparing datasets for customer-facing and marketing systems, maintaining metric consistency. Packaged data utilities using Docker, collaborating with platform teams for Kubernetes deployment. Worked alongside platform and DevOps teams in managing data infrastructure utilizing Terraform and AWS. Engaged in cross-cloud analytics validation across AWS and Azure environments to maintain dataset accuracy. Provided mentorship to junior data engineers, conducted code reviews, and collaborated with stakeholders to create dimensional models for analysis.
  • Senior Data Engineer

    Uber · 2017 — 2020
    Designed scalable batch and streaming data pipelines to ingest and process high-volume event data using Kafka and Spark, facilitating near-real-time analytics for the marketplace. Developed and refined large-scale dimensional models in Hive and Presto, enabling effective experimentation and reporting. Conducted data profiling and schema validation to manage event changes and reduce data discrepancies. Collaborated with data scientists and product teams on projects involving pricing and marketplace health metrics, while enhancing pipeline reliability through various optimization techniques.
  • Data Engineer

    General Mills · 2015 — 2017
    Designed and managed enterprise ETL pipelines with SSIS to integrate data across manufacturing, finance, and supply-chain sectors from Oracle and SQL Server. Developed and optimized star and snowflake schemas to support analytics and reporting needs. Handled schema updates and source-system changes while minimizing disruption to reporting. Automated data validation and reconciliation processes using SQL and Python, improving data quality and reducing manual efforts. Worked closely with business stakeholders to create well-documented data models that aligned with reporting requirements.
  • Data Analyst

    General Mills · 2015 — 2015
    Created SQL-based analytical datasets and reports for finance and operations teams. Conducted ad-hoc analyses to identify data quality issues and supported the development of dashboards and executive reports. Collaborated with engineers to clarify metric definitions and enhance reporting logic, laying the groundwork for future data engineering responsibilities.

Skills & Expertise

Education

  • Master of Science, Applied Mathematics
    University of Lynchburg · 2013 — 2014
  • Bachelor of Science, Mathematics
    University of Lynchburg · 2009 — 2013