0 viewsjobseeker
baanu S. — Senior Data Engineer from United States

baanu S.

Senior Data Engineer

United States 3-6 years
Open to offersNew to Platform
Languages
English
Video Introduction
No video introduction yet
The candidate has not added a video.
Contact information and social networks are private. Connect to unlock.
Hidden

About

Baanu S. is a seasoned Data Engineer with over five years of experience specializing in analytics and pipeline development for retail, e-commerce, and pharmaceutical industries. He has a robust background in building scalable ELT pipelines using Python, PySpark, and dbt on AWS and Snowflake, which have significantly improved data models to drive revenue, enhance accuracy, and decrease defects. At Real Value Products, Baanu developed end-to-end ELT pipelines and retail analytics data warehouses, effectively increasing data accuracy by 20% and reducing SQL report query time. His work at Texas State University streamlined data extraction processes and improved dashboard interactivity, enhancing real-time KPI tracking. During his tenure at Amazon, Baanu engineered automated dashboards and ETL pipelines, resulting in increased anomaly detection and a 7.4% lift in Buy Box win rate, recovering $1.2M GMV. He holds a master's degree in Data Analytics and Information Systems from Texas State University and is certified as an AWS Certified Solutions Architect.

Experience

  • Data Engineer

    Real Value Products – Value RX · 2024 — Present
    Constructed ELT pipelines using Python and PySpark for data ingestion from vendor APIs and web-scraped sources into AWS S3. Employed Bronze/Silver/Gold medallion architecture featuring Spark SQL transformations to enhance data accuracy. Modeled a retail analytics data warehouse in dbt on Snowflake with star/snowflake schemas, ensuring data quality through incremental materializations and tests. Enhanced SQL Server stored procedures and batch loads using Snowflake’s zero-copy cloning, significantly reducing report query times. Created Power BI executive dashboards with custom DAX measures for revenue and SKU-level forecasting across major e-commerce platforms.
  • Data Engineering Assistant

    Texas State University · 2022 — 2024
    Developed SQL/Python pipelines for data extraction and transformation aimed at Banner/Canvas and campus surveys, streamlining weekly report preparation time through automation. Created interactive dashboards in Tableau and Power BI to facilitate tracking of KPIs related to enrollment and course performance for deans and advisors. Performed statistical analyses, including A/B tests and regression, to assess tutoring programs and identify key factors influencing GPA improvements. Established data-quality checks and constructed a data dictionary to standardize metrics, enhancing report accuracy.
  • Business Intelligence Analyst (Data Engineering)

    Amazon · 2020 — 2022
    Developed an automated vendor health dashboard leveraging Athena and QuickSight that consolidated various data points, significantly cutting manual reporting time. Engineered Python ETL pipelines utilizing Lambda and Glue to standardize data feeds from over 200 vendors, which led to improved data freshness. Designed SQL models in Redshift to monitor factors impacting Buy Box status, facilitating strategic interventions. Conducted root-cause analyses for manufacturing defects and implemented changes that reduced chargeback instances. Created an alerting pipeline using CloudWatch and SNS to monitor pricing compliance, drastically improving detection times.

Skills & Expertise

Education

  • Master’s in Data Analytics and Information Systems
    Texas State University · — — 2024