0 viewsjobseeker
Shardul A. — Senior Data Engineer from India

Shardul A.

Senior Data Engineer

India 3-6 years
Open to offersNew to Platform
Languages
English
Video Introduction
No video introduction yet
The candidate has not added a video.
Contact information and social networks are private. Connect to unlock.
Hidden

About

Shardul S. is a skilled Data Engineer with over five years of experience in architecting and optimizing data platforms within the healthcare and fintech sectors. At Calibo, he developed a Snowflake data platform that optimized processing for over 500,000 daily transactions, reducing computing costs by 35%. His work with Airbnb involved maintaining critical Airflow ETL pipelines and enhancing distributed payment data processing systems on AWS, achieving significant performance improvements and supporting over $10 billion in annual payment volume. Shardul's expertise in Snowflake, Spark, and Airflow, coupled with his ability to tie data infrastructure to tangible business results, has made him an asset in modern data engineering. Educated at Indiana University Bloomington with a Master of Science in Data Science, Shardul continues to deliver impactful data solutions, streamlining processes and enhancing data-driven decision making across industries.

Experience

  • Data Engineer

    ALTIMETRIK (Client: Calibo) · 2024 — 2026
    Architected a comprehensive Snowflake data platform that handled over 500K daily transactions from more than 10 source tables. Implemented external S3 stages, COPY commands, clustering keys, time-based partitioning, and materialized views to facilitate real-time sales analytics while achieving a 35% reduction in compute costs. Developed a recommendation engine utilizing Python and AWS services to analyze customer purchasing behavior across 10k+ SKUs, effectively generating personalized product suggestions. Enhanced the recommendation engine with a GenAI-powered explanation module that translated product suggestions into natural language for sales representatives. Executed dbt gold layer transformations and established a production star schema fact_deviation with 4 dimension tables, ensuring standardized naming conventions and data quality checks for over 200K deviation records. Collaborated with pharma quality teams and data scientists to build a data pipeline for pharma deviation analysis, integrating UI-triggered REST APIs with backend Snowflake processing, thus enabling timely time series analysis across 200K records over a 2-year span. Streamlined time-series deviation data from 3 source systems into Snowflake, reducing data preparation time by 60% for downstream modeling and reporting.
  • Data Engineer

    ALTIMETRIK (Client: Airbnb) · 2023 — 2024
    Maintained and improved critical Airflow ETL data pipelines that processed over 50M daily payment transactions using various payment methods, supporting substantial annual payment volume. Designed and deployed production-grade Scala applications for payment data processing across distributed Spark clusters implemented on AWS, managing over 10TB of data daily. Enhanced Spark job performance by optimizing executor memory allocation, shuffle partitions, and serialization techniques, resulting in a 35% improvement in job completion times. Executed SQL performance tuning strategies to reduce query execution time by 45%, including partition pruning and index optimization for billions of rows in distributed data warehouses. Developed a centralized logging and alerting infrastructure for over 20 production pipelines, significantly reducing pipeline failure detection time from 2 hours to 15 minutes and enhancing incident response capabilities.
  • Data Engineer Intern

    AUTIRE TECHNOLOGIES · 2022 — 2022
    Created and managed ETL pipelines that transferred data from AWS S3 to a data warehouse, integrating more than 5 diverse data sources including REST APIs and financial databases. Developed ETL processes to calculate and aggregate financial metrics, achieving a reduction in manual calculation time by 55%. Implemented event-driven automation on AWS, which led to a 20% decrease in monthly infrastructure costs and a 15% enhancement in pipeline response times.
  • Associate Data Engineer

    L & T INFOTECH (Client: Citibank) · 2018 — 2021
    Developed a machine learning-driven churn prediction pipeline that integrated CRM systems with external API data, engineering customer behavioral features based on transaction history and product usage patterns for downstream model training. Maintained alignment with project objectives by updating stakeholders on project status through effective written and verbal communication and technical documentation while delivering solutions within an Agile framework.

Skills & Expertise

Education

  • Master of Science in Data Science
    INDIANA UNIVERSITY BLOOMINGTON · 2021 — 2023
  • Bachelor's in Electronics & Telecommunications
    PUNE INSTITUTE OF COMPUTER TECHNOLOGY · 2014 — 2018