Resume Template 1
TECHNICAL SKILLS
WORK EXPERIENCE
➢ One Checkout Platform – Real-time streaming application for financial data
Tech Stack – Flink, Python, Kafka, Oracle, Kafka Connect, Docker, AWS, Airflow
● Building one checkout platform for different business domains of Expedia, i.e., Vrbo, Hotels, Car Rentals, and Lodging
● The real-time streaming platform captures and processes financial data for Accounting and helps suppliers track the total amount to
be paid via the Auto Pay or Request Pay model
● Crafted a scalable streaming solution using Apache Flink & Remote functions to handle ~800k booking streams each day
● Crafted a generic, scalable, native AWS solution for Salesforce-to-Redshift ingestion
● Moved ingestion pipelines off the third-party tool Informatica, saving the cost of heavy license fees
● The generic framework helped other business units smoothly ingest newly onboarded Salesforce objects into the Redshift
data lake
● Built a generic, optimized ingestion pipeline for highly critical and confidential Employee Benefits data
● Designed the pipeline to handle gigabytes of daily and weekly data together for use cases such as Audit, Payroll,
Reimbursement, and Education Reimbursement
● Took complete ownership and worked closely with business teams to understand requirements and deliver enriched dashboards
● Enhanced and optimized multiple pipelines built for different business units such as PeopleSoft, Audit, AEM, Immigration,
Accurate, MyDocs, and Background Verification
● Reduced execution time by 50% and improved the alerting system for different edge cases
● Leadership principles such as Customer Obsession, Earn Trust, and Think Big helped me keep improving existing systems
● Crafted data-sync logic that prioritizes datasets (High/Medium/Low tag) by criticality to meet SLOs
● Built preemption logic to prioritize highly critical datasets when multiple low-priority sync processes are running
● Designed a REST API in data ingestion for retention of GA data to optimize cluster space
● Created a new pipeline to ingest missing data from HDFS into Elasticsearch in case of cluster failure
● Crafted a Cassandra-based real-time ingestion pipeline for marketplace data to help the DWH team reduce request
load on production MySQL. The objective was to move business users off production to prevent data leaks and security issues
● Worked with different business users to understand their use cases, ingestion tables, and PII data, and built data models accordingly
for faster inserts and updates
● Set up the DataStax Studio web interface for users to query real-time data from Cassandra using LDAP authentication
● Deployed an end-to-end solution for a leading US airline; aggregated a 360-degree view of customer engagement throughout
the life cycle of the trip
● Developed data pipelines from scratch; optimized data aggregation from 10+ independent sources and automated the
ETL process to roll out the solution
● The solution powers a web application used by 1,000+ CSRs and decision makers
● Built the application on RESTful APIs using the Hadoop ecosystem (HDFS, YARN), DataRush applications (distributed
processing engine), SQL, and Python
EDUCATION
RECENT ACHIEVEMENTS
➢ Opera Ovation Award: for exceptional work on a Trip Narrative Project, 2017
➢ Conducted a firm-wide, global session for 200+ employees on SHM, covering workflow design, execution, and best
practices
➢ Geek Of The Month: GeeksForGeeks, for contributions to technical content, 2016