Karthik S
469-663-0198
Siram943@gmail.com
Data Engineer
Career Summary: Data Engineer with 8 years of experience building data-intensive applications and
creating pipelines using Python and shell scripting, with extensive knowledge of Amazon Web Services
(AWS). Experienced in Data Extraction, Transformation, and Loading (ETL) and in building data
warehouses using Star and Snowflake schemas. Well versed in Scrum methodologies.
Professional Summary:
● 8+ years of experience in software development which includes Design and Development of Enterprise
and Web-based applications.
● Hands-on technical experience in Python, Java, Q++ (Mastercraft), DB2 SQL, and R programming, with
primary exposure to the P&C Insurance domain.
● Experience with Amazon Web Services (Amazon EC2, Amazon S3, Amazon RDS, Elastic Load
Balancing, Amazon SQS, AWS Identity and Access Management, Amazon SNS, Amazon CloudWatch,
Amazon EBS, Amazon CloudFront, VPC, DynamoDB, Lambda, and Redshift).
● Experience using Python IDEs such as PyCharm, Sublime Text, and IDLE.
● Experience in developing web applications and implementing the Model-View-Controller (MVC)
architecture using the server-side frameworks Django and Flask.
● Working knowledge of Kubernetes to deploy, scale, load balance, and manage Docker containers.
● Good knowledge of Data Extraction, Transformation, and Loading (ETL) using tools such as SQL
Server Integration Services (SSIS) and Data Transformation Services (DTS).
● Experience in Database Design and development with Business Intelligence using SQL Server
Integration Services (SSIS), SQL Server Analysis Services (SSAS), OLAP Cubes, Star Schema and
Snowflake Schema.
● Data ingestion to Azure services and processing of the data in Azure Databricks.
● Creating and enhancing CI/CD pipeline to ensure Business Analysts can build, test, and deploy quickly.
● Building Data Warehouse using Star and Snowflake schemas.
● Extensive knowledge of Exploratory Data Analysis, Big Data analytics using Spark, and predictive
analysis using Linear and Logistic Regression models, with a good understanding of supervised and
unsupervised algorithms.
● Worked on different statistical techniques like Linear/Logistic Regression, Correlational Tests, ANOVA,
Chi-Square Analysis, K-means Clustering.
● Hands-on experience visualizing data using Power BI, Tableau, R (ggplot), and Python (Pandas,
matplotlib, NumPy, SciPy); see the illustrative sketch at the end of this summary.
● Integrating Azure Databricks with Power BI and creating dashboards.
● Good knowledge of writing Data Analysis Expressions (DAX) in Tabular data models.
● Hands-on knowledge of designing database schemas and achieving normalization.
● Proficient in all phases of Software Development Life Cycle (SDLC) including Requirements gathering,
Analysis, Design, Reviews, Coding, Unit Testing, and Integration Testing.
● Well versed with Scrum methodologies.
● Analyzed the requirements and developed Use Cases, UML Diagrams, Class Diagrams, Sequence and
State Machine Diagrams.
● Excellent communication and interpersonal skills with ability in resolving complex business problems.
● Direct interaction with client and business users across different locations for critical issues.
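Illustrative sketch (referenced above): a minimal exploratory data analysis and visualization example in Python with Pandas and matplotlib, assuming a hypothetical structured data source; the file name and column names (claims.csv, claim_amount) are placeholders, not details from any listed engagement.

    # Minimal EDA sketch; the input file and column names are hypothetical.
    import pandas as pd
    import matplotlib.pyplot as plt

    df = pd.read_csv("claims.csv")   # hypothetical structured data source
    print(df.describe())             # summary statistics for numeric columns
    print(df.isna().sum())           # missing-value counts used during data profiling

    # Distribution plot of a hypothetical numeric column.
    df["claim_amount"].plot(kind="hist", bins=30, title="Claim amount distribution")
    plt.xlabel("claim_amount")
    plt.tight_layout()
    plt.savefig("claim_amount_hist.png")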
Technical Skills:
AWS Platform: EC2, S3, EMR, Redshift, DynamoDB, Aurora, VPC, Glue, Kinesis, Boto3
Databases: Netezza, MySQL, UDB, HBase, MongoDB, Cassandra, Snowflake, NoSQL, SQL Server
Educational Details:
● Bachelor’s in Computer Science Engineering (CSE) | JNTUH | 2010 – 2014.
Work Experience:
Client: Microsoft
Location: Virginia
Role: Data Engineer
Projects: Data Reconciliation, SSIS, Data Quality, Data Analysis, Root Cause Analysis, Databricks, PySpark
Jan 2020 - Present
Responsibilities:
● Worked as a Sr. Data Engineer with Big Data and Hadoop ecosystem components.
● Involved in converting Hive/SQL queries into Spark transformations using Scala.
● Created Spark data frames using Spark SQL and prepared data for analytics by storing it in AWS S3.
● Responsible for loading data from Kafka into HBase using REST API.
● Developed the batch scripts to fetch the data from AWS S3 storage and perform required
transformations in Scala using Spark framework.
● Used Spark Streaming APIs to perform on-the-fly transformations and actions for building a common
learner data model that receives data from Kafka in near real time and persists it to HBase (see the
streaming sketch after this project).
● Created Sqoop scripts to import and export customer profile data from RDBMS to S3 buckets.
● Developed various enrichment applications in Spark using Scala for cleansing and enrichment of
clickstream data with customer profile lookups.
● Troubleshooting Spark applications for improved error tolerance and reliability.
● Used Spark DataFrames and Spark APIs to implement batch processing of jobs.
● Used Apache Kafka and Spark Streaming to ingest data from Adobe live stream REST API connections.
● Automated creation and termination of AWS EMR clusters (see the Boto3 sketch after this project).
● Worked on fine-tuning and performance enhancements of various Spark applications and Hive scripts.
● Good experience with replication tools such as Hevo Data, Rubrik, Carbonite Availability, SharePlex,
NetApp SnapMirror, Fivetran, and IBM Spectrum Protect.
● Used Spark concepts such as broadcast variables, caching, and dynamic allocation to design more
scalable Spark applications.
● Identified source systems, their connectivity, related tables, and fields; ensured data suitability for
mapping; prepared unit test cases; and supported the testing team in fixing defects.
● Defined HBase tables to store various data formats of incoming data from different portfolios.
● Developed the verification and control process for daily data loads.
● Involved in daily production support to monitor and troubleshoot Hive and Spark jobs.
Environment: AWS EMR, S3, Spark, Hive, Sqoop, Scala, MySQL, Oracle DB, Athena, Redshift
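Illustrative sketch of the streaming pattern referenced above: a PySpark Structured Streaming job that reads events from Kafka and persists them downstream. The project's actual sink was HBase; a Parquet sink on S3 is used here as a stand-in, and the broker, topic, schema, and bucket names are hypothetical. The Spark Kafka connector package is assumed to be on the cluster classpath.

    # Hedged sketch: Kafka source -> near-real-time parse -> Parquet sink on S3.
    from pyspark.sql import SparkSession
    from pyspark.sql.functions import col, from_json
    from pyspark.sql.types import StructType, StringType, TimestampType

    spark = SparkSession.builder.appName("learner-stream").getOrCreate()

    # Hypothetical event schema for the common learner data model.
    schema = (StructType()
              .add("learner_id", StringType())
              .add("event", StringType())
              .add("event_time", TimestampType()))

    raw = (spark.readStream
           .format("kafka")
           .option("kafka.bootstrap.servers", "broker:9092")   # hypothetical broker
           .option("subscribe", "learner-events")              # hypothetical topic
           .load())

    events = (raw.select(from_json(col("value").cast("string"), schema).alias("e"))
                 .select("e.*"))

    query = (events.writeStream
             .format("parquet")
             .option("path", "s3a://example-bucket/learner-events/")          # hypothetical bucket
             .option("checkpointLocation", "s3a://example-bucket/checkpoints/")
             .start())
    query.awaitTermination()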
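Illustrative sketch of the EMR automation referenced above, using Boto3; the release label, instance types and counts, and IAM role names are hypothetical defaults rather than values from the actual project.

    # Hedged sketch: create and terminate a transient EMR cluster with Boto3.
    import boto3

    emr = boto3.client("emr", region_name="us-east-1")   # hypothetical region

    def create_cluster(name: str) -> str:
        """Launch a small Spark/Hive cluster and return its cluster id."""
        response = emr.run_job_flow(
            Name=name,
            ReleaseLabel="emr-6.9.0",                      # hypothetical release
            Applications=[{"Name": "Spark"}, {"Name": "Hive"}],
            Instances={
                "InstanceGroups": [
                    {"InstanceRole": "MASTER", "InstanceType": "m5.xlarge", "InstanceCount": 1},
                    {"InstanceRole": "CORE", "InstanceType": "m5.xlarge", "InstanceCount": 2},
                ],
                "KeepJobFlowAliveWhenNoSteps": True,
                "TerminationProtected": False,
            },
            JobFlowRole="EMR_EC2_DefaultRole",             # hypothetical role names
            ServiceRole="EMR_DefaultRole",
        )
        return response["JobFlowId"]

    def terminate_cluster(cluster_id: str) -> None:
        """Shut the cluster down once the pipeline run has finished."""
        emr.terminate_job_flows(JobFlowIds=[cluster_id])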
Responsibilities:
● Extensively worked with Sqoop to migrate data from RDBMS to HDFS.
● Ingested data from various source systems like Teradata, MySQL, Oracle databases.
● Developed Spark applications to perform Extract, Transform, and Load using Spark RDDs and DataFrames (see the batch ETL sketch after this project).
● Created Hive external tables on top of data from HDFS and wrote ad-hoc hive queries to analyze the
data based on business requirements.
● Utilized partitioning and bucketing in Hive to improve query processing times (see the Hive table sketch after this project).
● Performed incremental data ingestion using Sqoop, as the source application generates data on a
daily basis.
● Migrated/reimplemented MapReduce jobs as Spark applications for better performance.
● Handled data in different file formats like Avro and Parquet.
● Extensively used Cloudera Hadoop distributions within the project.
● Used GIT for maintaining/versioning the code.
● Created Oozie workflows to automate the data pipelines.
● Involved in a fully automated CI/CD pipeline process using GitHub and Jenkins.
● Used Cloudera Manager for installation and management of Hadoop Cluster.
● Exported data from the HDFS environment into RDBMS using Sqoop for report generation and
visualization purposes.
● Involved in moving all log files generated from various sources to HDFS for further processing through
Flume.
● Involved in creating Hive tables, loading them with data, and writing Hive queries that invoke
MapReduce jobs in the backend. Experienced in handling large datasets using partitions, Spark
in-memory capabilities, broadcast variables in Spark, and effective, efficient joins.
● Worked on the design and deployment of Hadoop clusters and various big data analytic tools, including
Pig, Hive, Oozie, Zookeeper, Sqoop, Flume, Impala, and Cassandra, with the Hortonworks distribution.
● Utilized the Apache Hadoop environment from Cloudera; monitored and debugged Spark jobs running
on a Spark cluster using Cloudera Manager.
● Good knowledge of DMS for maintaining data assets in single storage containers.
● Wrote Hive SQL queries for ad-hoc data analysis to meet business requirements.
● Delivered unit test plans; involved in unit testing and documentation.
Environment: Cloudera (CDH 5.x), Spark, Scala, Sqoop, Oozie, Hive, HDFS, MySQL, Oracle DB, Teradata,
Linux, Shell Scripting.
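Illustrative sketch of the batch Spark ETL referenced above: extract from an RDBMS over JDBC, transform with the DataFrame API, and load Parquet to HDFS. The JDBC URL, table, credentials, and column names are hypothetical, and the MySQL JDBC driver is assumed to be on the classpath.

    # Hedged sketch: JDBC extract -> DataFrame transform -> Parquet load on HDFS.
    from pyspark.sql import SparkSession
    from pyspark.sql.functions import col, trim

    spark = SparkSession.builder.appName("batch-etl").getOrCreate()

    # Extract: read a source table over JDBC (hypothetical connection details).
    customers = (spark.read.format("jdbc")
                 .option("url", "jdbc:mysql://db-host:3306/sales")
                 .option("dbtable", "customers")
                 .option("user", "etl_user")
                 .option("password", "***")
                 .load())

    # Transform: basic cleansing and filtering on hypothetical columns.
    cleaned = (customers
               .withColumn("email", trim(col("email")))
               .filter(col("status") == "ACTIVE"))

    # Load: write partitioned Parquet to HDFS for downstream Hive tables.
    cleaned.write.mode("overwrite").partitionBy("country").parquet("hdfs:///data/sales/customers")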
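Illustrative sketch of the Hive table design referenced above, run through Spark SQL with Hive support: an external table over HDFS data partitioned by load date, followed by a partition-pruned ad-hoc query. The database, location, and column names are hypothetical; bucketing would be declared similarly with a CLUSTERED BY clause in the Hive DDL and is omitted here.

    # Hedged sketch: partitioned external Hive table plus an ad-hoc query.
    from pyspark.sql import SparkSession

    spark = (SparkSession.builder
             .appName("hive-tables")
             .enableHiveSupport()
             .getOrCreate())

    spark.sql("""
        CREATE EXTERNAL TABLE IF NOT EXISTS sales_db.orders (
            order_id     STRING,
            customer_id  STRING,
            amount       DOUBLE
        )
        PARTITIONED BY (load_date STRING)
        STORED AS PARQUET
        LOCATION 'hdfs:///data/sales/orders'   -- hypothetical HDFS path
    """)

    # Register newly landed partitions, then run a partition-pruned query.
    spark.sql("MSCK REPAIR TABLE sales_db.orders")
    daily_totals = spark.sql("""
        SELECT customer_id, SUM(amount) AS total_amount
        FROM sales_db.orders
        WHERE load_date = '2019-06-01'
        GROUP BY customer_id
    """)
    daily_totals.show()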
Responsibilities:
● Creating web-based applications using Python on Django framework for data processing.
● Implementing the preprocessing procedures along with deployment using the AWS services and
creating virtual machines using EC2.
● Good knowledge of exploratory data analysis; performed data wrangling and data visualization.
● Validating the data to check for proper conversion, identifying and cleaning unwanted data, and
performing data profiling for accuracy, completeness, and consistency.
● Preparing standard reports, charts, graphs, and tables from a structured data source by querying data
repositories using Python and SQL.
● Developed and produced dashboards and key performance indicators to monitor organizational
performance.
● Define data needs, evaluate data quality, and extract/transform data for analytic projects and
research.
● Used the Django framework for application development. Designed and maintained databases using
Python and developed a Python-based RESTful API web service using Flask, SQLAlchemy, and
PostgreSQL (see the Flask sketch after this project).
● Worked on server-side applications using Python programming.
● Performed efficient delivery of code and continuous integration to keep in line with Agile principles.
● Experience with Agile methodologies, Scrum stories, and sprints in a Python-based environment.
● Importing and exporting data between different data sources using SQL Server Management Studio.
● Maintaining program libraries, user manuals, and technical documentation.
Environment: Python, Django, RESTful web services, MySQL, PostgreSQL, Visio, SQL Server Management
Studio, AWS.
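Illustrative sketch of the Python REST service pattern referenced above, using Flask backed by PostgreSQL; the Flask-SQLAlchemy extension is an assumption on top of the stated Flask/SQLAlchemy stack, and the connection string, model, and routes are hypothetical examples.

    # Hedged sketch: a small RESTful web service with Flask + Flask-SQLAlchemy.
    from flask import Flask, jsonify, request
    from flask_sqlalchemy import SQLAlchemy

    app = Flask(__name__)
    # Hypothetical PostgreSQL connection string.
    app.config["SQLALCHEMY_DATABASE_URI"] = "postgresql://user:pass@localhost/reports"
    db = SQLAlchemy(app)

    class Report(db.Model):
        id = db.Column(db.Integer, primary_key=True)
        name = db.Column(db.String(120), nullable=False)

    @app.route("/reports", methods=["GET"])
    def list_reports():
        # Return all stored reports as JSON.
        return jsonify([{"id": r.id, "name": r.name} for r in Report.query.all()])

    @app.route("/reports", methods=["POST"])
    def create_report():
        # Persist a new report from the JSON request body.
        payload = request.get_json()
        report = Report(name=payload["name"])
        db.session.add(report)
        db.session.commit()
        return jsonify({"id": report.id, "name": report.name}), 201

    if __name__ == "__main__":
        with app.app_context():
            db.create_all()
        app.run(debug=True)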