
Anil Korrapati

[email protected]
Frisco, TX 75035, USA
Phone: (832)-217-5958

SUMMARY

● 9+ years of professional IT industry experience encompassing a wide range of skills in Big Data and Java/J2EE technologies.
● Experience working with Big Data technologies on systems that hold massive amounts of data running in highly distributed mode on the Cloudera and Hortonworks Hadoop distributions.
● Hands-on experience with Hadoop and its ecosystem components, including Hive, Pig, Sqoop, HBase, Cassandra, Spark, Spark Streaming, Spark SQL, Oozie, ZooKeeper, Kafka, Flink, MapReduce and YARN.
● Strong knowledge of Spark's architecture and components; efficient in working with Spark Core, Spark SQL and Spark Streaming.
● Implemented Spark Streaming jobs by developing RDDs (Resilient Distributed Datasets), using PySpark and spark-shell as appropriate.
● Experience configuring Spark Streaming to receive real-time data from Apache Kafka and store the streamed data to HDFS using Scala (a sketch follows this list).
● Wrote complex HiveQL queries to extract the required data from Hive tables and developed Hive UDFs as needed.
● Solid experience with partitioning and bucketing concepts in Hive; designed both managed and external Hive tables to optimize performance (see the table sketch after this list).
● Excellent understanding and knowledge of job workflow scheduling and coordination/locking services such as Oozie and ZooKeeper.
● Knowledge of ETL methods for data extraction, transformation and loading in enterprise-wide ETL solutions, and of data warehouse tools for reporting and data analysis.
● Experience importing and exporting data between HDFS and relational database systems using Sqoop.
● Good knowledge of UNIX shell scripting for automating deployments and other routine tasks.
● Experience in relational databases like Oracle, Teradata, MySQL and SQL
Server.
● Used project management tools such as JIRA to track issues and code-related bugs, GitHub for code reviews, and version control tools such as Git and SVN.
● Extensive knowledge of Microsoft SQL Server Management Studio, with experience in the setup, configuration, analysis, design, development and deployment of the MS SQL Server suite of products, including Business Intelligence with SQL Server Reporting Services 2005/2008, SQL Server Analysis Services 2005/2008 and SQL Server Integration Services.
● Worked on projects involving hardware and software testing of client/server applications and web services, including functional testing, performance testing and GUI testing with the Monkey tool, in hospital and healthcare administration.
● Gained broad domain knowledge in Wi-Fi, Bluetooth, peer-to-peer communications, video processing, financial services, payroll management, and medical billing and revenue management services.
● Experienced in working within the SDLC using Agile and Waterfall methodologies.
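
Below is a minimal sketch (not taken from the projects listed here) of the Kafka-to-HDFS flow described above, written as it might be run in spark-shell with Spark Structured Streaming; the broker address, topic name and HDFS paths are placeholder assumptions, and the spark-sql-kafka connector is assumed to be on the classpath.

    // Read a Kafka topic as a streaming DataFrame (broker and topic are placeholders).
    val events = spark.readStream
      .format("kafka")
      .option("kafka.bootstrap.servers", "broker1:9092")
      .option("subscribe", "events")
      .load()
      .selectExpr("CAST(key AS STRING) AS key", "CAST(value AS STRING) AS value")

    // Append the stream to HDFS as Parquet; the checkpoint directory enables recovery.
    val query = events.writeStream
      .format("parquet")
      .option("path", "hdfs:///data/streams/events")
      .option("checkpointLocation", "hdfs:///checkpoints/events")
      .start()

    query.awaitTermination()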
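
And a sketch of the Hive partitioning and bucketing pattern mentioned above, issued as HiveQL through a Hive-enabled Spark session; the table names, columns and HDFS location are illustrative assumptions only.

    // External table: dropping it leaves the underlying HDFS data in place.
    spark.sql("""
      CREATE EXTERNAL TABLE IF NOT EXISTS sales_ext (order_id BIGINT, amount DOUBLE)
      PARTITIONED BY (order_date STRING)
      STORED AS PARQUET
      LOCATION 'hdfs:///warehouse/sales_ext'""")

    // Managed table with bucketing: rows are clustered by customer_id into 16 buckets.
    spark.sql("""
      CREATE TABLE IF NOT EXISTS sales_managed (order_id BIGINT, customer_id BIGINT, amount DOUBLE)
      PARTITIONED BY (order_date STRING)
      CLUSTERED BY (customer_id) INTO 16 BUCKETS
      STORED AS ORC""")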


Ahead INC Mar 2022 - Present


Richardson, TX

Responsibilities

● Performed SQL performance analysis on-prem.
● Worked with Migvisor to analyze SQL queries for migration to PostgreSQL and validated quality after conversion.
● Used the Sterm tool to migrate data from on-prem to GCP.
● Worked with DVT to perform both manual and automated data testing.
● Used the Spark connector to connect to Snowflake and extract data, saving it to an S3 bucket (see the sketch after this list).
● Used the Snowflake JDBC connector to create temporary tables from Java and Scala (a Scala sketch follows this list).
● Analyzed the existing Teradata data extraction project and converted it into a Snowflake data extraction project using Java and Scala.
● Created a shell script to load historical data residing in S3 buckets into Snowflake.
● Worked on loading data from AWS S3 into Snowflake tables.
● Wrote SQL that inserts data from one Snowflake table into another.
● Used SnowSQL to load data into Snowflake.
● Involved in data migration from on-prem to the Azure cloud.
● Extracted, transformed and loaded data from source systems to Azure data storage services using a combination of Azure Data Factory, T-SQL, Spark SQL and U-SQL (Azure Data Lake Analytics); ingested data into one or more Azure services (Azure Data Lake, Azure Storage, Azure SQL, Azure SQL Data Warehouse) and processed the data in Azure Databricks.
● Created pipelines in ADF using linked services, datasets and pipelines to extract, transform and load data to and from sources such as Azure SQL, Blob Storage, Azure SQL Data Warehouse and a write-back tool.
● Converted ORC data into Parquet format using Databricks and ADF.
● Converted on-prem Spark code into Databricks notebooks.
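
A rough sketch of the Snowflake-to-S3 extraction described above, using the Spark-Snowflake connector as it might be run in spark-shell; the account URL, credentials, warehouse, table and bucket names are placeholders, not values from the project.

    // Connection options for the Spark-Snowflake connector (all values are placeholders).
    val sfOptions = Map(
      "sfURL"       -> "myaccount.snowflakecomputing.com",
      "sfUser"      -> "etl_user",
      "sfPassword"  -> sys.env.getOrElse("SNOWFLAKE_PASSWORD", ""),
      "sfDatabase"  -> "ANALYTICS",
      "sfSchema"    -> "PUBLIC",
      "sfWarehouse" -> "ETL_WH")

    // Read a Snowflake table into a DataFrame, then persist the extract to S3 as Parquet.
    val orders = spark.read
      .format("net.snowflake.spark.snowflake")
      .options(sfOptions)
      .option("dbtable", "ORDERS")
      .load()

    orders.write.mode("overwrite").parquet("s3a://my-export-bucket/orders/")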
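
Similarly, a minimal Scala sketch of creating a Snowflake temporary table through the Snowflake JDBC driver; the connection URL, credentials and table definitions are assumptions for illustration.

    import java.sql.DriverManager
    import java.util.Properties

    object SnowflakeTempTableSketch {
      def main(args: Array[String]): Unit = {
        // Placeholder connection details for the Snowflake JDBC driver.
        val props = new Properties()
        props.put("user", "etl_user")
        props.put("password", sys.env.getOrElse("SNOWFLAKE_PASSWORD", ""))
        props.put("db", "ANALYTICS")
        props.put("schema", "PUBLIC")
        props.put("warehouse", "ETL_WH")

        val conn = DriverManager.getConnection("jdbc:snowflake://myaccount.snowflakecomputing.com", props)
        try {
          val stmt = conn.createStatement()
          // A temporary table exists only for the lifetime of this session.
          stmt.executeUpdate("CREATE TEMPORARY TABLE stage_orders (order_id NUMBER, amount DOUBLE)")
          stmt.executeUpdate("INSERT INTO stage_orders SELECT order_id, amount FROM raw_orders")
          stmt.close()
        } finally {
          conn.close()
        }
      }
    }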

AT&T Jan 2019 - Mar 2022


West Plano Parkway, TX

Responsibilities

● Performed analysis and presented results using SQL, SSIS, MS Access, Excel,
and Visual Basic scripts.
● Implemented data ingestion using Spark, loading data from various CSV, Parquet and XML files (see the ingestion sketch after this list).
● Handled data cleansing and transformation tasks using Spark with Scala and Hive.
● Implemented data consolidation using Spark and Scala to generate data in the required formats, applying various UDFs for data repair, massaging, cleansing and filtering before storing the results back to HDFS.
● Responsible for the design and development of Spark Scala scripts based on functional specifications.
● Explored Spark features to improve the performance and optimization of the existing Scala scripts.
● Used the Spark DataFrames API on the Cloudera platform to perform analytics on Hive data.
● Used Spark DataFrame operations to perform the required validations on the data.
● Good understanding and knowledge of NoSQL databases such as MongoDB, HBase and Cassandra.
● Involved in converting Hive/SQL queries into Spark DataFrame operations using Scala.
● Responsible for job management using Jenkins.
● Responsible for performance tuning of Spark applications: setting the right batch interval, the correct level of parallelism and appropriate memory settings (see the tuning sketch after this list).
● Responsible for handling large datasets using partitioning, Spark's in-memory capabilities, broadcasts in Spark, and effective and efficient joins and transformations during the ingestion process itself.
● Imported and exported data into HDFS and Hive using Sqoop.
● Involved in creating Hive tables, loading them with data and writing Hive queries that invoke and run MapReduce jobs in the backend.
● Worked on MongoDB, creating collections to load large sets of semi-structured data coming from various sources.
● Worked with different file formats such as Text, Sequence files, Avro, ORC and
Parquet.
● Responsible for managing data coming from different sources.
● Responsible for loading and transforming large sets of structured, semi-structured and unstructured data.
● Analyzed large data sets to determine the optimal way to aggregate and report on them.
● Implemented Flink joins on streaming DataStreams and enabled checkpointing on the streaming services (see the Flink sketch after this list).
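
A condensed sketch of the ingestion and cleansing flow above, as it might be run in spark-shell: read raw CSV, apply a simple repair UDF, and write the result back to HDFS. The column names, repair rule and paths are illustrative assumptions rather than the actual pipeline.

    import org.apache.spark.sql.functions.{col, udf}

    // Example repair UDF: trim whitespace and normalize empty strings to null.
    val repair = udf { (s: String) => Option(s).map(_.trim).filter(_.nonEmpty).orNull }

    val raw = spark.read.option("header", "true").csv("hdfs:///landing/customers/*.csv")

    val cleansed = raw
      .withColumn("name", repair(col("name")))
      .filter(col("customer_id").isNotNull)

    // Store the consolidated data back to HDFS in Parquet.
    cleansed.write.mode("overwrite").parquet("hdfs:///curated/customers/")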
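
A small sketch of the join and parallelism tuning mentioned above: repartitioning the large side and broadcasting a small dimension table so the join avoids a full shuffle. The table names and partition count are assumptions for illustration.

    import org.apache.spark.sql.functions.broadcast

    val facts = spark.read.parquet("hdfs:///curated/call_records/")
    val dims  = spark.read.parquet("hdfs:///curated/device_types/")

    // Set the level of parallelism explicitly for the downstream stages.
    val repartitioned = facts.repartition(200, facts("device_id"))

    // Broadcast the small dimension table so the join is performed map-side, without a shuffle.
    val enriched = repartitioned.join(broadcast(dims), Seq("device_id"))

    enriched.write.mode("overwrite").parquet("hdfs:///curated/call_records_enriched/")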
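
Finally, a hedged sketch of a windowed Flink join with checkpointing enabled, using the Scala DataStream API; the in-memory sources, key fields and window size are placeholders (a real job would read from Kafka or a similar source).

    import org.apache.flink.streaming.api.scala._
    import org.apache.flink.streaming.api.windowing.assigners.TumblingProcessingTimeWindows
    import org.apache.flink.streaming.api.windowing.time.Time

    object FlinkJoinSketch {
      def main(args: Array[String]): Unit = {
        val env = StreamExecutionEnvironment.getExecutionEnvironment
        // Checkpoint every 60 seconds so the streaming job can recover from failures.
        env.enableCheckpointing(60000)

        // Toy in-memory streams of (key, value) pairs standing in for real sources.
        val clicks: DataStream[(String, Int)] = env.fromElements(("u1", 1), ("u2", 1))
        val views:  DataStream[(String, Int)] = env.fromElements(("u1", 5), ("u2", 7))

        // Windowed join of the two streams on the key field.
        val joined = clicks
          .join(views)
          .where(_._1)
          .equalTo(_._1)
          .window(TumblingProcessingTimeWindows.of(Time.seconds(10)))
          .apply { (c, v) => (c._1, c._2 + v._2) }

        joined.print()
        env.execute("flink-join-sketch")
      }
    }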

PhyCare Solutions July 2016 – Jan 2019


South Setauket, NY

Responsibilities

● Performed analysis and presented results using SQL, SSIS, MS Access, Excel,
and Visual Basic scripts.
● Imported, exported and manipulated large data sets in multi-million-row
databases under tight deadlines.
● Wrote tools and scripts to increase departmental efficiency and automate repeatable tasks.
● Manipulated files and their associated data for rapid delivery to clients or
loading onto internal databases.
● Collaborated with project managers, legal counsel and other team members to gather data for projects.
● Reported performance results with analysis and recommendations.

Uurmi Solutions (Private) Limited July 2013 – Jan 2016


Hyderabad, India

Responsibilities
● Involved in designing, developing, analyzing and testing complete product functionality and feature changes.
● Participated in design review meetings to understand the technical and functional systems overview.
● Understood functional requirements and feature changes, and created a complete test plan covering all possible scenarios, along with test strategy documents.
● Worked under the Waterfall methodology.

● Involved in requirements collection and analysis.
● Participated in test reviews to ensure requirements coverage.
● Created test plan and test scenarios.
● Reviewed test plans with onsite team and client.
● Reported defects using Bugzilla and provided status reports.
● Designed test cases, performed peer reviews and test execution.
● Extensively involved in regression testing.
● Worked on hardware and software testing of client/server applications and web services, including functional testing, performance testing and GUI testing with the Monkey tool, in hospital and healthcare administration.
● Participated in daily and weekly status reporting and defect meetings with clients.
● Prepared test estimates, test plans, and test strategy documents.
● Provided estimates and collection of metrics.
● Test case design and execution.
● Prepared the traceability matrix.
● Reviewed requirement specification.
● Involved in testing I/O interfaces: RS232, RS234, DVI, USB and 1 Gig Ethernet.
● Involved in capturing video at different resolutions (720P, 1280P, 1200), storing it to a device, then retrieving it and playing it back on BARCO displays.
