Data Lake Development in Clinical Analytics

Keshav Balivada has over 4 years of experience working with big data technologies like Apache NiFi, Hadoop, HDFS, Hive, Impala, Sqoop, and Spark SQL. He is currently a Senior Associate at Syneos Health working on a project to create a centralized clinical study data lake. Previous experience includes projects with Ernst & Young to build data lakes for banking and pharmaceutical clients. Technologies used include Python, SQL, Hive, Impala, MongoDB, SAS, and Spotfire.

KESHAV BALIVADA

Email: keshavbalivada@[Link]
Contact No.: +91-8500360567
Work Experience: 4 years

PROFESSIONAL SUMMARY
• Currently working as Senior Associate, IT at SYNEOS HEALTH (Jan 2019 to present) in the BT Application Engineering (Big Data Analytics) team.
• 2+ years of experience as Senior Analyst at ERNST & YOUNG, GDS Bangalore (Sept 2016 to Dec 2018).
• Good exposure to Python and big data platforms in the banking domain.
• Hands-on experience with Apache NiFi, Hadoop, HDFS, HUE, Hive, Impala, Sqoop, Beeline, Oozie, Spark SQL, and UNIX shell scripting.
• Hands-on experience writing HiveQL queries to process data for analysis.

EDUCATION

Bachelor of Technology in Mechanical Engineering from M.V.G.R College of Engineering.

TECHNICAL SKILLS & TOOLS

Languages : Python, SQL
Operating Systems : Windows XP/Vista, Linux
Visualizations : Spotfire, Power BI
Databases : SQL Server, MySQL, MongoDB
Hadoop Ecosystem : Hadoop, HUE, Hive, Impala, Sqoop, Oozie
ETL : Apache NiFi, Alteryx

PROJECT EXPERIENCE

Project #1:

Project Title : Sponsor Integration

Distribution : Hortonworks

Company : Syneos Health

Duration : Jan 2019 to Present

Created a centralized data repository (Data Lake) for clinical study data and transformed the data per sponsor requirements using Hive and Apache NiFi.
• Involved in end-to-end deployment of the data lake in both test and production environments.
• Designed and developed the layers of the Data Lake.
• Transformed the data based on the STM using Hive and created the target tables (see the sketch after this project).
• Created and scheduled Apache NiFi workflows to send study data to the sponsor on a weekly basis.
• Analyzed large data sets by running Hive queries.
• Involved in unit testing.
• Handled the development for each sponsor's study data.
• Gained knowledge of the pharma and clinical domain.

Technology used: Hadoop, Apache Hive, Python, Apache NiFi, Groovy scripting.
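
As a hedged illustration of the kind of STM-driven Hive transformation described above (not the actual sponsor logic), the sketch below submits a parameterized HiveQL statement through Beeline from Python; the JDBC URL, table names, and columns are hypothetical placeholders.

import subprocess

# Hypothetical HiveServer2 JDBC URL (placeholder, not a real host).
JDBC_URL = "jdbc:hive2://hiveserver.example.com:10000/clinical_lake"

# Illustrative STM-style mapping: raw source columns -> sponsor-facing target columns.
HQL = """
CREATE TABLE IF NOT EXISTS sponsor_ae_weekly
STORED AS PARQUET AS
SELECT
    study_id,
    subject_id,
    ae_term                AS adverse_event,
    CAST(ae_start AS DATE) AS ae_start_date
FROM raw_study_ae
WHERE study_id = '${hivevar:study_id}';
"""

def run_transformation(study_id: str) -> None:
    """Submit the HiveQL transformation for one study via Beeline."""
    subprocess.run(
        ["beeline", "-u", JDBC_URL,
         "--hivevar", f"study_id={study_id}",
         "-e", HQL],
        check=True,
    )

if __name__ == "__main__":
    run_transformation("STUDY-001")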

Project #2:

Project Title : PNC Bank, US

Distribution : Cloudera

Company : Ernst & Young, Bangalore

Duration : 1 year 2 months

Created a centralized data repository (Data Lake) for retail bank data and generated monthly reports through SAS Enterprise.
• Involved in analyzing the system and business requirements with clients.
• Designed and developed the layers of the Data Lake.
• Created Hive external tables on top of the data loaded into HDFS (see the sketch after this project).
• Involved in executing the Hive scripts in Impala.
• Analyzed large data sets by running Hive queries.
• Involved in unit testing.
• Handled the development for the credit card area.
• Migrated existing SAS scripts to Python.

Technology used: SAS, Cloudera Platform, Hadoop, Sqoop, Apache Hive, Impala, Python, PySpark
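
A minimal sketch of the external-table and reporting pattern described above, written with PySpark (listed in the project's technology stack); the HDFS path, database, table, and column names are hypothetical placeholders, not the client's actual schema.

from pyspark.sql import SparkSession

# Hive support lets spark.sql() manage external tables in the metastore.
spark = (SparkSession.builder
         .appName("retail-lake-sketch")
         .enableHiveSupport()
         .getOrCreate())

# External table over data already landed in HDFS (path and schema are placeholders).
spark.sql("""
    CREATE EXTERNAL TABLE IF NOT EXISTS cc_transactions (
        account_id STRING,
        txn_date   DATE,
        amount     DOUBLE
    )
    STORED AS PARQUET
    LOCATION '/data/raw/credit_card/transactions'
""")

spark.sql("CREATE DATABASE IF NOT EXISTS reports")

# Monthly spend per account -- the kind of aggregate previously produced in SAS.
monthly = spark.sql("""
    SELECT account_id,
           date_format(txn_date, 'yyyy-MM') AS txn_month,
           SUM(amount)                      AS total_spend
    FROM cc_transactions
    GROUP BY account_id, date_format(txn_date, 'yyyy-MM')
""")
monthly.write.mode("overwrite").saveAsTable("reports.cc_monthly_spend")
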
Project #3:

Project Title : CITIBANK, US

Software/Tools/Technology Used : Python, JavaScript, MongoDB

Worked as a single resource creating interactive voice-analytics dashboards and worked on text analytics using Python NLP (NLTK and FrameNet).
• Created a working prototype for highlighting text in a PDF against a dictionary of keywords using Python and NLTK (a minimal sketch follows this project).
• Hosted the dashboards using Python Flask.
• Worked directly with clients.
• Ensured deliverables were on time.
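
A minimal sketch of the keyword-matching core of the prototype described above, exposed over Flask; the endpoint name and keyword dictionary are hypothetical placeholders, and PDF text extraction and in-document highlighting are assumed to happen elsewhere.

from flask import Flask, jsonify, request
from nltk.tokenize import wordpunct_tokenize

# Placeholder keyword dictionary; the real prototype used a domain-specific list.
KEYWORDS = {"refund", "dispute", "fraud", "chargeback"}

app = Flask(__name__)

@app.route("/highlight", methods=["POST"])
def highlight():
    """Return the tokens from the posted text that match the keyword dictionary."""
    text = request.get_json(force=True).get("text", "")
    tokens = wordpunct_tokenize(text)
    matches = sorted({t.lower() for t in tokens} & KEYWORDS)
    return jsonify({"matches": matches})

if __name__ == "__main__":
    app.run(port=5000)
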

Project #4:

Project Title : Chemours, US

Software/Tools/Technology Used : MS SQL Server 2012, SAP Database


• Understood and developed the high-end business logic.
• Worked as a single resource creating SQL scripts for user-to-role mapping.
• Created visualization dashboards using Spotfire.

Deep Learning POC:

• Currently upskilling in deep learning out of personal interest.
• Developing various POCs using deep learning algorithms for text extraction, OCR, and speech-to-text conversion (a minimal sketch follows).

Technology used: Python with fastai, TensorFlow, and Keras; currently upskilling on PyTorch.
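
A minimal Keras sketch of the kind of character-classification building block an OCR proof-of-concept might start from; it is not the author's actual model, and the input size and class count are assumptions.

import tensorflow as tf
from tensorflow.keras import layers, models

# Tiny convolutional classifier for 28x28 grayscale character crops.
model = models.Sequential([
    layers.Input(shape=(28, 28, 1)),
    layers.Conv2D(32, 3, activation="relu"),
    layers.MaxPooling2D(),
    layers.Conv2D(64, 3, activation="relu"),
    layers.MaxPooling2D(),
    layers.Flatten(),
    layers.Dense(128, activation="relu"),
    layers.Dense(36, activation="softmax"),  # e.g. 26 letters + 10 digits
])
model.compile(optimizer="adam",
              loss="sparse_categorical_crossentropy",
              metrics=["accuracy"])
# model.fit(x_train, y_train, epochs=5)  # trained on labelled character crops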

Hackathon at Ernst & Young:

Performed data quality checks for a UK-based real-time project.

Technology used: Python with machine learning algorithms.

Extra-Curricular Activities:

• Allocate resources to projects in the pipeline.
• Create bench reports of resources on a weekly basis.
• Look after resource management for FSO and generate weekly and monthly reports for the US team.
• Handled a team of five across locations within India.

Achievements:
• Won 4th position in a hackathon out of 27 teams. Topic: creating data quality checks using machine learning and deep learning in Python.
• Won the Extra Miler award with INR 10,000.
• Won the Spot Award with INR 2,000.
• Won a Reward & Recognition award as an all-rounder in learning, practice management, and knowledge transfer.
• Won an Onsite Recognition award with INR 25,000 for being the best resource in the development of a Data Lake for a retail bank project.
