Data Team
Data Analytics Case Study: Part 2
6/26/2018
Preamble
You have been given a dataset that describes the performance of selected customers on
a subset of Traveloka products. (The metadata for the dataset is explained in
the next slides.)
Your objective:
• The overall objective of the analysis is to create recommendations and a
strategy to reduce customer churn, and to summarize your analysis in a
presentation.
• Customer churn = a customer who has not bought anything in the one year
before last_updated_date in the dataset
• last_updated_date = ‘2019-02-01’
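The churn definition above can be sketched in Python as follows. This is only an illustration under stated assumptions: the brief does not say which field counts as a purchase, so the function takes any last-purchase timestamp in milliseconds; treating last_updated_date as midnight UTC and treating a purchase exactly on the cutoff as not churned are also assumptions. The names LAST_UPDATED, CUTOFF, ms_to_datetime and is_churned are hypothetical.

```python
from datetime import datetime, timezone

# last_updated_date from the brief; midnight UTC is an assumption.
LAST_UPDATED = datetime(2019, 2, 1, tzinfo=timezone.utc)
# One calendar year earlier, i.e. 2018-02-01.
CUTOFF = LAST_UPDATED.replace(year=LAST_UPDATED.year - 1)


def ms_to_datetime(ts_ms):
    """Convert a unix timestamp in milliseconds to an aware UTC datetime."""
    return datetime.fromtimestamp(ts_ms / 1000, tz=timezone.utc)


def is_churned(last_purchase_ms):
    """True if the last purchase happened before the one-year cutoff."""
    return ms_to_datetime(last_purchase_ms) < CUTOFF
```

Whether the booking time or the issued time should be passed in is a modelling choice worth stating explicitly in the analysis.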
Metadata for the dataset
You are given a dataset with the following fields:
• Profile_id = hashed identity of a customer. One Profile_id represents one customer
[64-bit integer]
• Primary_sales_product_type = the product identifier [{‘FLIGHT’, ‘HOTEL’, ‘TRAIN’, ‘TRIP’}]
• first_booking_time = the first time the customer booked a product in
Primary_sales_product_type [unix timestamp in milliseconds]
• First_issued_time = the first time the customer issued a product in
Primary_sales_product_type [unix timestamp in milliseconds]
• Last_booking_time = the last time the customer booked a product in
Primary_sales_product_type [unix timestamp in milliseconds]
• Last_issued_time = the last time the customer issued a product in
Primary_sales_product_type [unix timestamp in milliseconds]
• Avg_booking_invoice_amount_idr = average invoice amount of submitted bookings [in IDR]
• Avg_issued_invoice_amount_idr = average invoice amount of issuances [in IDR]
• Count_booking = total count of bookings of that particular product (from the day the
customer was acquired until now) [integer]
• Count_issued = total count of issuances of that particular product (from the day the
customer was acquired until now) [integer]
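Because the time fields are unix timestamps in milliseconds and are recorded per product type, a customer-level view needs a conversion and an aggregation step. The sketch below shows one possible way to do this. It assumes the dataset has one row per customer and product type and that missing timestamps appear as None; neither is stated in the brief. The sample rows and the function name latest_issue_per_customer are hypothetical, and the column casing in the actual file may differ.

```python
from datetime import datetime, timezone

# Hypothetical sample rows, not taken from the real dataset.
rows = [
    {"Profile_id": 1, "Primary_sales_product_type": "FLIGHT",
     "Last_issued_time": 1546300800000},
    {"Profile_id": 1, "Primary_sales_product_type": "HOTEL",
     "Last_issued_time": 1514764800000},
    {"Profile_id": 2, "Primary_sales_product_type": "TRAIN",
     "Last_issued_time": None},
]


def latest_issue_per_customer(rows):
    """Map each Profile_id to its most recent issuance as a UTC datetime."""
    latest_ms = {}
    for row in rows:
        pid = row["Profile_id"]
        ts = row["Last_issued_time"]
        # Keep customers with no issuance at all, marked as None.
        latest_ms.setdefault(pid, None)
        if ts is not None and (latest_ms[pid] is None or ts > latest_ms[pid]):
            latest_ms[pid] = ts
    return {
        pid: None if ts is None
        else datetime.fromtimestamp(ts / 1000, tz=timezone.utc)
        for pid, ts in latest_ms.items()
    }


latest = latest_issue_per_customer(rows)
```

Dividing by 1000 matters here: passing milliseconds directly to a function that expects seconds gives dates far in the future or raises an error.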
Hints
• You are free to use any online resources that may help you do your
work. Don’t forget to cite your sources.
• We encourage you to use whichever language you are most comfortable with for the
analysis. However, we recommend using a notebook for Python
(Colab or any other notebook). For R, please use RStudio.
• Preferably use GSlides/PowerPoint to summarise the findings and
recommendations.
• You are required to submit the code and presentation as part of the
submission documents.
• Feel free to reach out to the recruiter for any questions / clarifications as
needed
Dataset
Data Set Link: link
Please reach out to your recruiter from Traveloka if you have any issues or difficulties
accessing the data