0% found this document useful (0 votes)
138 views5 pages

Data Analytics Case Interview - Part 2

The document provides metadata for a customer dataset and instructions for an analysis case study. The objective is to analyze the dataset to create recommendations to reduce customer churn. The dataset contains fields with customer profile and transaction information. It describes customers who have not made a purchase in the last year. The analysis should be summarized in a presentation using Python, R, or other tools. The codes and presentation must be submitted.

Uploaded by

Bimo Pratama
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
138 views5 pages

Data Analytics Case Interview - Part 2

The document provides metadata for a customer dataset and instructions for an analysis case study. The objective is to analyze the dataset to create recommendations to reduce customer churn. The dataset contains fields with customer profile and transaction information. It describes customers who have not made a purchase in the last year. The analysis should be summarized in a presentation using Python, R, or other tools. The codes and presentation must be submitted.

Uploaded by

Bimo Pratama
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd

Data Team

Data Analytics Case Study: Part 2

6/26/2018 1
Preamble

You were given a dataset that describes selected customer performance on


a subset of traveloka products (The metadata for the dataset is explained in
the next slides)

Your objective:
• Overall objective of analysis is to create recommendation and
strategy to reduce customer churn and summarize your analysis in a
presentation.
• Customer churn = customer who haven’t bought in the last one year
from last_updated_date in the dataset
• last_updated_date = ‘2019-02-01’

2
Metadata for the dataset

You are given a dataset with the following fields


• Profile_id = hashed identity of a customer. One profile id represents one customer
[integer 64]
• Primary_sales_product_type = the product identifier [{‘FLIGHT’, ‘HOTEL’, ‘TRAIN’, ‘TRIP’}]
• first_booking_time = is the first time the customer booked a product in
primary_sales_product [unix timestamp in milliseconds]
• First_issued_time = is the first time the customer issued a product in
primary_sales_product [unix timestamp in milliseconds]
• Last_booking_time = is the last time the customer booked a product in
primary_sales_product [unix timestamp in milliseconds]
• Last_issued_time = is the first time the customer issued a product in
primary_sales_product [unix timestamp in milliseconds]
• Avg_booking_invoice_amount_idr = avg invoice amount in submitted booking [in IDR]
• Avg_issued_invoice_amount_idr = avg invoice amount in issuance [in IDR]
• Count_booking = total count of bookings of that particular product (from the day they
were acquired until now) [integer]
• Count_issued = total count of issuance of that particular product (from the day they were
acquired until now) [integer]
Hints

• You are free to use any resources online that may help you to do your
work. Don’t forget to list your citation.
• We encourage you to use any language as per your comfort for the
analysis. However we do recommend using a notebook for Python
(collab or any other notebook). For R, please use R studio
• Preferably use + GSlides/Powerpoint to summarise the finding and
recommendations
• You are required to submit the codes and presentation as a part of the
submission documents.
• Feel free to reach out to the recruiter for any questions / clarifications as
needed
Dataset

Data Set Link: link

Please reach out to your recruiter from Traveloka for any issue/difficulties with the
data access

You might also like