0% found this document useful (0 votes)
28 views2 pages

Data Analytics Case Study 2023-24

The document outlines a data analytics case study involving a sample dataset of research papers published between 2023-2024, including metadata like DOI, authors, and publication details. The objective is to analyze the data for insights relevant to researchers, publishers, and the public, with examples of potential questions and analysis methods provided. Submissions are due by January 24, 2025, and can be made in .docx or .ipynb formats, with guidelines for submission outlined.

Uploaded by

hrithvikwork
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
28 views2 pages

Data Analytics Case Study 2023-24

The document outlines a data analytics case study involving a sample dataset of research papers published between 2023-2024, including metadata like DOI, authors, and publication details. The objective is to analyze the data for insights relevant to researchers, publishers, and the public, with examples of potential questions and analysis methods provided. Submissions are due by January 24, 2025, and can be made in .docx or .ipynb formats, with guidelines for submission outlined.

Uploaded by

hrithvikwork
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 2

Data Analytics Case Study

About the Data

The data shared herewith is a sample snapshot of research papers published by


some publishers from 2023-2024 (2yrs). Each record is an individual research article
with its constituent metadata like DOI (persistent identifier link), authors, title, date of
publication, journal of publication etc. Some of the metadata fields like authors would
be nested to contain detailed information about the same. The structure of an object
(research article) is shared in the file “Individual Record Structure.txt”. In case of
missing meta-data then that particular object won’t have the field name

Your Objective

To analyse the data and find insight/patterns from the sample data which could be of
interest to a researcher, publisher, university (affiliation) or general public. Your
analysis and conclusions could be submitted as a word document(.docx) or python
notebook(. ipynb) with relevant chart/graphs that helped you uncover an insight

Some examples for your reference

1. https://scholarlykitchen.sspnet.org/2022/11/08/guest-post-publishing-fast-and-
slow-a-review-of-publishing-speed-in-the-last-decade/
2. https://www.linkedin.com/posts/mtclarke_where-are-all-the-papers-going-i-
wrote-activity-7237517444964085760-Eq-
h?utm_source=share&utm_medium=member_desktop
3. https://www.csescienceeditor.org/article/publishing-in-january-and-impact-on-
citations-does-the-data-support-the-strategy/

Some open questions for your reference

1. Which countries have the most collaboration while publishing research and
how has their collaboration effected over the years
2. Publishing trends of a journal/publisher with respect to a country
3. YoY (year on year) publishing trend of a particular journal with respect to a
subject area/concept

You are not limited by any of the above questions or examples and are free to
choose any other question/insight that you might want to uncover. You are also free
to use any additional open-source database to supplement your data analysis.
Brownie points if you could support your findings with other articles/news events
published on the internet that validate your findings

General Guidelines

Confidential
1. Data and this README file can be downloaded from
https://drive.google.com/drive/folders/1RqphzTFi-
bMnfJKHntnIdmT2mjuxDB5w?usp=sharing
2. Please upload your submissions (.docx or . ipynb) in this form by 2:00PM IST 24th
January 2025

3. You may submit multiple time within the submission window and only your latest
submission would be considered for evaluation
4. In case of multiple files in a submission, please zip and upload them. Do not
upload the input files again, only your submission files need to be uploaded

Confidential

You might also like