Skip to content

Conversation

@landreev
Copy link
Contributor

@landreev landreev commented Nov 25, 2024

What this PR does / why we need it:

This PR reconciles the 2 implementations of the same feature, mine and @stevenferey 's, hence 2 issue numbers in the PR name. It will only close one of them, #10982; the other issue needs more work on other features being added as part of it and it's going back into "on hold" for now.

Which issue(s) this PR closes:

Special notes for your reviewer:

Suggestions on how to test this:

Below is a real life example of an OAI-PMH archive subject to the feature added in this PR.

Create a harvesting client as follows:

harvest url: https://data.progedo.fr/oai
set: None
metadata format: oai_dc
archive type: Generic OAI archive

Most (or all?) of their records will be failing to import when testing with the develop branch prior to this PR. All (or most) should successfully import when testing this branch. As of writing this, having tested just now:

Screen Shot 2024-11-25 at 10 20 33 AM

since this is a real, active archive, their holdings are subject to ongoing change; so, your results may vary.

Does this PR introduce a user interface change? If mockups are available, please link/include them here:

Is there a release notes update needed for this change?:

Additional documentation:

Preview docs at https://dataverse-guide--11049.org.readthedocs.build/en/11049/api/native-api.html#create-a-harvesting-client

@coveralls
Copy link

coveralls commented Nov 25, 2024

Coverage Status

coverage: 22.486% (+0.04%) from 22.447%
when pulling 40fe665 on 10909-oai-identifiers-as-pids
into f95c1a0 on develop.

@github-actions

This comment has been minimized.

@github-actions

This comment has been minimized.

@github-actions

This comment has been minimized.

@landreev landreev added the Size: 10 A percentage of a sprint. 7 hours. label Nov 25, 2024
@landreev landreev added this to the 6.5 milestone Nov 25, 2024
@landreev landreev marked this pull request as ready for review November 25, 2024 15:23
@pdurbin pdurbin changed the title 10982 10909 Allow using OAI-PMH identifiers as persistent ids of harvested datasets Allow using OAI-PMH identifiers as persistent ids of harvested datasets Nov 25, 2024
@pdurbin pdurbin self-assigned this Nov 25, 2024
@pdurbin pdurbin changed the title Allow using OAI-PMH identifiers as persistent ids of harvested datasets 10982 10909 Allow using OAI-PMH identifiers as persistent ids of harvested datasets Nov 25, 2024
Copy link
Member

@pdurbin pdurbin left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Overall, looks great. Here's some initial feedback.

@github-actions

This comment has been minimized.

1 similar comment
@github-actions

This comment has been minimized.

…added when resolving a conflict with a cherry-picked commit, which of course changes the checksum)
@github-actions

This comment has been minimized.

@landreev landreev assigned pdurbin and unassigned pdurbin and landreev Nov 25, 2024
Copy link
Member

@pdurbin pdurbin left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I didn't run the code but it looks good (docs too) and the "how to test" instructions seem quite clear. Approved.

@pdurbin pdurbin removed their assignment Nov 25, 2024
@github-actions

This comment has been minimized.

@ofahimIQSS ofahimIQSS self-assigned this Nov 26, 2024
@github-actions
Copy link

📦 Pushed preview images as

ghcr.io/gdcc/dataverse:10909-oai-identifiers-as-pids
ghcr.io/gdcc/configbaker:10909-oai-identifiers-as-pids

🚢 See on GHCR. Use by referencing with full name as printed above, mind the registry name.

@ofahimIQSS
Copy link
Contributor

tested and passed in internal - 1670 Files were successfully harvested
Testing of 11049.docx

@ofahimIQSS ofahimIQSS merged commit 3c427c1 into develop Nov 27, 2024
9 checks passed
@ofahimIQSS ofahimIQSS deleted the 10909-oai-identifiers-as-pids branch November 27, 2024 15:08
@ofahimIQSS ofahimIQSS removed their assignment Nov 27, 2024
landreev added a commit that referenced this pull request May 30, 2025
…led the 2 implementations of the OAI-identifiers-as-global-ids feature, mine and Steven Ferey's. #11479
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

Size: 10 A percentage of a sprint. 7 hours.

Projects

None yet

Development

Successfully merging this pull request may close these issues.

Feature Request: Request identifier support for OAI_DC harvesting

5 participants