Figure out whether, or how, to support the extended ISO 639-3 list of languages #8578

@tjouneau

Description

After version 5.4, language mapping has improved, but some codes are still not handled.
The ones encountered so far are frm (Middle French) and fro (Old French).
Would it be possible to include all ISO 639-3 codes in the Dataverse source?

What steps does it take to reproduce the issue?
Try to harvest from https://repository.ortolang.fr/api/oai/?verb=ListRecords&set=producer:atilf&metadataPrefix=oai_dc
Six datasets are not harvested, four of them because of language mapping issues.
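
To see which language codes a harvest source actually uses before running the harvest, the dc:language values in an oai_dc ListRecords response can be tallied and compared against the codes the target accepts. The following is only a sketch: the function names, the inline sample response (including its example.org identifiers) and the SUPPORTED set are made up for illustration and do not reflect Dataverse's real vocabulary or code.

```python
import xml.etree.ElementTree as ET
from collections import Counter

# Standard OAI-PMH and Dublin Core namespaces.
NS = {
    "oai": "http://www.openarchives.org/OAI/2.0/",
    "dc": "http://purl.org/dc/elements/1.1/",
}

# Hypothetical stand-in for the codes the target installation accepts.
SUPPORTED = {"fra"}

# Made-up ListRecords response, used only to exercise the functions.
SAMPLE = """
<OAI-PMH xmlns="http://www.openarchives.org/OAI/2.0/">
  <ListRecords>
    <record>
      <header><identifier>oai:example.org:1</identifier></header>
      <metadata>
        <oai_dc:dc xmlns:oai_dc="http://www.openarchives.org/OAI/2.0/oai_dc/"
                   xmlns:dc="http://purl.org/dc/elements/1.1/">
          <dc:language>fro</dc:language>
        </oai_dc:dc>
      </metadata>
    </record>
    <record>
      <header><identifier>oai:example.org:2</identifier></header>
      <metadata>
        <oai_dc:dc xmlns:oai_dc="http://www.openarchives.org/OAI/2.0/oai_dc/"
                   xmlns:dc="http://purl.org/dc/elements/1.1/">
          <dc:language>frm</dc:language>
          <dc:language>fra</dc:language>
        </oai_dc:dc>
      </metadata>
    </record>
  </ListRecords>
</OAI-PMH>
"""


def language_codes(list_records_xml: str) -> Counter:
    """Count every non-empty dc:language value found in the response."""
    root = ET.fromstring(list_records_xml.strip())
    counts = Counter()
    for record in root.iterfind(".//oai:record", NS):
        for lang in record.iterfind(".//dc:language", NS):
            code = (lang.text or "").strip()
            if code:
                counts[code] += 1
    return counts


def unmapped(counts: Counter, supported: set) -> list:
    """Return the codes present in the response but absent from `supported`."""
    return sorted(code for code in counts if code not in supported)


if __name__ == "__main__":
    print(unmapped(language_codes(SAMPLE), SUPPORTED))
```

In practice the XML would come from the ListRecords URL above rather than from an inline string, and the supported set would have to be taken from the installation's own language vocabulary.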

What happens?
Mapping errors are documented in the harvest log:
Exception processing getRecord(), oaiUrl=https://repository.ortolang.fr/api/oai, identifier=oai:ortolang.fr:0c2017f1-7c3b-473a-b75d-ad97b4e09bd0, edu.harvard.iq.dataverse.api.imports.ImportException, Failed to import harvested dataset: class edu.harvard.iq.dataverse.util.json.ControlledVocabularyException (Value 'fro' does not exist in type 'language')"
I'm attaching the relevant extract of server.log and the harvest log.

harvest_ortolang3_2022-04-04T15-34-00.log
server.log
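
For anyone triaging similar logs, the rejected values can be pulled out with a small script. This is a rough sketch that assumes every failure line contains the same "Value '...' does not exist in type '...'" wording as the message quoted above; the function name and the regular expression are illustrative only, and LOG_LINE is an excerpt of that message.

```python
import re
from collections import Counter

# Matches the wording seen in the harvest log message quoted in this report.
PATTERN = re.compile(r"Value '([^']*)' does not exist in type '([^']*)'")

# Excerpt of the log message from this report.
LOG_LINE = (
    "Failed to import harvested dataset: class "
    "edu.harvard.iq.dataverse.util.json.ControlledVocabularyException "
    "(Value 'fro' does not exist in type 'language')"
)


def rejected_values(log_lines):
    """Count (vocabulary type, rejected value) pairs across the given lines."""
    counts = Counter()
    for line in log_lines:
        for value, vocab_type in PATTERN.findall(line):
            counts[(vocab_type, value)] += 1
    return counts


if __name__ == "__main__":
    for (vocab_type, value), n in rejected_values([LOG_LINE]).items():
        print(f"{vocab_type}: {value} ({n})")
```

Running it over the whole harvest log (for example by passing an open file object as log_lines) should give a quick list of the codes that would need to be added.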

Which version of Dataverse are you using?
5.10

Any related open or closed issues to this bug report?

Metadata

    Labels

    Feature: Harvesting
    GREI 3 (Search and Browse)
    NIH OTA DC (Grant: The Harvard Dataverse repository: A generalist repository integrated with a Data Commons)
    NIH OTA: 1.4.14 | 1.4.1 | Resolve OAI-PMH harvesting issues | 5 prdOwnThis is an item synched from the product ...
    Size: 30 (A percentage of a sprint. 21 hours. (formerly size:33))
    Type: Bug (a defect)
    pm.GREI-d-1.4.1 (NIH, yr1, aim4, task1: Resolve OAI-PMH harvesting issues)
    pm.GREI-d-1.4.2 (NIH, yr1, aim4, task2: Create working group on packaging standards)
    pm.GREI-d-2.4.1B (NIH AIM:4 YR:2 TASK:1B | 2.4.1B | (started yr1) Resolve OAI-PMH harvesting issues)
    pm.epic.nih_harvesting