Property talk:P227
Add topicDocumentation
identifier from the Gemeinsame Normdatei authority file of names, subjects, and organizations
The Integrated Authority File (Q36578) (GND) is managed by the German National Library in cooperation with various library networks in German-speaking Europe and other partners.
Allowed qualifiers
[edit]| Wikidata qualifier (Q15720608) | entity (Q35120) | Example |
|---|---|---|
| subject named as (P1810) | person (Q215627) | Naguib Mahfouz (Q7176): Maḥfūẓ, Naǧīb |
| subject named as (P1810) | geographical feature (Q618123) | universe (Q1): Weltall / Kosmos <Begriff> |
| alternative name (P4970) | person (Q215627) | Édith Piaf (Q1631): Gassion, Edith G. |
| start time (P580) | geographical feature (Q618123) | Dnipro (Q48256): subject named as (P1810): Jekaterinoslaw (1776) |
| end time (P582) | corporate body (Q106668099) | Leipzig University (Q154804): subject named as (P1810): Karl-Marx-Universität Leipzig (1991) |
| point in time (P585) | event (Q1656682) | Chaos Communication Congress (Q516804): 2005 |
| subject has role (P2868) | person (Q215627) | Mark Twain (Q7245): subject named as (P1810): Twain, Mark + subject has role (P2868): pseudonym (P742) |
| subject has role (P2868) | corporate body (Q106668099); more precise: museum (Q33506) |
Panorama Mesdag (Q1738414): museum in the Hague |
| subject has role (P2868) | work (Q386724) | Panorama Mesdag (Q110996824): panorama by Hendrik Willem Mesdag |
| subject has role (P2868) | geographical feature (Q618123) | Tomb of the Scipios (Q1540716): subject named as (P1810): Scipionen-Grab +scope note (P9570): Benutze Kombination: Cornelii Scipiones AND Grab AND Rom + subject has role (P2868): cross-reference (Q1302249) |
| subject has role (P2868) | topical term (Q65708270) | book publisher (Q1320047): subject named as (P1810): Verlag + subject has role (P2868): quasi-synonym (Q2122467) |
| subject has role (P2868) | geographical feature (Q618123) corporate body (Q106668099) |
Hamburg-Altona station (Q650270): architectural structure (Q811979) Hamburg-Altona station (Q650270): organization (Q43229) |
| subject has role (P2868) | corporate body (Q106668099) | Upper Alsace (Q1999970). temporary name (Q131095163): Ober-Elsaß |
| location (P276) | corporate body (Q106668099) | Academic Press (Q2076913): New York / San Diego |
| sex or gender (P21) | topical term (Q65708270) | pianist (Q486748): male or of unspecified gender / female[1] |
| scope note (P9570) | topical term (Q65708270) | senator (Q15686806): Auch für Senatoren mit Ministerrang |
| scope note (P9570) | work (Q386724) | Xenien (Q523255): Goethe |
| scope note (P9570) | creator of the GND (for help with indentification) |
Karl-Heinz Schulze (Q113454903): Datensatz angelegt von DFF - Deutsches Filminstitut & Filmmuseum / filmportal.de [DE-Wi17FP] |
| identifier shared with (P4070) | topical term (Q65708270) | Orient (Q205653) ≈ Q1947733 |
| reason for deprecated rank (P2241) | refers to different subject (Q28091153) unconfirmed (Q28831311) redirect (Q45403344) | |
| reason for deprecated rank (P2241) | topical term (Q65708270) | picture editor (Q861009) ≠ photojournalist (Q957729) reason: does not exactly match (Q42415624) [1] |
| reason for deprecated rank (P2241) | (rare case where GND was accidentally reused) | Alfred Daiber (Q114087670): replacement (Q23009439) → Alfred Daiber (Q15782350) |
| reason for deprecated rank (P2241) | (should be replace with valid GND) | undifferentiated identifier value (Q68648103) |
| intended subject of deprecated statement (P8327) | In addition to refers to different subject (Q28091153) |
See: property constraint (P2302) = P227#P2302 (P227#P2302)
Sources and abbreviations
[edit]Gadget
[edit]To import data from the authority file you can use the gadget GND reveal. You only have to add
importScript( 'User:Magnus Manske/gnd reveal.js' )
to your common.js.
Notes
[edit]- Databases to look up the GND: Online-GND (English; German) and DNB-Portal.
- For subject headings the GND uses quasi-synonym (Q2122467) what is useful for libraries but contradicts the concept of Wikidata:
- Error reports
- Error report on German Wikipedia: de:WP:GND/F
- List of undifferentiated identifier value (Q68648103): Wikidata:WikiProject Authority control/Tn
- If a conflation is too complicated to be solved immediately: Property talk:P227/Conflation
- Wikidata:WikiProject Authority control/Error reporting procedures
- Recording companies and music publishers have yet not been imported into the GND (de:Hilfe:GND#Körperschaften). Instead of GND you can use DNB edition ID (P1292).
Duplicates
[edit]Many duplicates result from the merger of the four authority files that were maintained separately until April 2012. They are resolved step-by-step by the German National Library (Q27302) and the other institutions participating in the GND. The duplicate data records from Gemeinsame Körperschaftsdatei (Q872551) (GKD), Schlagwortnormdatei (Q897080) (SWD) and German Music Archive (Q27901) (DMA) are divided into "winner" and "loser" data records depending on their source authority file. (This source authority file for an ID can be found under "Old identifiers" on the page for an ID on https://explore.gnd.network/en/.)
The rule for merging is: "It will always be merged into the winning data record."[2]
- corporate body (Q106668099) → GKD data record
- event (Q1656682) → GKD data record
- works of music → DMA data record[3]
- geographical feature (Q618123) → SWD data record
For duplicates: Please mark the "winner" authority file with "preferred" rank (see Help:Ranking). For the "loser" data record the qualifier
Examples:
- Supreme Court of the Netherlands (Q133765): Q133765#P227 (former GKD = winner since it's a corporate body (Q106668099))
- Łobez (Q196163): 196163#P227 (former SWD: 1 ID; former GKD: 2 IDs due to different cataloging rules)
BTW: You don't have to stick to the library rules. You can use your own judgment to decide which GND is the best choice for Wikidata.
Lists of possible duplicates: Wikidata:Database reports/Identical GND ID and Property talk:P227/Duplicates
Wikidata Query Service:
- https://w.wiki/FoEQ (Property talk:P227/qualifier/subject has role)
Known VIAF/GND problems
[edit]VIAF is helpful but also often incorrect, outdated, and is mixing two identifier systems that in some cases produce dead links.
- Johannes Fabian (Q15641418), VIAF 91414487 merged in January 2014 three GNDs, only one was correct:
- VIAF changes numbers with dashes: Instituto Brasileiro de Geografia e Estatística (Q268072).
- Same name + same year of birth = different person
- In some cases VIAF merges two persons because only one of them has a GND
- Please use: GND = "no value" (for the person without a GND) [2]
- For items often confused use: different from (P1889)
- For unknown reasons VIAF is not importing all GND ids
- Samuel Ramos (Q7412445) = GND 1022446479 (created 16-05-12)
- June 2015: 3 years later the GND id still not harvested by VIAF
- DNB and VIAF have been made aware of the problem, there might have been a harvesting glitch during the first weeks of the GND going live in April 2012: GND records which never have been touched since then are still unknown to VIAF (as an estimate about 15.000 GND records for persons created in early summer 2012 are not represented in VIAF). [7. Jun. 2015 Gymel]
- Update: GND 1022446479 added to VIAF:59099151 on 2015-07-12.
- In some cases VIAF clusters get deleted instead of merged
- Åke Blomström (Q270863): VIAF 228866914; taken care by KrBot
- In rare cases VIAF clusters are reused for a different item
- William of Ockham (Q43936): VIAF:41835567 in 2015
- Lorenzo Guglielmo Traversagni (Q18674108): VIAF:41835567 in 2016
- William of Ockham (Q43936): VIAF:262145669298005170004 (created 2016-02-28)
- Drew Fudenberg (Q1258707): till 2016-09-25 VIAF:59183378 (now: "First National Bank of Lynn")
- William of Ockham (Q43936): VIAF:41835567 in 2015
- In the case a conflated VIAF cluster was used for creating an item, see Help:Conflation of two people (Wikidata:VIAF/cluster)
References
[edit]- ↑ For most occupations, there are two forms in the GND. In some cases there is only a gender-neutral or a female term. Examples: Motorradfahrer (GND 4170605-5); Regierungsbeamter (GND 4398677-8); Au-pair-Mädchen (GND 4003655-8).
- ↑ Barbara Pfeifer: GND-Umstieg in der DNB aktueller Stand und Planungen, GND-Nachbereitungs-Workshop, 31 May 2012, slide 8.
- ↑ Werke der Musik, in: GND-Anwendungsbestimmungen, 2 May 2014, retrieved 18 October 2025.
| ||||||||||||||||||||||||||||||||
(1[01234]?\d{7}[0-9X]|[47]\d{6}-\d|[1-9]\d{0,7}-[0-9X]|3\d{7}[0-9X])|”: value must be formatted using this pattern (PCRE syntax). (Help)List of violations of this constraint: Database reports/Constraint violations/P227#single best value, SPARQL
List of violations of this constraint: Database reports/Constraint violations/P227#Unique value, SPARQL (every item), SPARQL (by value)
List of violations of this constraint: Database reports/Constraint violations/P227#Conflicts with P1292, search, SPARQL
List of violations of this constraint: Database reports/Constraint violations/P227#Label in 'de' language, search, SPARQL
|
This property is being used by:
Please notify projects that use this property before big changes (renaming, deletion, merge with another property, etc.) |
| ||||||||||||||||||||||||||||||||||||||
GND ID exists, type is human, GND ID contains - (Help)
Violations query:
SELECT ?item ?gndid WHERE { ?item wdt:P227 ?gndid; wdt:P31 wd:Q5. FILTER(REGEX(STR(?gndid), "-")) . }List of this constraint violations: Database reports/Complex constraint violations/P227#GND ID - type human should not have GND ID containing -
GND ID exists, type is human, sex missing (Help)
Violations query:
SELECT ?item ?gndid WHERE { ?item wdtn:P227 ?gndid; wdt:P31 wd:Q5 . MINUS { ?item wdt:P21 [] } } LIMIT 1000List of this constraint violations: Database reports/Complex constraint violations/P227#GND ID - type human should have sex
GND ID exists, type is human, VIAF missing (Help)
Violations query:
SELECT ?item ?gndid WHERE { ?item wdtn:P227 ?gndid; wdt:P31 wd:Q5 . MINUS { ?item wdt:P214 [] } } LIMIT 1000List of this constraint violations: Database reports/Complex constraint violations/P227#GND ID - type human should have VIAF
GND ID exists, type is human, is instance of a subclass of human (Help)
Violations query:
SELECT ?item ?gndid WHERE { ?item wdt:P31/wdt:P279+ wd:Q5 . ?item wdt:P31 ?type . ?item wdtn:P227 ?gndid . }List of this constraint violations: Database reports/Complex constraint violations/P227#GND ID - type human should not be instance of a subclass of human
Discussion
[edit]Archives | |||
|---|---|---|---|
|
Zitat oder Auszug
[edit]Hallo @Clemens Dulcis: Wieso hast du quotation or excerpt (P7081) unter „erlaubte Qualifikatoren“ hinzugefügt? Kannst du ein Beispiel nennen? Es gibt bereits den Qualifikator scope note (P9570), der imho diesen Zweck erfüllt. --Kolja21 (talk) 20:34, 23 November 2025 (UTC)
- Vielen Dank für die Nachfrage. Ich habe die Ergänzung wieder gelöscht. Das wichtigste Beispiel war Peter Finke (Q1794580). Viele Grüße! -- Clemens Dulcis (talk) 23:04, 23 November 2025 (UTC)
- Danke für die schnelle Rückmeldung. Bei Peter Finke (Q1794580): German philosopher of science (born 1942) ist der Verwendungshinweis etwas lang geraten. 1) Der Verwendungshinweis dient in erster Linie zur Definition und Unterscheidung mehrerer Normdatensätze (z.B. beim Begriff Senator: "Auch für Senatoren mit Ministerrang"). Bei einem langen Text ist in der Tat quotation or excerpt (P7081) besser, eingetragen unter der Fundstelle. 2) Reicht die Übernahme der Angaben aus dem Feld "Weitere Angaben" aus. Für "Beziehungen zu Organisationen" gibt es in Wikidata die Eigenschaften employer (P108) und affiliation (P1416). Dort kann man Angaben wie "Professor für Wissenschaftstheorie und Biolinguistik" und "1982-2006" ergänzen und mit der GND als Quelle belegen. --Kolja21 (talk) 23:40, 23 November 2025 (UTC)
Schott-Archiv digital
[edit]One more database using GND: Schott-Archiv digital (Q137564935): historical archive of the publishing house B. Schott’s Söhne. --Kolja21 (talk) 03:31, 24 December 2025 (UTC)
Federated queries Wikidata-GND
[edit]I post here a few interesting queries thanks to https://sparql.dnb.de/ and https://qlever.dev/wikidata/:
- GND occupations most used in GND persons
- GND places most used in GND persons
- GND persons last edited in December 2025
--Epìdosis 16:32, 2 February 2026 (UTC) FYI @Emu, Kolja21: P.S. only the last edit date is stored in the GND RDF, not the creation date ... so this is the maximum we can obtain through SPARQL for the last point
- @Epìdosis Could it be that your queries undercount due to different data models? For example, this query from the DNB examples minus the
LIMITreturns 180,555 of Jurist instances whereas your queries return 159,796 and 57,367 instances respectively. While this isn’t really all that relevant to our current challenge, it might nevertheless worth looking into this discrepancy? --Emu (talk) 19:46, 2 February 2026 (UTC)- @Emu: I agree that the discrepancy is strange; as of now, however, I don't understand why it happens honestly; yes, it is worth looking into it. Epìdosis 19:59, 2 February 2026 (UTC)
Duplicates with pseudonyms
[edit]Hello @Epìdosis, Kolja21, Emu:, here is a query highlighting potential duplicates in Wikidata, identified because their respective GND identifiers have a gndo:realIdentity relationship between them and thus one of them may simply be the pseudonym of the other. I have removed cases of duo and collectives pseudonyms, some other false positives may remain. --Thomas Kerboul (BGE) (talk) 13:37, 20 February 2026 (UTC)
- Thanks for the query (1,426 results). Looks like the usual suspect using sock puppets like User:TpDataManager and User:Rudolf Ziegler. To prevent future duplicates, one would have to check the GNDs for pseudonyms and import the missing authority files. This is too time-consuming to do manually: 2 items merged + 2 missing GNDs added (= 4 GNDs). --Kolja21 (talk) 00:39, 21 February 2026 (UTC)
- It would also be helpful if someone could add the following properties to the already created approximately 1,400 dublicates:
- subject named as (P1810)
- if it is a pseudonym: subject has role (P2868): pseudonym (Q61002)
- retrieved (P813)
- This would make it easier to check the items and merge them manually. --Kolja21 (talk) 00:56, 21 February 2026 (UTC)
- Here is a case of duplication created on the base of NL CR AUT ID (P691):
- pseudonym = Frau Annika (Q122573819): no description
- real name = Annika Sauerborn (Q122573821): no description
- @Frettie: FYI. --Kolja21 (talk) 01:07, 21 February 2026 (UTC)
- I refined the query to include the GND label, to keep only pseudonyms and legal names that are instance of (P31)human (Q5) on Wikidata and further excluding items that are linked to each other either with said to be the same as (P460) or different from (P1889) : https://qlever.dev/wikidata/1sIEZT . Based on that I think we can add the missing infos and even attempt at merging the whole 1347 results with Quickstatements. --Thomas Kerboul (BGE) (talk) 06:39, 21 February 2026 (UTC)
- Thanks, i think @Vojtěch Dostál merge this robotically. Vojtěch, do you have script for this? ~~---- Frettie (talk) 07:47, 21 February 2026 (UTC)
- @Frettie Unfortunately, not really, as my access to the WMCZ Catmandu server hasn't been working for more than a year despite repeated calls for attention :). Vojtěch Dostál (talk) 08:50, 21 February 2026 (UTC)
- Ok, i'll try to merger it by AI script. :) --Frettie (talk) 08:54, 21 February 2026 (UTC)
- @Vojtěch Dostál: If the problem is the availability of the server, couldn’t you move to another server, for example Toolforge? (Or is the code only stored on the server and thus inaccessible? Neither User:VojtěchDostálBot nor Wikidata:Requests for permissions/Bot/VojtěchDostálBot say anything about the availability of the non-framework Python code you use.) —Tacsipacsi (talk) 13:42, 22 February 2026 (UTC)
- Just for the records: The issue of duplicates from pseudonyms was already raised in 2018, without any concrete action being taken. I have just merged the item mentioned on the talk page of Reinheitsgebot. --Kolja21 (talk) 17:59, 22 February 2026 (UTC)
- Hi @Kolja21, @Vojtěch Dostál – there is table of Pseudonym and Real person in Google Sheet (editor link) – if this is merge candidate, last column is "YES". Feel free to fix it. I can try to fix it few days or weeks later.--Frettie (talk) 08:18, 23 February 2026 (UTC)
- This will likely be more difficult than it looks. Some are for names of real people borrowed by other people to publish (publishing under the disguise of a different person, eg Drahoslava Janderová (Q95435281) and Sergej Machonin (Q12051502)). Some are for collective pseudonyms. I am afraid most of this has to be sorted out manually :-(. Perhaps we can cut out a part of this that could be merged by a bot, but we have to be careful not to make an even bigger mess. Vojtěch Dostál (talk) 08:53, 23 February 2026 (UTC)
- Hi @Kolja21, @Vojtěch Dostál – there is table of Pseudonym and Real person in Google Sheet (editor link) – if this is merge candidate, last column is "YES". Feel free to fix it. I can try to fix it few days or weeks later.--Frettie (talk) 08:18, 23 February 2026 (UTC)
- there is technical difficulties, but i believe, that we solve this puzzle. :) --Frettie (talk) 18:52, 22 February 2026 (UTC)
- Just for the records: The issue of duplicates from pseudonyms was already raised in 2018, without any concrete action being taken. I have just merged the item mentioned on the talk page of Reinheitsgebot. --Kolja21 (talk) 17:59, 22 February 2026 (UTC)
- @Frettie Unfortunately, not really, as my access to the WMCZ Catmandu server hasn't been working for more than a year despite repeated calls for attention :). Vojtěch Dostál (talk) 08:50, 21 February 2026 (UTC)
- Here is a case of duplication created on the base of NL CR AUT ID (P691):
- It would also be helpful if someone could add the following properties to the already created approximately 1,400 dublicates:
Different year of birth
[edit]@Kolja21, Emu, Diggr: this morning I have been experimenting with the SPARQL endpoint of GND; federated queries with Wikidata tend to timeout in case of hundreds of thousands of results, but they work usually well on smaller numbers. I have extracted Property talk:P227/Inconsistencies/Different year of birth for a check, if you are interested. I will try to work on the pseudonyms later this week. --Epìdosis 10:37, 24 February 2026 (UTC)
- Very helpful list. Thank you! --Kolja21 (talk) 21:13, 24 February 2026 (UTC)
- @Kolja21, Emu: using the SPARQL endpoint of GND, I also extracted birthdates and deathdates for items after Q90000000 and I am going to add them with reference; it will probably take some days, cf. https://qs-dev.toolforge.org/batches/Epìdosis/. Epìdosis 10:58, 25 February 2026 (UTC)
Changes in GND URL
[edit]GND URL are no more available. New URL seems to be https://explore.gnd.network/gnd/. @VIGNERON:
Exemple :
--SAPA bdc (talk) 11:07, 27 February 2026 (UTC)
- I also just realized this change, the old URLs don't work anymore (site can't be reached) and there doesn't seem to be any forwarding active right now. I think the URL formatter should be adjusted, but I'll leave that to those with more insight into all the places where changes are needed. PB (SIK-ISEA) (talk) 17:49, 27 February 2026 (UTC)
- The current URL (the classic DNB catalog) is still valid, but currently disrupted. GND-Explorer (
the beta version of the new DNB catalog) and OGND are two popular alternatives. Kolja21 (talk) 19:10, 27 February 2026 (UTC)- PS: The beta version of the new DNB catalog is another project. DNB website: "Testen Sie auch die Betaversion des neuen DNB-Katalogs." Currently also experiencing technical problems: "Too Many Requests". Kolja21 (talk) 21:58, 27 February 2026 (UTC)
- The current URL (the classic DNB catalog) is still valid, but currently disrupted. GND-Explorer (
Changes by RVA2869. For info: @VIGNERON, SAPA bdc, PB (SIK-ISEA), Kolja21: ¯\_(ツ)_/¯ —Eihel (talk) 03:50, 28 February 2026 (UTC)
- The changes should be reverted. --Emu (talk) 09:01, 28 February 2026 (UTC)
- I think we should revert once https://d-nb.info/gnd/$1 comes back (I would guess on Monday). Epìdosis 14:40, 28 February 2026 (UTC)
- +1. There was no change in the GND URL. The GND-Explorer is useful but does not contain the literature that is available in the German National Library (DNB). Kolja21 (talk) 22:34, 28 February 2026 (UTC)
- @Kolja21, thanks for the clarification. For my use case, linking to the GND-Explorer might actually be the better solution, so if anything, this disruption has helped to get some more visibility for the GND-Explorer. :) PB (SIK-ISEA) (talk) 08:04, 1 March 2026 (UTC)
- A user on the German Wikipedia suspects that a certificate had expired. (February has only 28 days.) Anyway, the error seems to be fixed now and the OPAC is back online. --Kolja21 (talk) 07:36, 2 March 2026 (UTC)
- Our (DNB) services were subject to an exceptionally high volume of requests. Unfortunately, this has been happening more frequently since the AI boom. We are trying to adapt our services accordingly. Diggr (talk) 14:08, 2 March 2026 (UTC)
- A user on the German Wikipedia suspects that a certificate had expired. (February has only 28 days.) Anyway, the error seems to be fixed now and the OPAC is back online. --Kolja21 (talk) 07:36, 2 March 2026 (UTC)
- @Kolja21, thanks for the clarification. For my use case, linking to the GND-Explorer might actually be the better solution, so if anything, this disruption has helped to get some more visibility for the GND-Explorer. :) PB (SIK-ISEA) (talk) 08:04, 1 March 2026 (UTC)
- +1. There was no change in the GND URL. The GND-Explorer is useful but does not contain the literature that is available in the German National Library (DNB). Kolja21 (talk) 22:34, 28 February 2026 (UTC)
- I think we should revert once https://d-nb.info/gnd/$1 comes back (I would guess on Monday). Epìdosis 14:40, 28 February 2026 (UTC)
Missing GND pseudonyms
[edit]@Kolja21, Emu, Diggr: this morning I have made a second experiment with the two endpoints after #Different year of birth: 6108 missing GND pseudonyms being added now through https://qs-dev.toolforge.org/batch/23367/. --Epìdosis 09:18, 8 March 2026 (UTC)
- @Kolja21, Emu, Diggr, Wurgl:: second part, I also add 1431 missing GND pseudonyms which will create conflicts with other items (mostly merges to be made, but also more complex cases like the ones involving collective pseudonyms): https://qs-dev.toolforge.org/batch/23368/. Epìdosis 10:17, 8 March 2026 (UTC)
- Note: the number of pages in de:Kategorie:Wikipedia:GND in Wikipedia weicht von GND in Wikidata ab is up to 2695 after the addition, since DE.WP only structures one GND per person; in order to fix this, it will be necessary to progressively prefer-rank the GND used in DE.WP. FYI @Silewe:. Epìdosis 10:35, 8 March 2026 (UTC)
- Thanks, I already noticed. --Kolja21 (talk) 12:59, 8 March 2026 (UTC)
- @Epìdosis: Can you change in cases like Maya Seidensticker pseudonym (Q61002) (GND type pip) to collective pseudonym (Q16017119) (GND type pis), cf. Talk:Q1473624? --Kolja21 (talk) 13:13, 8 March 2026 (UTC)
- @Kolja21: sure, this batch is running (https://qs-dev.toolforge.org/batch/23386/) and will do it for all the 843 collective pseudonyms we have in Wikidata. Epìdosis 14:32, 8 March 2026 (UTC)
- Batch completed; this (https://w.wiki/JFPo) is a list of all the items in Wikidata containing a GND with qualifier subject has role (P2868)collective pseudonym (Q16017119) despite not being instance of (P31)collective pseudonym (Q16017119); this query should have 0 results (collective pseudonyms should always have a separate item, connected to the items of the real persons using them), but are 764 as of now. Epìdosis 14:46, 8 March 2026 (UTC)
- Thanks. We have two concepts for linking collective pseudonyms:
- Which one should we select as the default? --Kolja21 (talk) 14:55, 8 March 2026 (UTC)
- I don’t have a strong opinion about either option but I think the first one is more common? --Emu (talk) 18:15, 8 March 2026 (UTC)
- I also think the first one is more common, so I would also lean towards it; massively switching to another would not be very complex anyway in the future. Epìdosis 20:31, 8 March 2026 (UTC)
- Property talk:P227/Constraint violations/Single best value constraint (only humans) now with 5.813 items (violations count). Beside collective pseudonym (Q16017119) there are some pseudonyms as unconfirmed hypothesis (Q67203058) where the items should not be merged. Example: Paulus Commodus (Q113774759): no description. --Kolja21 (talk) 09:44, 9 March 2026 (UTC)
- Eh, a lot to check ... if you need other data extraction from GND to help with the check of these items, please ask here. Epìdosis 12:24, 9 March 2026 (UTC)
- Thanks. Before setting "preferred rank" I add subject named as (P1810). (Not mandatory, but helpful for distinguishing the real name from the pseudonyms.) It would save a few clicks if this property was already filled in. --Kolja21 (talk) 13:42, 9 March 2026 (UTC)
- @Kolja21: sure, it will be done in max 2 hours: 5814 edited items through https://qs-dev.toolforge.org/batch/23416/. Epìdosis 14:44, 9 March 2026 (UTC)
- Property talk:P227/Constraint violations/Single best value constraint (only humans) back under control after the batch, 13 items now. de:Kategorie:Wikipedia:GND in Wikipedia weicht von GND in Wikidata ab will surely need more time for manual checks, 2625 results now. Epìdosis 23:18, 11 March 2026 (UTC)
- As a placeholder, de:Kategorie:Wikipedia:GND in Wikipedia weicht von GND in Wikidata ab at 1216 results now. Epìdosis 16:51, 31 March 2026 (UTC)
- Property talk:P227/Constraint violations/Single best value constraint (only humans) back under control after the batch, 13 items now. de:Kategorie:Wikipedia:GND in Wikipedia weicht von GND in Wikidata ab will surely need more time for manual checks, 2625 results now. Epìdosis 23:18, 11 March 2026 (UTC)
- @Kolja21: sure, it will be done in max 2 hours: 5814 edited items through https://qs-dev.toolforge.org/batch/23416/. Epìdosis 14:44, 9 March 2026 (UTC)
- Thanks. Before setting "preferred rank" I add subject named as (P1810). (Not mandatory, but helpful for distinguishing the real name from the pseudonyms.) It would save a few clicks if this property was already filled in. --Kolja21 (talk) 13:42, 9 March 2026 (UTC)
- Eh, a lot to check ... if you need other data extraction from GND to help with the check of these items, please ask here. Epìdosis 12:24, 9 March 2026 (UTC)
- Property talk:P227/Constraint violations/Single best value constraint (only humans) now with 5.813 items (violations count). Beside collective pseudonym (Q16017119) there are some pseudonyms as unconfirmed hypothesis (Q67203058) where the items should not be merged. Example: Paulus Commodus (Q113774759): no description. --Kolja21 (talk) 09:44, 9 March 2026 (UTC)
- I also think the first one is more common, so I would also lean towards it; massively switching to another would not be very complex anyway in the future. Epìdosis 20:31, 8 March 2026 (UTC)
- I don’t have a strong opinion about either option but I think the first one is more common? --Emu (talk) 18:15, 8 March 2026 (UTC)
- @Kolja21: sure, this batch is running (https://qs-dev.toolforge.org/batch/23386/) and will do it for all the 843 collective pseudonyms we have in Wikidata. Epìdosis 14:32, 8 March 2026 (UTC)
- @Epìdosis: Can you change in cases like Maya Seidensticker pseudonym (Q61002) (GND type pip) to collective pseudonym (Q16017119) (GND type pis), cf. Talk:Q1473624? --Kolja21 (talk) 13:13, 8 March 2026 (UTC)
- Thanks, I already noticed. --Kolja21 (talk) 12:59, 8 March 2026 (UTC)
- Note: the number of pages in de:Kategorie:Wikipedia:GND in Wikipedia weicht von GND in Wikidata ab is up to 2695 after the addition, since DE.WP only structures one GND per person; in order to fix this, it will be necessary to progressively prefer-rank the GND used in DE.WP. FYI @Silewe:. Epìdosis 10:35, 8 March 2026 (UTC)
Massive imports of data from GND (2026)
[edit]Massive import of data from GND (April 2026)
[edit]
Notified participants of WikiProject Germany I write here to inform you that from tomorrow I will start through QS 3.0 a very big import of CC0 data from GND ID (P227) (data extracted from https://sparql.dnb.de/ on March 9 su taken from the dump of February 23); the data have been selected in the following way:
- for all humans having GND ID (P227) and no sex or gender (P21), I extracted gndo:gender and converted it into female (Q6581072) or male (Q6581097) (7712)
- for all humans having GND ID (P227) and no place of birth (P19), I extracted gndo:dateOfBirth and converted it into an item (334123)
- for all humans having GND ID (P227) and no place of death (P20), I extracted gndo:dateOfDeath and converted it into an item (62273)
- for all humans having GND ID (P227) and no occupation (P106), I extracted gndo:professionOrOccupation and converted it into an item (609820)
The conversion of the places and occupations was based exclusively on existing GND ID (P227) already present in Wikidata; in all cases in which the value in GND was not a GND, or in which it was a GND but that GND was not present in Wikidata, I will import no data.
If you see mistakes (on over 1 M statements it is certain it will happen), the cause is one of the following:
- the human in Wikidata is matched with the wrong GND person -> move the GND and all the data imported from it to the appropriate item, creating it if needed
- the place/occupation in Wikidata was matched with the wrong GND -> move the GND to the correct place/occupation in Wikidata, creating it if needed; then report the issue here below, I will fix all the cases massively at the end of the import
- the GND person contains a wrong information -> fix the incorrect data in the person and report it here below so that @Kolja21: can fix directly the issue or report it in de:Wikipedia:GND/Fehlermeldung
Best, --Epìdosis 10:06, 31 March 2026 (UTC) FYI
WikiProject Authority control has more than 50 participants and couldn't be pinged. Please post on the WikiProject's talk page instead. --Epìdosis 10:10, 31 March 2026 (UTC)
- Import finished, please report in the sections below if you see mistakes of any kind. Epìdosis 08:03, 5 April 2026 (UTC)
- My first checks on places (of birth and of death) have not found evident mistakes, i.e. non-places used as places. Epìdosis 08:30, 5 April 2026 (UTC)
Massive import of data from GND (May 2026)
[edit]I am now about to start a second massive import, similar to the last one, but with a few improvements (the two main ones: I will takes all the GND occupations, instead of only the first one, and I will skip the cases of GND places or occupations connected from 2(+) Wikidata items) and also more documentation. So, in theory the best option would be having one federated query that extracts, for each property, only the cases in which the Wikidata personal item misses the information and the personal GND ID present in the Wikidata item has the information; however, in my experience this times out; so I extract on one side all the Wikidate personal items with a GND missing the property and on the other side all the personal GND IDs having the property. Here are the queries (data extracted today, so the GND dump used is from April 20):
| Wikidata property | Wikidata humans with GND missing it | GND field | GND persons having it |
|---|---|---|---|
| sex or gender (P21) | https://qlever.dev/wikidata/XVDPIS | gndo:gender | https://sparql.dnb.de/gnd/7cEuHW |
| place of birth (P19) | https://qlever.dev/wikidata/hTrtr4 | gndo:placeOfBirth | https://sparql.dnb.de/gnd/QYHmUJ |
| place of death (P20) | https://qlever.dev/wikidata/N51oji | gndo:placeOfDeath | https://sparql.dnb.de/gnd/RAbyk6 |
| occupation (P106) | https://qlever.dev/wikidata/j1tBN0 | gndo:professionOrOccupation | https://sparql.dnb.de/gnd/ftQ20p |
| date of birth (P569) | https://qlever.dev/wikidata/NmZb7V | gndo:dateOfBirth | https://sparql.dnb.de/gnd/LJJNeJ |
| date of death (P570) | https://qlever.dev/wikidata/FLZ7PJ | gndo:dateOfDeath | https://sparql.dnb.de/gnd/0wdHmF |
Appendix: https://qlever.dev/wikidata/7XWk8n to extract non-personal GND IDs connected from Wikidate items (avoiding unique-value constraint violations; the unfiltered one would be https://qlever.dev/wikidata/ZEcqzw).
I am now preparing the new QS files. Updates will follow here. --Epìdosis 15:23, 4 May 2026 (UTC)
The numbers by property (total: ):
- sex or gender (P21): 27022 values
- date of birth (P569): 327532 values
- place of birth (P19): 72436 values
- date of death (P570): 88700 values
- place of death (P20): 17400 values
- occupation (P106): 209951 values
Please note I will run firstly the removals of unsourced date of birth (P569) values and afterwards the date of death (P570); so the values will be readded, with a GND reference, in maximum 24 hours.
As previously written, if you see mistakes, the cause is one of the following:
- the human in Wikidata is matched with the wrong GND person -> move the GND and all the data imported from it to the appropriate item, creating it if needed
- the place/occupation in Wikidata was matched with the wrong GND -> move the GND to the correct place/occupation in Wikidata, creating it if needed; then report the issue here below, I will fix all the cases massively at the end of the import
- the GND person contains a wrong information -> fix the incorrect data in the person and report it here below so that @Kolja21: can fix directly the issue or report it in de:Wikipedia:GND/Fehlermeldung
Thanks, --Epìdosis 17:46, 4 May 2026 (UTC)
- Imho we should look at some of the cases in which the unsourced date of birth (P569) and date of death (P570) differ from GND. That should be documented even though we can't review every single case. --Kolja21 (talk) 19:39, 4 May 2026 (UTC)
- As of now they will stay untouched, I am only improving the ones where the dates coincide; we can talk about them afterwards, surely they deserve a look. Epìdosis 19:41, 4 May 2026 (UTC)
- As a note about the ongoing removals of date of birth (P569) and date of death (P570) values with precision year (which will take about 2 days): I know I am removing a few hundreds of values with valid references; I am rechecking all my removals of referenced values and reverting them, so they are not going unnoticed. Epìdosis 22:36, 4 May 2026 (UTC)
- Import finished: please report here any issue for future documentation. Epìdosis 07:01, 8 May 2026 (UTC)
- As a note about the ongoing removals of date of birth (P569) and date of death (P570) values with precision year (which will take about 2 days): I know I am removing a few hundreds of values with valid references; I am rechecking all my removals of referenced values and reverting them, so they are not going unnoticed. Epìdosis 22:36, 4 May 2026 (UTC)
- As of now they will stay untouched, I am only improving the ones where the dates coincide; we can talk about them afterwards, surely they deserve a look. Epìdosis 19:41, 4 May 2026 (UTC)
Reports of wrong matches of GND places and GND occupations
[edit]- https://d-nb.info/gnd/4730726-2 moved from Santana (Q873384) to Sântana (Q275910) (and https://d-nb.info/gnd/5227491-3 added to Santana (Q873384)). --Epìdosis 10:32, 1 April 2026 (UTC)
- Value fixed manually in 4 items. Epìdosis 08:03, 5 April 2026 (UTC)
https://d-nb.info/gnd/138325871was associated with both Greater London (Q23306) and London (Q84); I removed it from the first. --Epìdosis 11:05, 4 April 2026 (UTC)- You mean GND 4074335-4 (London & Greater London). --Kolja21 (talk) 14:20, 4 April 2026 (UTC)
- Of course https://d-nb.info/gnd/4074335-4, copy-paste mistake. --Epìdosis 14:43, 4 April 2026 (UTC)
- Greater London (Q23306) is being removed from 1441 places of birth and 809 places of death. Epìdosis 08:09, 5 April 2026 (UTC)
- Same issue found with Tokyo: https://d-nb.info/gnd/4078337-6 was both in Tokyo (Q7473516): special wards in the eastern part of Tokyo Metropolis in Japan, that used to form a single city and Tokyo (Q1490): capital and largest city of Japan; I have removed the first one and I am removing 641 duplicate values (birthplaces and deathplaces) through https://quickstatements.toolforge.org/#/batch/259276. Epìdosis 13:50, 2 June 2026 (UTC)
- Same issue found with Bad Mergentheim: https://d-nb.info/gnd/4038709-4 was both in Bad Mergentheim (Q31897574): human settlement in Germany and Bad Mergentheim (Q61942): municipality in Main-Tauber-Kreis, Baden-Württemberg, Germany; I have removed the first one and I am removing 230 duplicate values (birthplaces and deathplaces) through https://quickstatements.toolforge.org/#/batch/259279 Epìdosis 16:48, 2 June 2026 (UTC)
- Same issue found with Tokyo: https://d-nb.info/gnd/4078337-6 was both in Tokyo (Q7473516): special wards in the eastern part of Tokyo Metropolis in Japan, that used to form a single city and Tokyo (Q1490): capital and largest city of Japan; I have removed the first one and I am removing 641 duplicate values (birthplaces and deathplaces) through https://quickstatements.toolforge.org/#/batch/259276. Epìdosis 13:50, 2 June 2026 (UTC)
- Greater London (Q23306) is being removed from 1441 places of birth and 809 places of death. Epìdosis 08:09, 5 April 2026 (UTC)
- Of course https://d-nb.info/gnd/4074335-4, copy-paste mistake. --Epìdosis 14:43, 4 April 2026 (UTC)
- You mean GND 4074335-4 (London & Greater London). --Kolja21 (talk) 14:20, 4 April 2026 (UTC)
- teacher at a Gymnasium (Q131717117) and professor at a Gymnasium (Q1558050) share the same identifier, so the occupation was added twice. In a case like this we could use does not exactly match (Q42415624). (Rarely used so far.) Example: Bildredakteur vs. Fotojournalist. @Emu: FYI. --Kolja21 (talk) 12:43, 6 April 2026 (UTC)
- OK I should probably leave the most generic, so professor at a Gymnasium (Q1558050), removing the other, right? Epìdosis 07:53, 7 April 2026 (UTC)
- Yes, this would be good. I don't know how many professions this applies to. Emu might know more about this topic. --Kolja21 (talk) 09:01, 7 April 2026 (UTC)
- 1596 cases found through https://w.wiki/Ksv6 - however, looking again at the two items, the most generic one seems teacher at a Gymnasium (Q131717117), since professor at a Gymnasium (Q1558050)subclass of (P279)teacher at a Gymnasium (Q131717117); so should I remove professor at a Gymnasium (Q1558050) instead? Epìdosis 09:17, 7 April 2026 (UTC)
- Yes please. The professional title "Gymnasialprofessor" is outdated. (It may still be in use in Austria.) --Kolja21 (talk) 09:47, 7 April 2026 (UTC)
- 1596 cases found through https://w.wiki/Ksv6 - however, looking again at the two items, the most generic one seems teacher at a Gymnasium (Q131717117), since professor at a Gymnasium (Q1558050)subclass of (P279)teacher at a Gymnasium (Q131717117); so should I remove professor at a Gymnasium (Q1558050) instead? Epìdosis 09:17, 7 April 2026 (UTC)
- Yes, this would be good. I don't know how many professions this applies to. Emu might know more about this topic. --Kolja21 (talk) 09:01, 7 April 2026 (UTC)
- OK I should probably leave the most generic, so professor at a Gymnasium (Q1558050), removing the other, right? Epìdosis 07:53, 7 April 2026 (UTC)
- I have found both Bremen (Q1209): federated state of Germany and Bremen (Q24879): city in the Bremen federated state, Germany have https://d-nb.info/gnd/4008135-7 - suggestions @Kolja21, Emu:? As of now, both have been added as birthplace and deathplace. Thanks, --Epìdosis 16:34, 17 April 2026 (UTC)
- "Place of birth" usually refers to a city rather than a state (Bundesland), so Bremen (Q24879) should be correct. --Kolja21 (talk) 18:41, 17 April 2026 (UTC)
- https://qlever.dev/wikidata/tiyVqC found 2116 cases; I keep the city, so I am removing the state through https://qs-dev.toolforge.org/batch/27973/. We should find a way to avoid this duplication in future imports, although I am reluctant to deprecate the statement in the item of the state. Ideas? Epìdosis 19:31, 17 April 2026 (UTC)
- The GND entry makes it clear that it conflates both (Verwendungshinweis: Ansetzung für das Land und die Stadt Bremen), so neither of the statements can be deprecated. Or both should be deprecated, and a conflation item created (if there isn’t one yet). —Tacsipacsi (talk) 09:00, 20 April 2026 (UTC)
- Bremen isn't a conflation (= error). In some cases does not exactly match (Q42415624) would be a good solution (see Help:P227) but unfortunately, there is one user who is known to be disruptive. --Kolja21 (talk) 12:17, 20 April 2026 (UTC)
- According to the English and German descriptions of conflation (Q14946528), conflation is usually erroneous – that is, not always. So I think what happens here qualifies as conflation (even though it’s clearly intentional on the side of GND), and I’d use reason for deprecated rank (P2241)undifferentiated identifier value (Q68648103) rather than reason for deprecated rank (P2241)does not exactly match (Q42415624) – especially because the latter assumes that there is an exactly matching subject, which isn’t the case here (I don’t know about a Wikidata item whose topic would be “Bremen as a city or as a state”). —Tacsipacsi (talk) 09:58, 21 April 2026 (UTC)
- In the case of GND undifferentiated identifier value (Q68648103) is a clearly defined term, see difference between name (Tn) and person (Tp). Kolja21 (talk) 20:16, 21 April 2026 (UTC)
- According to the English and German descriptions of conflation (Q14946528), conflation is usually erroneous – that is, not always. So I think what happens here qualifies as conflation (even though it’s clearly intentional on the side of GND), and I’d use reason for deprecated rank (P2241)undifferentiated identifier value (Q68648103) rather than reason for deprecated rank (P2241)does not exactly match (Q42415624) – especially because the latter assumes that there is an exactly matching subject, which isn’t the case here (I don’t know about a Wikidata item whose topic would be “Bremen as a city or as a state”). —Tacsipacsi (talk) 09:58, 21 April 2026 (UTC)
- Bremen isn't a conflation (= error). In some cases does not exactly match (Q42415624) would be a good solution (see Help:P227) but unfortunately, there is one user who is known to be disruptive. --Kolja21 (talk) 12:17, 20 April 2026 (UTC)
- The GND entry makes it clear that it conflates both (Verwendungshinweis: Ansetzung für das Land und die Stadt Bremen), so neither of the statements can be deprecated. Or both should be deprecated, and a conflation item created (if there isn’t one yet). —Tacsipacsi (talk) 09:00, 20 April 2026 (UTC)
- https://qlever.dev/wikidata/tiyVqC found 2116 cases; I keep the city, so I am removing the state through https://qs-dev.toolforge.org/batch/27973/. We should find a way to avoid this duplication in future imports, although I am reluctant to deprecate the statement in the item of the state. Ideas? Epìdosis 19:31, 17 April 2026 (UTC)
- "Place of birth" usually refers to a city rather than a state (Bundesland), so Bremen (Q24879) should be correct. --Kolja21 (talk) 18:41, 17 April 2026 (UTC)
- @Kolja21, Epìdosis: currently GND 4028557-1 is on both Jena (Q3150) and Jena-Zentrum (Q1686796) ; presumably only the first should be used. --Thomas Kerboul (BGE) (talk) 08:34, 27 April 2026 (UTC)
- I guess the most generic one, Jena (Q3150); waiting for confirmation to go on with removals. Epìdosis 09:49, 27 April 2026 (UTC)
- Yes, Jena (Q3150) is correct. GND 4028557-1 is about the city, not about the subdivision (GND removed). --Kolja21 (talk) 21:52, 27 April 2026 (UTC)
- I guess the most generic one, Jena (Q3150); waiting for confirmation to go on with removals. Epìdosis 09:49, 27 April 2026 (UTC)
- https://d-nb.info/gnd/124758495 = Joseph Kittinger (Q342862): American military pilot (1928–2022) contains "Flieger" (https://d-nb.info/gnd/124758495) as Beruf; however, Flieger is defined as "Niedrigster Dienstgrad der Luftwaffe" and Joseph Kittinger was an American soldier, so the association is wrong; there might be similar issues in other items, to be checked at the end of the import. FYI @Kolja21:. --Epìdosis 21:46, 20 May 2026 (UTC)
- I've changed it to "Flugzeugführer" (GND 4017678-2). --Kolja21 (talk) 22:04, 20 May 2026 (UTC)
Reports of wrong data in GND persons
[edit]- Ko A-sung (Q291446) = https://d-nb.info/gnd/1037028937: this deathplace was reverted, I agree that it seems to be an error, and Seul should be birthplace instead; @Kolja21: can you fix? @ДолбоЯщер: feel free to report here if you find other mistakes with this import, so that we can check and correct the issue also in the GND. --Epìdosis 19:34, 17 April 2026 (UTC)
Done The location was clearly entered in the wrong field. GND corrected --Kolja21 (talk) 19:43, 17 April 2026 (UTC)
- Giacommo Agostini (Q253450) = https://d-nb.info/gnd/118501046: birthdate 1942 has been put as deathdate (@Kolja21:). --Epìdosis 06:58, 8 May 2026 (UTC)
Done Thanks. I've corrected the typo. --Kolja21 (talk) 07:08, 8 May 2026 (UTC)
- I leave here a general query for GND-Wikidata inconsistencies (https://qlever.dev/wikidata/M2OImd), excluding properties with URL-datatype. --Epìdosis 20:26, 14 May 2026 (UTC)
Same birthyear and deathyear
[edit]- @Kolja21: An urgent issue: we have at least 400 cases + 369 cases (there might be coincidences between the two lists) of GND having caused a birth year = death year, which in most cases is wrong, and is surely wrong in presence of occupation (P106). Could you help in fixing? If you remove the wrong value in Wikidata after you have corrected it in GND, the results will automatically disappear from the query. A big thanks! --Epìdosis 07:32, 8 May 2026 (UTC) P.S. I can add a column with the direct link to the GND entry in the queries if it is useful. --Epìdosis 07:33, 8 May 2026 (UTC)
- OK, I have the improved queries with the link to GND: now 394 cases + 409 cases. --Epìdosis 08:03, 8 May 2026 (UTC)
- If you find correct cases (i.e. the person really died before the first birthday) please add significant event (P793)infant death (Q22004151); this will exclude them from the newly improved queries: 386 cases + 400 cases. --Epìdosis 08:21, 8 May 2026 (UTC)
- Hi @Epì,
- from my work with databases, I noticed that there is often confusion between birth/death dates, through problems of encoding the data ; a single misplaced pipe (or other separator) changes a birth date in death date, or the contrary.
- Often, the issue can be solved by reading (through human eyes, not through bot) the sources cited in the entry, because they are in human language and say in words what has been accidentally wrongly encoded.
- for example on Phyllis Bird (Q21598079) - the LC id https://id.loc.gov/authorities/names/n82052016.html encoded "1934-06-25" as "Death date", but the source in human language says "found: Her The Bible as the Church's book, c1982:CIP t.p. (Phyllis A. Bird) pub. info. (Phyllis Ann Bird; Phyllis Bird; b. 6/25/34; Th.D., Harvard Div. Sch.; AB, U. of CA, Berkeley, grad. wk. in sociology; minister, United Methodist Church; assoc. prof. of Old Testament, Perkins Sch. of Theology, SMU)"
- great way to solve many issues, as long as "sources" are cited in the authority - and also to notify the database manager of a problem on a specific record, which I do on a nearly daily basis for databases I use often ;)
- On Zacharias Klingius (Q5914136), I read a wrong birth date with https://d-nb.info/gnd/129291803 as source ; but when I read DNB, I see "Lebensdaten: 1603-1671"... so, how did your import could add a wrong 1671 date on "birth" ? are there other invisible (to the human eye) data ? or is it encoding error ? Hsarrazin (talk) 10:45, 8 May 2026 (UTC)
- for example, https://d-nb.info/gnd/1194965156 is an obvious encoding error in DNB, since nobody could become an alchemist and be executed for being an alchemist before his first birthday :D
- Do you have a "coherence checker" in your tool, to remove such obvious accidental errors IN the database ?
- also one question please : how come that you add full dates on items, when all I see in the DNB database when I check the source is "years" in "Lebensdaten" ? - see here, for example - and often different from the dates you added : different display for human and machine ? need to log on to get full display ? Hsarrazin (talk) 10:55, 8 May 2026 (UTC)
- @Hsarrazin: thanks very much for your comment. Yes, this is a frequent confusion in library authority files, as it is easy to choose the wrong field were to put a datum especially if the other one is absent (e.g. birth date for persons without death date) or unknown (e.g. death date for persons with unknown birth date). I also confirm that it is usually easy to solve, especially for persons who are still alive and whose birth date has been entered as death date.
- For your very pertinent LoC example (https://id.loc.gov/authorities/names/n82052016.html), @Mcampany: for the fix.
- Yes, with this kind of imports we are indeed going to make emerge a lot of mistakes that came to Wikidata from external sources and that we can now fix back there.
- In GND, the dates are encoded in a way that I have not found elsewhere in any authority file: you can encode a couple of birth-year and death-year is encoded (or just a birth-year or just a death-year); however, if you encode a couple of birth-day and death-day (or just a death-day; a single birth-day cannot be encoded because the system does not allow to store birth-days of living persons, but it can be found sometimes, probably encoded before the rule was added and never removed afterwards), the system also generates a second couple of birth-year and death-year and both are present in the record. In the public interface of DNB catalogue (e.g. https://d-nb.info/gnd/129291803) you never see days, only years; but if you go in the RDF export (e.g. https://d-nb.info/gnd/129291803/about/lds), you find the most precise birthdate and the most precise deathdate, and if you go in the GND Explorer (e.g. https://explore.gnd.network/gnd/129291803) you find all of them. In this case, as you can see in the Explorer, the year dates are correct, but there is an encoding error of the death-day (@Kolja21:). Often, in fact, there are strange inconsistencies between years and days in GND, with the years usually being correct in such cases; however, since from the GND Query Service I extract the most precise dates (basically the same that are found in the RDF export), I got a certain amount of mistakes. But we will fix them soon. Epìdosis 11:18, 8 May 2026 (UTC)
- @Hsarrazin: For the coherence check in GND, I guess (@Kolja21: for confirmation) unfortunately it is completely absent, although having all data well separated in different fields it would be really easy to implement. For the question on full dates, I guess I have already answered in the message above.
- Updated queries: I have removed the filter on GND data added in 2026, let's have a look at all conflicts involving GND data, also imported in previous years; I have also filtered out a few dates with precision lower than year which were not real mistakes. So now we have 447 cases + 444 cases. Epìdosis 11:25, 8 May 2026 (UTC)
- ahhh, I was not aware of that difference between public display and "explorer" one - well, I don't understand German (I'm French), so I cannot even read the labels around the data :D - I do not know if it's possible to go to linked works from an authority, or how :( -
- Thanks for explanations about how all this works ; if librarians have to update 2 fields for birth/date when there is new data, no wonder there can be errors made :( - years should be auto-computed from full birth/death dates, making errors obvious - as a librarian doing that same work myself, I know how it works.- and presently (this moment) working of a simple tool to easily check that kind of accidents ^^
- PS : I began manually checking, deprecating and correcting dates, and occasionally sourced some that were correct... Hope you will be able to sort all of these errors out. Is @Kolja21 a DNB maintainer ? Hsarrazin (talk) 11:42, 8 May 2026 (UTC)
- (https://id.loc.gov/authorities/names/n82052016.html) is
Done - And yes, @Hsarrazin, it would be ideal if we only had to update 1 field. Issues with mixed-up birth and death dates are caused by a really silly but simple typographical error--birth dates are saved in a field called 046$f, while death dates are saved in 046$g. The display on id.loc.gov uses human language for those fields, but the underlying record doesn't, and change is slow-moving. Mcampany (talk) 13:02, 8 May 2026 (UTC)
- indeed, Marc, Unimarc or Intermarc languages make it very easy to mixup data. I am lucky enough to now use a program that has "different boxes" to input birth and death dates, which makes it much more difficult to mix them up, except when really tired
- at least on wikidata we have constraints that help retrieving and correcting the errors. Hsarrazin (talk) 13:18, 8 May 2026 (UTC)
- I would love to have different boxes! Or even just better automatic error detection. Some colleagues and I recently started using Wikidata's constraint violations to detect duplicates in LC authority records. Maybe someday we'll expand to finding errors in them too, but for now there's already more than enough to keep us busy.
- Anyways, love the great work on this import. Thanks, everyone! Mcampany (talk) 13:34, 8 May 2026 (UTC)
- @Hsarrazin: yes, Kolja21 can fix most mistakes in GND, although not everything (there is a complex system of levels); what he cannot fix goes into de:Wikipedia:GND/Fehlermeldung. Thanks for the checks, I hope in around a week we can cleanup everything or mostly everything in the two queries above (now down to 428 and 406 results). Epìdosis 14:30, 8 May 2026 (UTC)
- ohhh, found a nice one caused by a much too common name "Christopher Young"
- Chris Taliutafa Young (Q30275364) was originally created from the enwp article, it was a Samoan chief, dead in 1967.
- then data about a british germanist and historian (born in 1967 - the same year the first one was born)... were put into the mix and at least one other author, active 1977 (that seems born in 1950, according to LC that I couldn't find on WD - the link in the viaf cluster is highly suspicious (as the author), coming from the Peerage input, not academic source)
- I proceeded to a surgical separation by creating Christopher J. Young (Q139713345) where I put the germanist (born in 1967). Obviously, the errors was very ancient, and probably caused by the name, which is very common - and once the first library ID was added, with link to viaf, dominoes fell...
- I think I got all the IDs, but cannot access the National Virtual Library of India ID (P13951) one on Chris Taliutafa Young (Q30275364) to see to which item it pertains.
- If someone could check my work, that would be great :) Hsarrazin (talk) 15:32, 8 May 2026 (UTC)
- nice case of human error when updating birth date from source : mixup of day and year in birth date - and on Eva-Maria Smolka (Q130815381)- should be 4 march 1926 Hsarrazin (talk) 16:46, 8 May 2026 (UTC)
- update of the queries ?
- after 2 hours I corrected it (by deprecating the wrong GND data on Adrien Le Gallo (Q2825209)), it still appears in the list. Did I do it wrong ? Hsarrazin (talk) 17:37, 8 May 2026 (UTC)
- @Hsarrazin: thanks very much for the great work! So, for Chris Taliutafa Young (Q30275364) very good reconstruction for the issue, all the IDs were clearly misplaced (it often happens for old Wikidata items, often regarding politicians and sportspeople, to which wrong VIAF IDs have been attached); after moving the IDs to the correct items usually the best solution is just erasing everything if possible, so that the mistake doesn't propagate anymore to any third parties not taking deprecated rank into account; so I have now rechecked and proceeded in this way.
- As for the queries, https://qlever.dev/wikidata/VblW7u has now 352 results (down from 447) and https://qlever.dev/wikidata/SRAS1w 323 results (down from 444); in fact they are written so that they also show deprecated values, because our final aim is fixing the issue in GND and after that completely removing the wrong value. Using queries without deprecated values shows 330 results (https://qlever.dev/wikidata/QCgyA9) and 323 results (https://qlever.dev/wikidata/nqG4XN) now, and in this second case Adrien Le Gallo (Q2825209) is correctly absent since you deprecated the wrong birthdate from GND. I will go on tomorrow, I am on the second query. Epìdosis 17:51, 8 May 2026 (UTC)
- ok ! I thought the query was automatically updated, so I didn't understand why checked and corrected items didn't disappear.
- maybe we should coordinate with LC maintainer, since I found some items where the "wrong" date that contradicted GND was inverted birth/death in LCCN @Mcampany: Hsarrazin (talk) 17:58, 8 May 2026 (UTC)
- I see you already posted one in WP:P244 maintenance. Thank you! Would you mind making a list and then sharing them all at once when you're finished looking through the queries? That might be a little easier since I worry I might miss an individual post, especially if there are a lot of them. Mcampany (talk) 19:01, 8 May 2026 (UTC)
- @Mcampany I just added a small list in the P244 project page - will add the others I find her. I also think that @Epìdosis can build a query of "recently deprecated" dates with LC sourcing (just like the one we are using now to clean up GND ;) Hsarrazin (talk) 19:32, 8 May 2026 (UTC)
- Sure, it is a pleasure; query added now there, it shows also cases which had been previously deprecated. Epìdosis 20:30, 8 May 2026 (UTC)
- @Mcampany I just added a small list in the P244 project page - will add the others I find her. I also think that @Epìdosis can build a query of "recently deprecated" dates with LC sourcing (just like the one we are using now to clean up GND ;) Hsarrazin (talk) 19:32, 8 May 2026 (UTC)
- I see you already posted one in WP:P244 maintenance. Thank you! Would you mind making a list and then sharing them all at once when you're finished looking through the queries? That might be a little easier since I worry I might miss an individual post, especially if there are a lot of them. Mcampany (talk) 19:01, 8 May 2026 (UTC)
- @Epìdosis I noticed some items where the GND and the Deutsche Biographie give the same date, one for birth, the other for death (with the same id, of course). - at least 4 or 5 until now. for example see Johann Jakob Christian Schwartzkopff (Q94870850)
- I left them untouched.
- Can I presume it's an error on Deutsche Biographie, when retrieving the date, in the wrong place ? Hsarrazin (talk) 19:45, 8 May 2026 (UTC)
- No, unfortunately, each case must be reviewed individually. Source:
- NDB 23 (2007), S. 799 (Schwartzkopff, Louis Victor Robert)
- Died (not born) 1801. I've corrected GND 1017936471 (typo). --Kolja21 (talk) 20:08, 8 May 2026 (UTC)
- then I let you investigate - my german is inexistant, apart from some words that I'm used to - good luck with them :) Hsarrazin (talk) 20:24, 8 May 2026 (UTC)
- Good updates: https://qlever.dev/wikidata/VblW7u has now 159 results (down from 352) and https://qlever.dev/wikidata/SRAS1w 150 results (down from 323). Epìdosis 13:08, 10 May 2026 (UTC)
- Final update on this: yesterday I finished the check (most of the remaining cases where in fact infact deaths, so no mistakes in GND): the two queries are now effectively empty (https://qlever.dev/wikidata/VblW7u and https://qlever.dev/wikidata/SRAS1w have both 2 results, one already fixed and one for which research is ongoing). A great thanks to @Kolja21, Hsarrazin: for all the work! --Epìdosis 08:15, 17 May 2026 (UTC)
- 2 new ones appeared since - one seems to be a physician, so definitely not infant death ;) Hsarrazin (talk) 10:54, 17 May 2026 (UTC)
- Yes, this is the one already fixed (Q137331503), in the sense we have reported it and it will be fixed in a few years by DNB :) Epìdosis 11:00, 17 May 2026 (UTC)
- 2 new ones appeared since - one seems to be a physician, so definitely not infant death ;) Hsarrazin (talk) 10:54, 17 May 2026 (UTC)
- Final update on this: yesterday I finished the check (most of the remaining cases where in fact infact deaths, so no mistakes in GND): the two queries are now effectively empty (https://qlever.dev/wikidata/VblW7u and https://qlever.dev/wikidata/SRAS1w have both 2 results, one already fixed and one for which research is ongoing). A great thanks to @Kolja21, Hsarrazin: for all the work! --Epìdosis 08:15, 17 May 2026 (UTC)
- Good updates: https://qlever.dev/wikidata/VblW7u has now 159 results (down from 352) and https://qlever.dev/wikidata/SRAS1w 150 results (down from 323). Epìdosis 13:08, 10 May 2026 (UTC)
- then I let you investigate - my german is inexistant, apart from some words that I'm used to - good luck with them :) Hsarrazin (talk) 20:24, 8 May 2026 (UTC)
- No, unfortunately, each case must be reviewed individually. Source:
- nice case of human error when updating birth date from source : mixup of day and year in birth date - and on Eva-Maria Smolka (Q130815381)- should be 4 march 1926 Hsarrazin (talk) 16:46, 8 May 2026 (UTC)
- @Hsarrazin: yes, Kolja21 can fix most mistakes in GND, although not everything (there is a complex system of levels); what he cannot fix goes into de:Wikipedia:GND/Fehlermeldung. Thanks for the checks, I hope in around a week we can cleanup everything or mostly everything in the two queries above (now down to 428 and 406 results). Epìdosis 14:30, 8 May 2026 (UTC)
- If you find correct cases (i.e. the person really died before the first birthday) please add significant event (P793)infant death (Q22004151); this will exclude them from the newly improved queries: 386 cases + 400 cases. --Epìdosis 08:21, 8 May 2026 (UTC)
- OK, I have the improved queries with the link to GND: now 394 cases + 409 cases. --Epìdosis 08:03, 8 May 2026 (UTC)
Birthyear after deathyear
[edit]- I open another section for a worse problem, fortunately occurring in fewer items: birthyear after deathyear. We have https://qlever.dev/wikidata/qOD6FL (85 results) and https://qlever.dev/wikidata/XvY0YU (90 results); as in the other case, many results appear in both, and the mistake may not depend from GND. I will start the checks from the second one. Best, --Epìdosis 08:19, 17 May 2026 (UTC)
- Hi, is there reason for deprecated rank (P2241) to indicate that the date use the wrong calendar - on Ibn Malik (Q716704) the dates (600/672) seem to match the Hegire's (Islamic) calendar... Hsarrazin (talk) 11:23, 17 May 2026 (UTC)
- Fixed now, thanks! Epìdosis 13:22, 17 May 2026 (UTC)
- oohhhh, that's how it's done ! thanks - no sure I'll remember it, but thanks :) Hsarrazin (talk) 13:26, 17 May 2026 (UTC)
- All checked, https://qlever.dev/wikidata/qOD6FL is down to 4 results and https://qlever.dev/wikidata/XvY0YU to 2 results, all already reported in de.WP or under investigation in the section below. Thanks again @Kolja21, Hsarrazin:! Epìdosis 13:15, 22 May 2026 (UTC)
- oohhhh, that's how it's done ! thanks - no sure I'll remember it, but thanks :) Hsarrazin (talk) 13:26, 17 May 2026 (UTC)
- Fixed now, thanks! Epìdosis 13:22, 17 May 2026 (UTC)
- Hi, is there reason for deprecated rank (P2241) to indicate that the date use the wrong calendar - on Ibn Malik (Q716704) the dates (600/672) seem to match the Hegire's (Islamic) calendar... Hsarrazin (talk) 11:23, 17 May 2026 (UTC)
- I have created a general query on GND for finding all cases in GND having birthdate after deathdate: https://sparql.dnb.de/fOo4oU. The results in the GND SPARQL endpoint, as you know, are updated monthly: as of now they are 461 (updated 19.04.2026), but I am confident that at the next monthly update in a few days they will be fewer. I have also found a strange encoding issue for some dates, I hope it can be fixed massively: cf. User talk:Diggr#Dates in square brackets. --Epìdosis 10:35, 19 May 2026 (UTC) FYI @Kolja21, Diggr:
- After the update of the dump used in the GND query service from 19 April to 28 May, the results slightly decreased to 456. Epìdosis 08:24, 30 May 2026 (UTC) P.S. this improved version also shows the corresponding Wikidata items: https://sparql.dnb.de/PAgfKO. --Epìdosis 08:57, 30 May 2026 (UTC)
- [edited because I never finished my question - sorry]
- Do you think you could change the link from the GND access to the Explore-Gnd access in the query ? [edited]
- many typos it seems : for Bodewin Keitel (Q99104), for example ; nothing wrong in https://d-nb.info/gnd/137729405 ; the explore GND gives up the problem : https://explore.gnd.network/gnd/137729405?term=137729405&rows=25&pos=1
- same for https://explore.gnd.network/gnd/1194256775?term=1194256775&rows=25&pos=1
- much cleanup seems to be needed in GND - good luck to those who work inside GND ;)
- I only wish it would be possible to have the same kind of checking against other National libraries catalogs to help clean their data -- and people working both in WD and in those catalogs... Hsarrazin (talk) 09:15, 30 May 2026 (UTC)
- Sure, very easy: here is the query with the "normal" URL and the GND Explorer URL (https://sparql.dnb.de/ydVuQN). Of course I also hope that those who work inside GND take care of this :) --Epìdosis 10:20, 30 May 2026 (UTC)
- nice !! I wish I was able to write such neat useful SPARQL queries !*
- is it possible to compare the "year" Lebensdaten with the Exakte Lebensdaten, to detect obvious typos ? (error of century is the more frequent error I found, whatever the database, because it is easy to automatically type 19.. instead of any other century, because our fingers are just used to it.)
- for what I checked (the first 50) thanks to this new query, most discrepancies are pure typos.
- Otto Blaul (Q94875455) is an anachronism, you you compare to "floruit"
- Typo in GND corrected. @Thomas Kerboul (BGE): IdRef missing: https://www.sudoc.fr/080322514. --Kolja21 (talk) 17:03, 30 May 2026 (UTC)
- Christiana Sophia Catharina von Seidensticker (Q94758533) leaves me wondering, since the birthdate on WD is source with GND, that is wrong
- Typo in GND corrected. --Kolja21 (talk) 17:13, 30 May 2026 (UTC)
- Lilly Molnar (Q94753256) has 2 sets of dates on GND, and it seems both birthdates are wrong - I found "1897"... but serious sources need to be checked
- Otto Blaul (Q94875455) is an anachronism, you you compare to "floruit"
- just the kind of "minuscule" errors, that were so difficult to find, when you could not compare data :( Hsarrazin (talk) 13:13, 30 May 2026 (UTC)
- Hi @Hsarrazin:, unfortunately the GND query service does not allow to compare the "year" Lebensdaten with the Exakte Lebensdaten because, in presence of multiple birth or death dates, only the date with the highest precision (i.e. day) is present in the RDF export of the authority record and thus included in the query service. I agree that, also from my experience, most are typos, easily fixable. FYI @Kolja21, Diggr:. Epìdosis 13:25, 30 May 2026 (UTC)
- methinks we have worked too hard on poor GND... it's been displaying "Too Many Requests" for 2 hours now :D Hsarrazin (talk) 17:06, 30 May 2026 (UTC)
- Hi @Hsarrazin:, unfortunately the GND query service does not allow to compare the "year" Lebensdaten with the Exakte Lebensdaten because, in presence of multiple birth or death dates, only the date with the highest precision (i.e. day) is present in the RDF export of the authority record and thus included in the query service. I agree that, also from my experience, most are typos, easily fixable. FYI @Kolja21, Diggr:. Epìdosis 13:25, 30 May 2026 (UTC)
- Sure, very easy: here is the query with the "normal" URL and the GND Explorer URL (https://sparql.dnb.de/ydVuQN). Of course I also hope that those who work inside GND take care of this :) --Epìdosis 10:20, 30 May 2026 (UTC)
- After the update of the dump used in the GND query service from 19 April to 28 May, the results slightly decreased to 456. Epìdosis 08:24, 30 May 2026 (UTC) P.S. this improved version also shows the corresponding Wikidata items: https://sparql.dnb.de/PAgfKO. --Epìdosis 08:57, 30 May 2026 (UTC)
Age above 122 years
[edit]Considering that Jeanne Calment (Q182260) has presently the record of 122 lived years, persons died at an age beyond 122 years are probably mistakes. We have https://qlever.dev/wikidata/SQl7Ih (110 results) and https://qlever.dev/wikidata/ip1jY4 (99 results) to check. --Epìdosis 15:47, 2 June 2026 (UTC)
- Improved versions, removing the persons with subject has role (P2868)supercentenarian (Q1200828): https://qlever.dev/wikidata/a8fmpy (105 results) and https://qlever.dev/wikidata/VVdvwC (97 results). --Epìdosis 15:55, 2 June 2026 (UTC)
- hi, Aḥmad ʻAlī Kuhzād (Q25657111) is a problem of islamic/georgian calendar, obviously ;) Hsarrazin (talk) 16:12, 2 June 2026 (UTC)
- I see quite a bunch of typos in the list - wonder how they arrived there, because consulting https://explore.gnd.network/gnd/170971619?term=170971619& and Rudolf Mondelaers (Q133767592) I do not see it :(
- may I suggest you sort the list by "age" ? - the most obvious typo should be easy to catch :D Hsarrazin (talk) 16:21, 2 June 2026 (UTC)
- @Hsarrazin: Aḥmad ʻAlī Kuhzād (Q25657111): I see, it happens sometimes; this is clearly wrongly encoded, I reported it. Here the sorted queries: https://qlever.dev/wikidata/KKfVw6 (105 results) and https://qlever.dev/wikidata/uLps1y (97 results). Today QLever is not updating real-time for some reason, so you might see a few cases I have already fixed in Wikidata; it will be probably updated in a few hours I guess. Epìdosis 16:24, 2 June 2026 (UTC)
- what I meant is that I don't understand where the typo came from in GND - unless it was entered manually - GND gives the right year - the typo is in WD - like Leo Wannagat (Q94830379) Hsarrazin (talk) 16:39, 2 June 2026 (UTC)
- @Hsarrazin: I think Leo Wannagat (Q94830379) is one of the cases in which fortunately someone has already corrected the mistake in GND; 1012 was in GND in 2020, so between 2020 and 2026 someone has corrected in 1912; now I also updated WD. Epìdosis 17:16, 2 June 2026 (UTC)
- ok, so if I find other similar cases, I can just correct WD :) Hsarrazin (talk) 17:21, 2 June 2026 (UTC)
- @Hsarrazin: I think Leo Wannagat (Q94830379) is one of the cases in which fortunately someone has already corrected the mistake in GND; 1012 was in GND in 2020, so between 2020 and 2026 someone has corrected in 1912; now I also updated WD. Epìdosis 17:16, 2 June 2026 (UTC)
- what I meant is that I don't understand where the typo came from in GND - unless it was entered manually - GND gives the right year - the typo is in WD - like Leo Wannagat (Q94830379) Hsarrazin (talk) 16:39, 2 June 2026 (UTC)
- @Hsarrazin: Aḥmad ʻAlī Kuhzād (Q25657111): I see, it happens sometimes; this is clearly wrongly encoded, I reported it. Here the sorted queries: https://qlever.dev/wikidata/KKfVw6 (105 results) and https://qlever.dev/wikidata/uLps1y (97 results). Today QLever is not updating real-time for some reason, so you might see a few cases I have already fixed in Wikidata; it will be probably updated in a few hours I guess. Epìdosis 16:24, 2 June 2026 (UTC)
- hi, Aḥmad ʻAlī Kuhzād (Q25657111) is a problem of islamic/georgian calendar, obviously ;) Hsarrazin (talk) 16:12, 2 June 2026 (UTC)
authorities requiring further investigation
[edit]- The following dates require investigation to find the correct values and/or file reports, I leave them for you @Kolja21: (and sorry for the many pings today). --Epìdosis 17:53, 8 May 2026 (UTC)
- Ludwig Hahn (Q133489319): 1600-1600 (died as a child, cf. Epitaphium. Ludowigi, D. Philippi Galli Metropolitanæ Magdeburgensis Ecclesiæ Pastoris Filioli)
- Can someone check the exact data? ("Nati 7. Martij Hor. 11. noctis: Mortui 21. eius" = born March 7 at 11 PM; died March 21?) --Kolja21 (talk) 19:30, 8 May 2026 (UTC)
- if so, there is incompatibility with the P106 cathedral preacher (Q23646062) - could there be an homonym ? Hsarrazin (talk) 19:35, 8 May 2026 (UTC)
- I deleted the occupation but I can't correct the DDB link to Ludovicus Gallus: Traktat von der Gebärung des Geistes der Metalle (UB Kassel). --Kolja21 (talk) 19:54, 8 May 2026 (UTC)
- is he there as author (P50) ? I wonder why GND would have built an authority for an infant, would couldn't have written anything :/ Hsarrazin (talk) 19:59, 8 May 2026 (UTC)
- I confirm your interpretation of the Latin; yes, then the occupation is clearly wrong. Epìdosis 20:18, 8 May 2026 (UTC)
- @Hsarrazin: Authority files are assigned not only to authors, but also to individuals about whom a biography (or, in this case, an epitaph) has been published. For archives, a single letter or photo is sufficient to create a GND. --Kolja21 (talk) 20:36, 8 May 2026 (UTC)
- ahhhhh ! that was an epitaph !! - indeed, I know the archival practice, I work in an archive service, where I create authorities everyday.
- what disturbed me was the reference to "Traktat von der Gebärung des Geistes der Metalle" which looked like a book title - but I guess "Ludovicus Gallus" is not a really rare name :/ Hsarrazin (talk) 20:39, 8 May 2026 (UTC)
- If you have time you could write to the UB Kassel. They should correct the linking. --Kolja21 (talk) 23:43, 8 May 2026 (UTC)
- @Hsarrazin: Authority files are assigned not only to authors, but also to individuals about whom a biography (or, in this case, an epitaph) has been published. For archives, a single letter or photo is sufficient to create a GND. --Kolja21 (talk) 20:36, 8 May 2026 (UTC)
- I confirm your interpretation of the Latin; yes, then the occupation is clearly wrong. Epìdosis 20:18, 8 May 2026 (UTC)
- is he there as author (P50) ? I wonder why GND would have built an authority for an infant, would couldn't have written anything :/ Hsarrazin (talk) 19:59, 8 May 2026 (UTC)
- I deleted the occupation but I can't correct the DDB link to Ludovicus Gallus: Traktat von der Gebärung des Geistes der Metalle (UB Kassel). --Kolja21 (talk) 19:54, 8 May 2026 (UTC)
- Jonas Gans (Q94910965): The date of death was noted as the date of birth.
Done --Kolja21 (talk) 00:38, 9 May 2026 (UTC) - Alfred Ahner (Q95296674): 10.05.1861 - 17.11.
1861, son of Gustav Eduard Ahner (Q60838460): no description- 3 GNDs. GND 119487620X conflated with Alexander Reuss (Q60842831): (1844-1912).
Done --Kolja21 (talk) 12:55, 9 May 2026 (UTC)
- 3 GNDs. GND 119487620X conflated with Alexander Reuss (Q60842831): (1844-1912).
- Fritz Müsch (Q139003342): 1900-1900 = 20.sc.
Done --Kolja21 (talk) 02:14, 10 May 2026 (UTC) - Karl-Heinz Schröter (Q95252489): born 1936 in Berlin; writes poetry and feuilletons (LCAuth). I've removed 12.01.1936 - 10.04.1936. Conflation with Karl-Heinz Schröter (Q139737464): no description fixed.
Done --Kolja21 (talk) 20:25, 10 May 2026 (UTC)
- @Emu: Can you take a look? 12.01.1936 Berlin - 10.04.
1936Berlin. The year of death is wrong but maybe you'll find something at the civil registration office (Standesamt). --Kolja21 (talk) 20:32, 10 May 2026 (UTC)- @Kolja21 10. April 1983, Sterbebuch Zehlendorf 1983, Band 2, Nr. 855. --Emu (talk) 22:09, 10 May 2026 (UTC)
- Besten Dank. Ich habe GND und WD entsprechend ergänzt. --Kolja21 (talk) 03:44, 11 May 2026 (UTC)
- @Kolja21 10. April 1983, Sterbebuch Zehlendorf 1983, Band 2, Nr. 855. --Emu (talk) 22:09, 10 May 2026 (UTC)
- @Emu: Can you take a look? 12.01.1936 Berlin - 10.04.
- Michael Klock (Q137331503): 1950-1950. Year of death is wrong. I've filed a report due to technical difficulties. --Kolja21 (talk) 21:09, 10 May 2026 (UTC)
- Volker Wohlfahrt (Q139118420): 18.12.1950 -
19.01.1950; still alive in 2005. Corrected. --Kolja21 (talk) 04:52, 11 May 2026 (UTC) - Kurt Hoppenrath (Q94925331): 1975-1975; died 1975. Typo corrected. --Kolja21 (talk) 05:02, 11 May 2026 (UTC)
- Günter Schilling (Q139005483): 12.05.1988 - 12.05.1988; died 1988. Typo corrected. --Kolja21 (talk) 05:21, 11 May 2026 (UTC)
- Emich X von Leiningen-Hartenburg (Q136643082)
- Indeed. Kaiser und Höfe ID (P1818) had the GND but it wasn't imported. Another construction site. Merged. --Kolja21 (talk) 05:48, 11 May 2026 (UTC)
- Juliane von Hessen (Q139000194)
Done GND is correct, confirmed by other sources: born and died on the same day. --Epìdosis 22:10, 8 May 2026 (UTC)
- Justus Oldenburger (Q139113203) - said to be "Vater von Wilhelm Oldenburger (gest. 24.02.1723)", so, certainly not died when infant ;)
- 23.09.1599 - XX.XX.16XX = already corrected. --Kolja21 (talk) 10:27, 11 May 2026 (UTC)
- Hans Rauch (Q139202145) - 10.6p Personen zu Telekommunikation und Verkehr, Fremdenverkehr ?
- Typo b not p (innkeeper): 10.6b Fremdenverkehr, Hotel- und Gaststättengewerbe. --Kolja21 (talk) 11:59, 11 May 2026 (UTC)
- Julius Flack (Q139233456) - Redakteur in der Hörspielabteilung des Südwestfunks
Done 1947-1947 = fl. 1947. --Kolja21 (talk) 00:33, 12 May 2026 (UTC)
- Gerhard Pöttker (Q139001168) - maybe the correct birthdate can be found
- 03.12.
1996- 02.04.1996. I've deleted the wrong year of birth. The date of death is correct. Later someone can check the source and and fill in the missing information: Samtgemeinde Lathen (2015), S. 499-500. --Kolja21 (talk) 00:48, 12 May 2026 (UTC)
- 03.12.
- Robert Weickert (Q139229432) "Akademischer Grad: Dr. med." - does this mean he was a medical doctor, or something else entirely ?
Done Dr. med. = physician. Merged with Robert Weickert (Q133632551): no description. --Kolja21 (talk) 12:46, 12 May 2026 (UTC)
- Johann Peter Schütz (Q94894330):
Done Typo. Merged with Q138549163 (New item using Relator2). --Kolja21 (talk) 14:14, 12 May 2026 (UTC) - Wilhelm Erick Haake (Q94928116)
Done 1875-1875; infant death. --Kolja21 (talk) 01:41, 15 May 2026 (UTC) - Alfons Roth-Schuler (Q139227329) Waiting for LEO-BW (Q48754091): 503 Service Unavailable. --Kolja21 (talk) 01:55, 15 May 2026 (UTC)
- 1933-1933: I've send an email to the State Archives of Baden-Württemberg. --Kolja21 (talk) 23:10, 15 May 2026 (UTC)
Done 1933-2002: Year of death in GND corrected. (LEO-BW will follow.) --Kolja21 (talk) 15:24, 5 June 2026 (UTC)
- 1933-1933: I've send an email to the State Archives of Baden-Württemberg. --Kolja21 (talk) 23:10, 15 May 2026 (UTC)
- Kurt Paschold (Q123127653) this one was not flagged as problematic, since there was no death date - but GND refers to dewp as source... and dewp gives another date. I let you investigate. --Hsarrazin (talk) 08:14, 15 May 2026 (UTC)
Done Wikipedia (Stand: 25.11.2024) = † nach 1961. The Wikipedia article has been updated. Now also GND includes the exact year of death. --Kolja21 (talk) 23:22, 15 May 2026 (UTC)
- Gottfried Siegmund Germann (Q133076573)
Done 1766-1766; fl. 1783 => year of death was a typo. --Kolja21 (talk) 16:34, 16 May 2026 (UTC) - Daniel Müller (Q139010740)
Done Infant death. --Kolja21 (talk) 16:51, 16 May 2026 (UTC) - Rosina Born (Q139016808) (probably infant death)
Done Confirmed. --Kolja21 (talk) 21:15, 16 May 2026 (UTC) - Arthur Loibl (Q95865161) - contradiction between GND and Commons data - error ? or homonym ? --Hsarrazin (talk) 13:05, 16 May 2026 (UTC)
Done Two actors (b. 1969 / b. 1949). Conflation solved. --Kolja21 (talk) 21:30, 16 May 2026 (UTC)
- Ludwig Hahn (Q133489319): 1600-1600 (died as a child, cf. Epitaphium. Ludowigi, D. Philippi Galli Metropolitanæ Magdeburgensis Ecclesiæ Pastoris Filioli)
- From here, issues of birthyear after deathyear
- Albin Freytag (Q139118774) 1979-1948 but still active Oktober 1959. That doesn't make sense. --Kolja21 (talk) 01:12, 19 May 2026 (UTC)
- Maria Martell (Q139228603)
Done Typo in the year of birth. --Kolja21 (talk) 01:22, 19 May 2026 (UTC) - Agnis von Reiboldt (Q136854433)
Done Typo in the year of birth. --Kolja21 (talk) 01:40, 19 May 2026 (UTC) - Andreas Creutzer (Q139123761) Typo in the year of birth (now deprecated). Born 1609 and (sic) born 04.09.1698. --Kolja21 (talk) 02:01, 19 May 2026 (UTC)
- Friedrich Pietzker (Q126945163) 1778-1832 according to description.
Done Father of the opera singer Friederike Wittenburg (Q126944531). --Kolja21 (talk) 00:51, 19 May 2026 (UTC) - Johann von Thill (Q133093326)
Done Typo in the year of birth. --Kolja21 (talk) 20:14, 21 May 2026 (UTC) - Christoph Meurer (Q93844540) GND OK but we need @Thomas Kerboul (BGE): for IDREF. --Epìdosis 13:08, 21 May 2026 (UTC)
- Johann Michael Caletzki (Q60831395)
Done Typo in the year of birth. --Kolja21 (talk) 04:57, 22 May 2026 (UTC) - Johann David Götsch (Q55911869)
Done Year of birth + death moved to work period (1664-1680). --Kolja21 (talk) 06:59, 22 May 2026 (UTC) - Berta Becker (Q131322253) Created as conflated item. No description and no source for 1833-1898 given. --Kolja21 (talk) 06:11, 23 May 2026 (UTC)
- It was a test OpenRefine import, tagged "RISM test Ingest", which made me think of "musicians" and "composers" and "interprets"... RISM standing for " (Répertoire International des Sources Musicales" (Internation repertory of Musical sources)
- could she be one of the many "Madame Becker" or "mademoiselle Becker" cited on partitions and in newspapers ? - the wrong adding of GND from the start being an error from a newbie... ; but I cannot search efficiently in these sources in german/english... can one of you guys do it ? see the database here
- I see 2 possibilities, from now : keep the "GND" item and its correct other data, and delete wrong unsourced data ; or split into 2 items, and putting the "GND" data on a new item (to avoid repurposing) Hsarrazin (talk) 09:04, 23 May 2026 (UTC)
Done I've created a new item. GND 126021635 = Berta Becker (Q139906028): no description. --Kolja21 (talk) 07:26, 24 May 2026 (UTC)
- Pierre Hermand (Q109858381)
Done Born 1920 was unsourced. I've corrected the year of birth and added IdRef as source. --Kolja21 (talk) 04:48, 22 May 2026 (UTC) - Bernd Reutemann (Q132927581) surely not born in 169
- Sandra Jendrzej (Q133394637) surely not born in 188
- Gerardo Ibarra (Q133367065) surely not born in 548
There are quite a bunch of babies in the 16th, 17th and 18th century, that were probably created for archival purpose, I guess. I stopped after a dozen. Too sad, and too difficult... I let you check them. – The preceding unsigned comment was added by Hsarrazin (talk • contribs).
- I am checking them. --Epìdosis 10:10, 9 May 2026 (UTC)
- oh, and among them are probably a bunch of the Boveri (Q21491897) family https://kalliope-verbund.info/DE-611-HS-3050329 - don't now how to display labels in the query, so, difficult to check easily --Hsarrazin (talk) 13:35, 9 May 2026 (UTC)
- In this case it would be helpful to create an item for this family. Unfortunately, there is no GND. --Kolja21 (talk) 16:21, 9 May 2026 (UTC)
- well, at least, I added family name (P734) which should make the retrieving easier... - also, there already exists Boveri (Q99370063) - don't know if it is the same though :/ Hsarrazin (talk) 16:39, 9 May 2026 (UTC)
- I've created Q139721055: no description, since Margret Boveri (Q97372) was an important journalist. --Kolja21 (talk) 01:34, 10 May 2026 (UTC)
- well, at least, I added family name (P734) which should make the retrieving easier... - also, there already exists Boveri (Q99370063) - don't know if it is the same though :/ Hsarrazin (talk) 16:39, 9 May 2026 (UTC)
- In this case it would be helpful to create an item for this family. Unfortunately, there is no GND. --Kolja21 (talk) 16:21, 9 May 2026 (UTC)
- oh, and among them are probably a bunch of the Boveri (Q21491897) family https://kalliope-verbund.info/DE-611-HS-3050329 - don't now how to display labels in the query, so, difficult to check easily --Hsarrazin (talk) 13:35, 9 May 2026 (UTC)
Addition of references to existing values of P21
[edit]
Notified participants of WikiProject Data Quality Wikidata has a lot of unreferenced sex or gender (P21) and this has been often considered as a problem; in order to mitigate this issue using GND data, I have previously used my bot (Wikidata:Requests for permissions/Bot/EpidòseosBot), which ran on ca. 350k items. Now, using the query services of GND and Wikidata, we can reach a far better efficiency. I have ready an import of GND references only for existing P21 values, specifically 560455 unreferenced sex or gender (P21)male (Q6581097) and 237041 unreferenced sex or gender (P21)female (Q6581072). While preparing this addition, I have also found 2081 discrepancies: they are now collected in Property talk:P227/Gender mismatches. If there are no objections, I will run this import of references next week, after the import above (#Massive import of data from GND (May 2026)) has been completed. Best, --Epìdosis 14:39, 5 May 2026 (UTC) P.S. As a statistical note, as of today we have 11092855 values of P21, out of which 5229708 with at least one reference, in most cases [4882973] exactly one reference (source); the 797496 to-be-added references would thus increase the percentage of referenced P21 values from 47.1% to 54.3%. --Epìdosis 15:28, 5 May 2026 (UTC)
- Hi,
- only saw that discussion today :)
- could you please add VIAF cluster ID (P214) for Q5 items in your imports, please ? Normally, each GND record should match a viaf entry, and it's very painful to complete items with only GND id without it... I regularly stumble on items for French authors, and have no way to check Bnf/Sudoc, except by searching there, instead of matching through Viaf. – The preceding unsigned comment was added by Hsarrazin (talk • contribs).
- Hi @Hsarrazin:! I am not massively creating new items on the basis of GND, I am only enriching existing items, as you can see from above discussions. That said, I perfectly much agree that having a VIAF cluster ID (P214) besides GND ID (P227) is vital. There is already a bot extracting VIAF for items having only GND or other VIAF members, Wikidata:Requests for permissions/Bot/DifoolBot 2 (FYI @Difool:). In theory I could extract VIAF ID from GND, but some experiments proved me that often the VIAF values inside GND records are severely outdated, so it is much better to extract VIAF IDs directly from VIAF, which I am personally not able to do. --Epìdosis 14:59, 5 May 2026 (UTC)
- ok !
- I work manually on an item by item basis, because my daily job is creating and aligning Authorities in our Library catalog. I'll continue to add (or update) Viaf wherever they should be ^^
- I manually corrected or sourced a batch of French author I know in my catalog. And, by the way, I noticed that quite of lot of erroneous genders are the work of User:MrProperLawAndOrder (i.e. blocked Tamawashi) :( Hsarrazin (talk) 16:56, 5 May 2026 (UTC)
Addition of referenced P106 values
[edit]As of now, 750345 humans containing GND ID (P227) have one or more values of occupation (P106) all unreferenced (query: https://qlever.dev/wikidata/NWtTVM). For most of them (611661) one or more referenced occupation (P106) can be added from GND, thus solving mostly the issue. If there are no objections, I will run this addition next week, together with the addition of P21, so that the total number of edits is not 797496+611661 but smaller, since each item will be edited only once (either for P21 or for P106 or for both). Thanks, --Epìdosis 07:46, 7 May 2026 (UTC)
- Sounds great, thanks. - PKM (talk) 20:30, 7 May 2026 (UTC)
- The import is ongoing; some issues have been reported to me because there is a small group of values that are in fact invalid for occupation (P106) (cf. User talk:Epìdosis#import data from GND); this is due to both unforeseen GND errors (field of study inputted into Beruf instead of Studienfach) and also the strange way in which GND encodes in RDF the value of Studienfach together with the values of Beruf (cf. User talk:Diggr#fieldOfStudy and professionOrOccupation). This is a provisorial list of problematic P106 values imported: https://qlever.dev/wikidata/rl7pb0. I will make more accurate statistics and massively fix them, mainly through
{{Autofix}}, after the import is finished. Epìdosis 17:41, 16 May 2026 (UTC)- Also related: museum director (Q22132694): executive in charge of a museum = GND 4398836-2 is a occupation (P106) in GND, but we consider that it is a position held (P39) in Wikidata. KrBot is already doing the autofixes (see e.g. the history on Q121709). --Thomas Kerboul (BGE) (talk) 08:52, 17 May 2026 (UTC)
- The import finished a few hours ago. I have already fixed some tens of thousands of redundant occupations, removing throug QS batches preexisting values which had imported from Wikimedia project (P143)- or no references and were less precise. In the meanwhile, Autofix should also deploy. I will look at the overall situation during the next days. If you notice cases I should prioritise, feel free to report them in this section (FYI @Pallor, Thomas Kerboul (BGE), Kolja21:). Thanks! Epìdosis 16:37, 22 May 2026 (UTC)
- Thanks for the ping. KrBot has moved the tasks that are actually assignments (P31 = position (Q4164871)) and were listed in the P106 talk page under Autofix.
- Two potential issues may arise:
- 1; They are not listed in the autofix, so KrBot does not move them; they remain as occupation even though they are actually position.
- 2; They are actually already listed in the position (P39), but country-specifically. E.g.: justice minister (Q17763739) vs Minister of Justice of Hungary (Q50586662), or KrBot moved them and they therefore appear twice.
- These are probably harder to filter out; I’ve corrected three so far, but since 2 million occupations were added, this could have happened multiple times.
- Examples: Q389720, Q599637. Thank you Pallor (talk) 21:33, 22 May 2026 (UTC)
- Thanks for raising the issue. I will investigate soon and add more to the autofix (BTW, issues are also being reported with some female occupations, I have just added one autofix and others will probably be needed); I will also have a look at redundances for ministers on position held (P39). Epìdosis 08:06, 23 May 2026 (UTC)
- The import finished a few hours ago. I have already fixed some tens of thousands of redundant occupations, removing throug QS batches preexisting values which had imported from Wikimedia project (P143)- or no references and were less precise. In the meanwhile, Autofix should also deploy. I will look at the overall situation during the next days. If you notice cases I should prioritise, feel free to report them in this section (FYI @Pallor, Thomas Kerboul (BGE), Kolja21:). Thanks! Epìdosis 16:37, 22 May 2026 (UTC)
- Also related: museum director (Q22132694): executive in charge of a museum = GND 4398836-2 is a occupation (P106) in GND, but we consider that it is a position held (P39) in Wikidata. KrBot is already doing the autofixes (see e.g. the history on Q121709). --Thomas Kerboul (BGE) (talk) 08:52, 17 May 2026 (UTC)
- The import is ongoing; some issues have been reported to me because there is a small group of values that are in fact invalid for occupation (P106) (cf. User talk:Epìdosis#import data from GND); this is due to both unforeseen GND errors (field of study inputted into Beruf instead of Studienfach) and also the strange way in which GND encodes in RDF the value of Studienfach together with the values of Beruf (cf. User talk:Diggr#fieldOfStudy and professionOrOccupation). This is a provisorial list of problematic P106 values imported: https://qlever.dev/wikidata/rl7pb0. I will make more accurate statistics and massively fix them, mainly through
- Hi @Epìdosis, I saw here that voice (Q17172850) was mistakenly added as occupation. Difool (talk) 17:25, 27 May 2026 (UTC)
- Thanks very much for the report, this is caused by the strange behaviour of the GND RDF export as other previous cases, I reported it here; in the meanwhile, in Wikidata I have added an autofix occupation (P106)voice (Q17172850) -> instrument (P1303)voice (Q17172850) to solve the issue. Epìdosis 19:39, 27 May 2026 (UTC)
- I also saw here that Pope (Q19546) was added as occupation (P106), should be position held (P39). Difool (talk) 15:04, 31 May 2026 (UTC)
- well, if that's the case, many corrections to do ^^ - https://w.wiki/QRBs (178 P106 ; 270 P39) Hsarrazin (talk) 15:12, 31 May 2026 (UTC)
- Thanks @Difool, Hsarrazin:, removed all the occupation (P106)Pope (Q19546) leaving only the position held (P39)Pope (Q19546) through https://quickstatements.toolforge.org/#/batch/259233. Epìdosis 18:46, 31 May 2026 (UTC)
- well, if that's the case, many corrections to do ^^ - https://w.wiki/QRBs (178 P106 ; 270 P39) Hsarrazin (talk) 15:12, 31 May 2026 (UTC)
- I also saw here that Pope (Q19546) was added as occupation (P106), should be position held (P39). Difool (talk) 15:04, 31 May 2026 (UTC)
- Thanks very much for the report, this is caused by the strange behaviour of the GND RDF export as other previous cases, I reported it here; in the meanwhile, in Wikidata I have added an autofix occupation (P106)voice (Q17172850) -> instrument (P1303)voice (Q17172850) to solve the issue. Epìdosis 19:39, 27 May 2026 (UTC)
Incorrect name order
[edit]GND seems contain a number of entries that has surname and given name reversed. Example: Waldstromer Conrad (Q94944478) and Groland Ulrich (Q95330243). This also affects other databases using GND as identifier, such as Deutsche Biographie. GZWDer (talk) 20:04, 25 April 2026 (UTC)
- That's a kown problem with low-quality bulk imports. Waldstromer and Groland aren't family names. The database fields in question are called Persönlicher Name (name) + Zusatz zum persönlichen Namen (addition). This applies in particular to the Middle Ages and the nobility. --Kolja21 (talk) 22:41, 25 April 2026 (UTC)
GNDplus
[edit]Hi! How should we deal with the new GNDplus identifiers (more info here)? Should they be added to Wikidata and, if so, here or in a new property?
Notified participants of WikiProject Germany --Printstream (talk) 08:17, 24 May 2026 (UTC)
- Regarding: "How should we deal with the new GNDplus identifiers (more info here)?"
- https://wiki.dnb.de/spaces/GND/pages/463145380/GNDplus : "Der GNDplus dient erstens als Inkubator für neue GND-Normdatensätze und zweitens als Bereich, um Informationen zu ergänzen" - the wording is inconsistent, can "neue GND-Normdatensätze" not be used to "Informationen zu ergänzen" and is the "Bereich" also an "Inkubator" from where data will later be added to the GND itself? Also "GNDplus Datenraum (GNDplus)" what does that mean? Why is there "nun den GNDcs. Mit dem GNDcs".
- The GND itself isn't a "Bereich, um Informationen zu ergänzen"? They already have the levels 1 to 6(?) is GNDplus something like level 7?
- There is also an "Infographik zur Funktionsweise des GNDplus Datenraum (GNDplus)" and the graphic doesn't contain the term "GNDplus" but "GNDcs"
- Maybe best: give them feedback, they seem to lack consistency and as such are a source of more chaos.
- Regardinng: "Should they be added to Wikidata and, if so, here or in a new property?"
- Example: Archiv der Familie Langenbach
- Q139945971
- https://explore.gnd.network/gnd/plus-1060815036
- Beschreibende Angaben: Entitätentyp:
- plus (aus GNDplus)
- win (Sammlung)
- Idenntifikatoren: GND:
- GNDplus-ID plus-1060815036
- PPN plus-1060815036
- Katalogisierende Institution
- ISIL: DE-101 / Redaktion DE-101
- Datum der Ersterfassung: 12.11.2014
- Satzart: TuE
- Beschreibende Angaben: Entitätentyp:
- https://d-nb.info/1060815036 redirect to https://portal.dnb.de/opac.htm?method=simpleSearch&cqlMode=true&query=idn%3D1060815036 , i.e. idn=1060815036
- not found:
- As the GNDplus is currently designed, it cannot be used in Property:P227 without producing links to 404-urls as long as the latter has formatter URL ( https://www.wikidata.org/w/index.php?title=Property:P227&oldid=2496530834#P1630 ) https://d-nb.info/gnd/$1 as preferred and not e.g. https://explore.gnd.network/gnd/$1" .
- Also, as of 2026-05-27, each example at https://explore.gnd.network/search?f.entitycode=plus&rows=100 has "Sammlung (win)", and each of the first three in the list had the ID part after "plus-" as DNB idn (https://d-nb.info/1315526522 https://d-nb.info/988550865 https://d-nb.info/1223476316) the entities can be identified by the "DNB edition ID" (Property:P1292).
- Note: not each "DNB edition ID" is a contained in GNDplus, e.g. https://d-nb.info/574197265 Emil und die Detektive : ein Roman für Kinder / Erich Kästner. Ill. von Walter Trier - but https://explore.gnd.network/gnd/plus-574197265 "Diese Seite existiert nicht Meinen Sie: https://explore.gnd.network https://explore.gnd.network/search"
- Summary:
- as long as 404-links would be created: don't add in P227
- as long as there are no IDs outside the DNB edition IDs : don't create a new property
- GNDplus (talk) 16:02, 27 May 2026 (UTC)
- If I understand correctly those new GNDplus are some kind of work-in-progress identifiers that will need a validation from a GND agency to become full GNDs. There is some similarities in IdRef, where a librarian can propose a new RAMEAU indexing term and use it right away while the decision to accept it or not is pending. It is not rare that the proposition ends up rejected and thus further work is needed to replace the proposition where it is used. I fear it would be the same for those GNDplus and I would refrain from using it in Wikidata, because that would create futher work to periodically check their status; it's also not clear if their are persistent or not. --Thomas Kerboul (BGE) (talk) 08:59, 28 May 2026 (UTC)
- The sockpuppet has found a new playground. Instead of improving his poor imports he created Q139945971 as version, edition or translation (Q3331189). Always keeps us busy. @Epìdosis, Lymantria: Sockpuppet of Tamawashi. --Kolja21 (talk) 10:45, 28 May 2026 (UTC)
- @Printstream: Imho we can ignore the "GNDplus-Datensätze im GND Explorer". Right now, it's just an experiment. --Kolja21 (talk) 10:58, 28 May 2026 (UTC)
- You're right. It is in an early state and we're trying out what works. Once we enter the beta phase, we'll make an announcement. Diggr (talk) 07:15, 2 June 2026 (UTC)
- @Printstream: Imho we can ignore the "GNDplus-Datensätze im GND Explorer". Right now, it's just an experiment. --Kolja21 (talk) 10:58, 28 May 2026 (UTC)
- The sockpuppet has found a new playground. Instead of improving his poor imports he created Q139945971 as version, edition or translation (Q3331189). Always keeps us busy. @Epìdosis, Lymantria: Sockpuppet of Tamawashi. --Kolja21 (talk) 10:45, 28 May 2026 (UTC)
- If I understand correctly those new GNDplus are some kind of work-in-progress identifiers that will need a validation from a GND agency to become full GNDs. There is some similarities in IdRef, where a librarian can propose a new RAMEAU indexing term and use it right away while the decision to accept it or not is pending. It is not rare that the proposition ends up rejected and thus further work is needed to replace the proposition where it is used. I fear it would be the same for those GNDplus and I would refrain from using it in Wikidata, because that would create futher work to periodically check their status; it's also not clear if their are persistent or not. --Thomas Kerboul (BGE) (talk) 08:59, 28 May 2026 (UTC)
question about consulting GND database, please
[edit]Hi, if you've not understood yet, I don't speak german - at all - which means it's very difficult for me to find a very simple functionality in the database, apart from the displayed parts.
my question is, from a GND item (in https://d-nb.info/gnd/ or Explore GND), how can I access the linked works the person has authored, or is subject of ? - I have a person, and I'm not sure if he's the one I look for ; I'd like to see the linked works to estimate probabily that he's the right guy, or just an homonym (living person without dates) ; how can I know which works he's related to ?
Note : the guy I'm looking for is a "Michael Feik", and I'm not sure at all that he's the same as https://explore.gnd.network/gnd/1061292959?term=1061292959
thanks for your help :) Hsarrazin (talk) 13:05, 31 May 2026 (UTC)
- Hi I'm not an expert, but I think I can help you:
- normaly you can use https://swb.bsz-bw.de/DB=2.104/SET=2/TTL=1/LNG=EN/NXT?COOKIE=Us998,Pbszgast,I2017,B20728,SY,NRecherche-DB,D2.104,Ed2900659-0,A,H,R193.197.31.8,FY at the bottom of the page you get three links:
- related publications | ...written by/involved in... | publication about...
- unfortunately there is nothing related in this case.
- In the GND Explorer you can find the section Katalogisierende Institution there: DE-Wi17FP --> if you look this up in https://isil.staatsbibliothek-berlin.de/isil/DE-Wi17FP the DFF - Deutsches Filminstitut & Filmmuseum / filmportal.de created the GND. and I was able to find a single on their website (not in the Personen section but in their website search: https://www.filmportal.de/person/michael-feik_090a784db1ca4672bf694123df0f0b28
- so the related works are:
- 1991 Lorenza (1991 is the same date as the time mentioned in the GND)
- 1982 Der Zwilling
- Hope this was helpfull. Wetterwolke (talk) 18:03, 31 May 2026 (UTC)
- thanks for this help... he is probably NOT the co-author of this book : https://search.worldcat.org/fr/title/174400613
- but I hoped you could tell me "where" in the normal GND authority display (or in explore GND) I would find the link/button to linked works (on any person)... because I often have this problem, and I do not understand writings. I do not intend to use a third german database - it's difficult enough for me like this (frow WD)
- in authority databases I use everyday (french/english), there is a visible link to works that allows to access all works in the Catalog, either authored or about the person. I cannot find it on GND.
- Is there NO direct link to linked works ? or just on this one Person ? Hsarrazin (talk) 18:23, 31 May 2026 (UTC)
- The current GND Explorer does not display linked works, and it likely won't in the near future. To find works by a specific person, click the bottom button on the right-hand side ("Partner network") and then select "BEACONAggregator," or go directly to https://prometheus.lmu.de/gnd/{ID}. If the person has literature in one of the major German library catalogs, it will be listed right at the top (e.g., 119545373). Click on these links to see their associated works. In this particular case, there is no library in Germany that holds works by this person. --Printstream (talk) 18:47, 31 May 2026 (UTC)
- ok ! thanks ! "Beacon aggregator" then :) - that's all I needed to know :D - it's always very difficult to find your way through a catalog, in a language you don't understand :D Hsarrazin (talk) 19:23, 31 May 2026 (UTC)
- The current GND Explorer does not display linked works, and it likely won't in the near future. To find works by a specific person, click the bottom button on the right-hand side ("Partner network") and then select "BEACONAggregator," or go directly to https://prometheus.lmu.de/gnd/{ID}. If the person has literature in one of the major German library catalogs, it will be listed right at the top (e.g., 119545373). Click on these links to see their associated works. In this particular case, there is no library in Germany that holds works by this person. --Printstream (talk) 18:47, 31 May 2026 (UTC)
- Germany-related properties
- All Properties
- Properties with external-id-datatype
- Properties used on 1000000+ items
- Authority control properties
- Properties with entity type constraints
- Properties with scope constraints
- Properties with format constraints
- Properties with qualifiers constraints
- Properties with conflicts with constraints
- Properties with single best value constraints
- Properties with unique value constraints
- Properties with label language constraints
- Properties with complex constraints