{"id":3744,"date":"2022-10-21T10:35:59","date_gmt":"2022-10-21T08:35:59","guid":{"rendered":"https:\/\/distam.hypotheses.org\/?p=3744"},"modified":"2024-10-24T18:15:01","modified_gmt":"2024-10-24T16:15:01","slug":"transcripts-moissonnage-archivage-de-site-web-en-cas-dabsence-de-lapi","status":"publish","type":"post","link":"https:\/\/distam.hypotheses.org\/3744","title":{"rendered":"transcripts : moissonnage\/archivage de site web en cas d\u2019absence de l\u2019API"},"content":{"rendered":"\n<p>Travaillant sur les Balkans, Katarina Risti\u0107 et Nikola Risti\u0107 (Leipzig University, Allemagne) ont mis \u00e0 disposition un outil de moissonnage appliqu\u00e9 au probl\u00e8me sp\u00e9cifique du site web du Tribunal P\u00e9nal International pour l\u2019ex-Yougoslavie [lire :\u00a0<a class=\"\" href=\"https:\/\/trafo.hypotheses.org\/40678\"><em>Web Scraping and Digital Archives: A Program for the Retrieval of the Transcripts of the International Criminal Tribunal for Former Yugoslavia<\/em><\/a>]. Cet outil permet la collecte automatis\u00e9e de donn\u00e9es du site web qui n\u2019a pas sa propre API. Le code, ainsi que les instructions d\u00e9taill\u00e9es sur la fa\u00e7on de l\u2019installer, sont disponibles sur\u00a0<a class=\"\" rel=\"noreferrer noopener\" href=\"https:\/\/github.com\/nikolarist\/transcripts\" target=\"_blank\">GitHub<\/a>.<\/p>\n\n\n\n<p><\/p>\n\n\n\n<p><em>Image mise en avant : capteur d&#8217;\u00e9cran du <a href=\"https:\/\/www.icty.org\/en\/cases\">site web du Tribunal P\u00e9nal International pour l\u2019ex-Yougoslavie<\/a><\/em><\/p>\n","protected":false},"excerpt":{"rendered":"<p>Travaillant sur les Balkans, Katarina Risti\u0107 et Nikola Risti\u0107 (Leipzig University, Allemagne) ont mis \u00e0 disposition un outil de moissonnage appliqu\u00e9 au probl\u00e8me sp\u00e9cifique du site web du Tribunal P\u00e9nal International pour l\u2019ex-Yougoslavie [lire :\u00a0Web Scraping and Digital Archives: A Program for the Retrieval of the Transcripts of the International&#46;&#46;&#46;<\/p>\n","protected":false},"author":43803,"featured_media":3749,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_license":"","footnotes":""},"categories":[326,98],"tags":[399,444],"ppma_author":[567],"class_list":["post-3744","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-veille","category-outils-de-traitement","tag-archives-du-web","tag-outils"],"authors":[{"term_id":567,"user_id":43803,"is_guest":0,"slug":"liao","display_name":"Shueh-Ying LIAO","avatar_url":"https:\/\/secure.gravatar.com\/avatar\/5fba617a2db144ff380f7a8333968760b2cbb34cfe1bba72ae5013d05477e99e?s=96&d=blank&r=g","1":"","2":"","3":"","4":"","5":"","6":"","7":"","8":""}],"_links":{"self":[{"href":"https:\/\/distam.hypotheses.org\/wp-json\/wp\/v2\/posts\/3744","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/distam.hypotheses.org\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/distam.hypotheses.org\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/distam.hypotheses.org\/wp-json\/wp\/v2\/users\/43803"}],"replies":[{"embeddable":true,"href":"https:\/\/distam.hypotheses.org\/wp-json\/wp\/v2\/comments?post=3744"}],"version-history":[{"count":1,"href":"https:\/\/distam.hypotheses.org\/wp-json\/wp\/v2\/posts\/3744\/revisions"}],"predecessor-version":[{"id":3752,"href":"https:\/\/distam.hypotheses.org\/wp-json\/wp\/v2\/posts\/3744\/revisions\/3752"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/distam.hypotheses.org\/wp-json\/wp\/v2\/media\/3749"}],"wp:attachment":[{"href":"https:\/\/distam.hypotheses.org\/wp-json\/wp\/v2\/media?parent=3744"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/distam.hypotheses.org\/wp-json\/wp\/v2\/categories?post=3744"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/distam.hypotheses.org\/wp-json\/wp\/v2\/tags?post=3744"},{"taxonomy":"author","embeddable":true,"href":"https:\/\/distam.hypotheses.org\/wp-json\/wp\/v2\/ppma_author?post=3744"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}