Academia.edu no longer supports Internet Explorer.
To browse Academia.edu and the wider internet faster and more securely, please take a few seconds to upgrade your browser.
2018
The number of Open Statistical Data available for reuse is rapidly increasing. Linked open data technology enables easy reuse and linking of data residing in different locations in a simple and straightforward manner. Yet, many people are not familiar with the technology standards and tools for making use of open statistical data. In this tutorial, we will introduce Linked Open Statistical Data (LOSD) and demonstrate the use of LOSD technologies and tools to visualize open data obtained from various European Countries. We will also give the participants the opportunity to use these tools thus obtaining a personal experience on their capabilities.
Proceedings of the 18th Annual International Conference on Digital Government Research, 2017
Open data have tremendous potential which however remains unexploited. A large part of open data is numerical and highly structured. We concentrate on those data and capitalize on linked open data (LOD) as the underlying technology. In this paper, we present a number of tools to facilitate publishing and reusing of linked open statistical data. We propose an architecture and implementation that allows developing custom visualization and analysis tools without knowledge of LOD technologies. We further present work towards deploying relevant tools in six di erent countries to improve decision-making and transparency and thus support public administration.
2014
A major part of Open Data concerns statistics such as population figures, economic and social indicators. The adoption of the Linked Data principles and technologies has promised to enhance the analysis of statistical data at a Web scale. Statistical data, however, is typically organized in data cubes where a numeric fact is categorized by dimensions. Both data cubes and Linked Data introduce complexity that raises the barrier for opening up and reusing statistical data. In this paper we describe the first release of the OpenCube toolkit that aims at supporting the whole lifecycle of linked data cubes. In particular, the OpenCube toolkit supports transforming raw data into RDF data cubes, attaching metadata, and providing query access to them. In addition, the toolkit enables linked data cube browsing and exploration as well as performing data analytics in an easy manner.
To make the Web of Data a reality, and push large scale integration of, and reasoning on, data on the Web, huge amounts of data must be made available in a standard format, reachable and manageable by Semantic Web tools. National statistical offices across the world already possess an abundance of structured data, both in their databases and files in various formats. We will first consider the reasons for making such data available as Linked Data. Then, some novel data representation methods, compatible with SDMX, an international standard for exchanging statistical data, will be showcased. We will then explain how to move from structured statistical data, represented in XML, to RDF, as well as how to enrich such datasets using standard classification schemas. Finally, we will present a way of increasing data visibility through cataloging newly created linked statistical data at both local and international level.
This demo presents LODStats, a web application for collection and exploration of the Linked Open Data statistics. LODStats consists of two parts: the core collects statistics about the LOD cloud and publishes it on the LODStats web portal, a front-end for exploration of dataset statistics. Statistics are published both in human-readable and machine-readable formats, thus allowing consumption of the data through web front-end by the users as well as through an API by services and applications. As an example for the latter we showcase how to visualize the statistical data with the CubeViz application.
2017
An increasing number of governments, organisations and enterprises publish huge amounts of data as open data. The major part of this data concern statistics, e.g. data in the Open Data Portal of the European Commission (open-data.europa.eu). This data has the potential for innovative uses including performing advanced data analytics and visualizations on top of combined data that were previously isolated. From a technological point of view, statistical data can be combined by employing linked open data technologies and, specifically, the W3C RDF Data Cube vocabulary [1]. This has the potential to realise the vision of performing data analytics on top of integrated but previously isolated statistical data across the Web [2] [3]. Although several practical solutions have been developed during the last years for creating and exploiting Linked Open Statistical Data (e.g. [4] [5]), these solutions are mainly technology-driven and are not able to address the complexity and dynamics of pub...
2012
Linked Open Data (LOD) is a growing movement for organizations to make their existing data available in a machinereadable format. There are two equally important viewpoints to LOD: publishing and consuming. This article analyzes the requirements for both sub-processes and presents an example of publishing statistical data in RDF format and integrating the data into the LOD cloud via the PublicData.eu portal. In particular, it discusses the establishment of the Serbian CKAN metadata repository that serves for publishing open governmental data from Serbia, as well as a source catalogue for the PublicData.eu portal. Furthermore, by using an illustrative case study of the Statistical Office of the Republic of Serbia, it elaborates the adaption of the LOD2 Stack for analysis and dissemination of official statistics information.
2021
In the context of the European project “GIOCOnDa”, this paper describes the conversion process from Open Data to Linked Open Data (LOD) and its implementation in the GIOCOnDa LOD platform. The platform contains a number of conversion configurations that allow different data sources from a variety of Open Data domains to be converted into LOD, without the need of software programming. In addition, the platform is configurable and extensible, as it enables to define mapping configurations for new datasets. Keywords-Linked Open Data (LOD); GIOCOnDa; OntoPia.
Ki - Künstliche Intelligenz, 2015
In recent years. Linked Open Data (LOD) has matured and gained acceptance across various communities and domains. Large potential of Linked Data technologies is seen for an application in scientific disciplines. In this article, we present use cases and applications for an application of Linked Data in the social sciences. They focus on (a) interlinking domain-specific information, and (b) linking social science data to external LOD sources (e.g. authority data) from other domains. However, several technical and research challenges arise, when applying Linked Data technologies to a scientific domain with its specific data, information needs and use cases. We discuss these challenges and show how they can be addressed.
Most current visualizations for Linked Open Data are created for a single purpose or a single dataset. These ad hoc approaches can hardly exploit the linkedness of LOD, and we miss the tools for comfortable and enjoyable LOD browsing. On the other hand WWW has mature generic visualization: the web browsers. With the LODmilla browser we try to find the basic commodity features for generic LOD browsing including views, graph manipulation, searching, etc. With our browser users can navigate and explore multiple LOD datasets and they can also save LOD views and share them with other users.
Procedia Computer Science, 2014
The Open Access movement and the research management can take a new turn if the research information is published as Linked Open Data. With Linked Open Data, the management of the research information within institutions and across institutions can be facilitated, the quality of the available data can be improved and their availability to the public is assured. However, it can be difficult for non-expert users to take advantage of the interlinked information offered by Linked Open Data as they lack of indepth knowledge. In this paper, we present a use case of publishing research metadata as Linked Open Data and creating interactive visualizations to support users in analyzing the Flemish research landscape.
Lecture Notes in Computer Science
Demographic, economic, social and other datasets are often used in policy-making processes. These types of statistical data are opened more and more by governments, which enables the use of these datasets by the public. However, statistical data needs often to combine different datasets. Data cubes can be used to combine datasets and are a multi-dimensional array of values typically used to describe time series of geographical areas. While Linked Open Statistical Data (LOSD) cube software is still in an initial stage of maturity, there is a need for evaluation the software platforms used to process this open data. Yet there is a lack of evaluation methods. The objective of this ongoing research paper is to identify functional requirements for open data cubes infrastructures. Eight main processes are identified and a list of 23 functional requirements are used to evaluate the OpenCube platform. The evaluation results of a LOSD platform show that many functions are not automated and need to be manually executed. We recommend the further integration of the building blocks in the platform to reduce the barriers for the use of datasets by the public.
2013
In the last few years, with the rise of the open data movement, a large and increasing number of governments and organizations have started to make information freely available and easily accessible online. Additionally, in order to increase transparency and improve interoperability and interaction with citizens and society as a whole, but also create new businesses and job opportunities, national governments publish their data in a machine-readable and future-proof format. In this paper we present the LOD2 Statistical Workbench, an integrated set of professional tools for accessing, manipulating, exploring and publishing statistical data. The data representation and processing is based on the W3C standard vocabularies (RDF Data Cube as a main model) and open source components delivered by the LOD2 consortium. The system meets the needs of both publishers and consumers of statistical data and directs the potential of the LOD2 tools to the specific domain of the statistical office. U...
Information, 2019
An important part of Open Data is of a statistical nature and describes economic and social indicators monitoring population size, inflation, trade, and employment. Combining and analyzing Open Data from multiple datasets and sources enable the performance of advanced data analytics scenarios that could result in valuable services and data products. However, it is still difficult to discover and combine Open Statistical Data that reside in different data portals. Although Linked Open Statistical Data (LOSD) provide standards and approaches to facilitate combining statistics on the Web, various interoperability challenges still exist. In this paper, we propose an Interoperability Framework for LOSD, comprising definitions of LOSD interoperability conflicts as well as modelling practices currently used by six official open government data portals. Towards this end, we combine a top-down approach that studies interoperability conflicts in the literature with a bottom-up approach that studies the modelling practices of data portals. We define two types of LOSD schema-level conflicts, namely naming conflicts and structural conflicts. Naming conflicts result from using different URIs. Structural conflicts result from different practices of modelling the structure of data cubes. Only two out of the 19 conflicts are currently resolved and 11 can be resolved according to literature.
Semantic Technology, 2018
In 2016, the Japanese Statistics Center published a large-scale statistical linked open data (LOD) site consisting of approximately 300 million triples. The LOD simplify processing data, such as filtering, aggregation, and integration of data. The aim of this action is to promote domestic and international utilization of the statistics. In this paper, we introduce publishing processes and a use case of statistical LOD. Afterwards, we show our approach to speed up the SPARQL search for the vast number of LOD.
2013
The access information law approved by the Brazilian government regulates the provision of open government data in the Web. However, they are heterogeneous, unstructured and derived from independent sources, making it difficult to interconnect. This paper presents a process of identifying sources, ontology generation, mapping and publishing statistical linked data in the form of multidimensional cubes, represented by the RDF Data Cube Vocabulary. In this process, data are transformed and assigned semantic meaning through its connection with domain ontologies. Through a web application, the publication of these data is automated, allowing for future analysis operations with the use of Online Analytical Processing (OLAP). As a result, the approach is expected to increase the scale in the publication of statistical linked data, and therefore, increasing the potential for analysis.
Web Semantics: Science, Services and Agents on the World Wide Web, 2016
Publishing and sharing open government data in Linked Data format provides many opportunities in terms of data aggregation/integration and creation of information mashups. Statistical data, that contains various performance indicators and their evolution through time, is an example of data that can be used as the foundation for policy prediction, planning and adjustments, and can be re-used in different applications. However, due to Linked Data being relatively a new field, currently there is a lack of tools that enable efficient exploration and analysis of linked geospatial statistical datasets. Therefore, ESTA-LD (Exploratory Spatio-Temporal Analysis) tool was developed to address some of the Linked statistical Data management issues, such as crossing the statistical and the geographical dimensions, producing statistical maps, visualizing different measures, and comparing statistical indicators of different regions through time. This paper discusses the modeling approach that was adopted so that the published data conform to the established standards for representing statistical, spatial and temporal data in Linked Data format. The main contribution is related to the delivery of state-of-the-art open-source tools for retrieving, quality assessment, exploration and analysis of statistical Linked Data that is made available through a SPARQL endpoint.
In the recent years the Linked Open Data phenomenon has gained a substantial traction. This has lead to a vast amount of data being available on the Web in what is known as the LOD cloud. While the potential of this linked data space is huge, it fails to reach the non-expert users so far. At the same time there is even larger amount of data that is so far not open yet, often because its owners are not convinced of its usefulness. In this paper we refine our Linked Data Visualization Model (LDVM) and show its application via its implementation Payola. On a real-world scenario built on real-world Linked Open Data created from Czech open data sources we show how end-user friendly visualizations can be easily achieved. Our first goal is to show that using Payola, existing Linked Open Data can be easily mashed up and visualized using an extensible library of analyzers, transformers and visualizers. Our second goal is to give potential publishers of (Linked) Open Data a proof that simply by publishing their data in a right way can bring them powerful visualizations at virtually no additional cost.
Many national and international organizations today leverage semantic web technologies to make statistical datasets available as Linked Open Data (LOD). A key advantages of this approach is that the data not only becomes publicly available, but also machine-readable and hence suitable for automated discovery and exploration. Whereas this has great potential to support interesting use cases, it remains difficult for end users today to utilize and combine these statistical Linked Data. Three challenges are: (i) directing users to relevant data sources based on a specified location; (ii) facilitating data integration despite a lack of outgoing links between datasets; and (iii) offering flexible means to integrate and aggregate data from various sources. As time and location are highly relevant dimensions in most statistical data, we address the identified challenges by first constructing geographical metadata for statistical sources. Following a mashup approach, we introduce mechanisms to recommend interesting datasets to end users and automatically enable data integration, visualization, and comparisons based on userdefined criteria.
2014
syntax to make statements about resources, and thanks to LOD principles, construct links to available external datasets in a simple manner, making them accessible and queryable through the Internet and promoting data reusability. As the amount of information at hand is greater every day, the need to handle it efficiently becomes a key requirement for anybody interested in working with it. This situation settles a great scenario for the Information Visualization field (or InfoVis, as it is known by academics and industry), taking advantage of humans capacity to identify patterns and gain insights from visual representations of abstract data. InfoVis positions itself in the intersection of other data-related fields: Statistics, Analytics, Dissemination and so on. One of the biggest issues concerning mass adoption of LOD outside the Semantic Web (SW) community, is the technical and conceptual knowledge required to take full advantage of the benefits provided by this type of data publis...
2016
The StatDCAT Application Profile is an extension of the DCAT Application Profile for Data Portals in Europe, version 1.1 (DCAT-AP). Its purpose is to provide a specification that is fully conformant with DCAT-AP version 1.1 as it meets all obligations of the DCAT-AP Conformance Statement. Its basic use case is to facilitate a better integration of the existing statistical data portals with the Open Data Portals, improving the discoverability of statistical datasets. StatDCAT-AP defines a small number of additions to the DCAT-AP model that are particularly relevant for statistical datasets. This will be beneficial for the general data portals, enabling enhanced services for the discovery of statistical data.
Loading Preview
Sorry, preview is currently unavailable. You can download the paper by clicking the button above.