Skip to content

LETG/csw-harvester

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

16 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

csw-harvester

Synopsis

This is a python script that harvests metadata from CSW web services and saves some information from these metadata in a postgreSQL database.

Motivation

This script is used to analyze Spatial Data Infrastructures for the GEOBS research project : https://www-iuem.univ-brest.fr/pops/projects/geobs.

Dependencies

How to run

The PostgreSQL database must first be created. A database dump is provided with database/csw_harvester.sql

Physical Data Model

The host, port, database name, schema, user and password must be set in csw-harvester.py.

The CSW list is read from a CSV file ; an example is provided with sources-csw.csv. For each CSW, you can set a start in each step (for example, if set at 30, records will be extracted 30 by 30). Lines can be commented with #

You can then run the python script csw-harvester.py with the following options :

The completion mode (true by default) is used to run another iteration of the script without overwriting the data already stored in the database. This is useful if for example one CSW stopped working after a given record and you want to start again from this record.

The date option is used to force the extraction date stored in the database, if - for example - you are using the completion mode and want the extraction date to be the same for all metadata.

License

This project is published under the General Public License v3.

About

a python script that harvests metadata from CSW web services

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages