Leodis data

From Leeds Hackspace Wiki
Jump to navigation Jump to search

Links

PeoplesNetwork data feeds

1) If you head on over to http://www.peoplesnetwork.gov.uk/discover/advanced scroll down and click leodis, and then click the search button to see all records from leodis. Not much use from an API perspective ;)

2) Get the URL from above, and wipe out the stylesheet= parameter from the URL, you're talking to the search web service and can see a bit more detail. The links in the records like

<str name="aggregator.internal_record_link">/dpp/resource/415750/stream/CultureGrid_Item</str>

Are links to the repository itself. If you navigate to http://www.peoplesnetwork.gov.uk/dpp/resource/415750 You'll see a page listing all the different representations of that resource we have. By adding /stream/<<type>> you can get all the different metadata types for the record, and it's thumbnail. There should be a link to the actual digital artifact in all the records.

This search web service and front end is handy if you want to do searching, but if you're after a full-on harvest of the dataset you're better to go with the OAI-PMH web service instead....

3) Use OAI-PMH to harvest the leodis collection.. The base URL for this would be:

http://www.peoplesnetwork.gov.uk/dpp/oai?verb=listRecords&metadataPrefix=CultureGrid_Item&set=Leodis

This gives you the first page, after that you take the resumption token from the end of the document and create a new URL using resumptionToken as

http://www.peoplesnetwork.gov.uk/dpp/oai?verb=listRecords&resumptionToken=sid=411884;mprefix=CultureGrid_Item;set=Leodis;from=;to=

(IE, the params are replaced with the resumptionToken and the verb).

This *should* be a complete set of Leodis records. I know you were interested in things like user generated content and augmentation? We've got a project on the go at the moment to try and extract and autmatically enhance the records with spatial and subject category information. If you were building anything on top of this data, it would be ace if you could retain back some kind of semantic link to the source record so we can share anything that might be generated. I'd be really interested in any kind of API we might be able to provide that might allow you to annotate the records and then get the extra info out in the searches and the OAI service.

In terms of spatial search and such, the access points should be documented by the SRU explain service at http://www.peoplesnetwork.gov.uk/discover/xsl/default?operation=explain

Hope this is of some help to you, be really interested in talking some more / feedback / other ways of helping / using the data / data sources.

Database design

Schema can be found: [1]

Data gathering