Mediaflux concepts
- assets - the file content and associated metadata
- namespaces - remote folder structure on the mediaflux server
- XML-based
- has it's own query language
- asset.get - display metadata for an asset
- asset.set - alter metadata for an asset
- asset.query - find assets based on metadata queries
Mediaflux commands
Retrieving metadata
The general command is:
asset.get :id ASSET-ID asset.get :id "path=FULL-PATH-TO-FILE"
If we wanted to extract a specific piece of information, such as the checksum, we use the optional xpath argument:
pawsey:/projects/Data Team/testfiles>asset.get :id 69776098 :xpath content/csum value = 384461DE
Populating metadata
asset.set :id ASSET-ID <ELEMENT AND VALUE>
asset.set :id 69776160 :name "new filename" version = 2 { changed-or-created=true stime=74452766 }
Note: a new version of the asset is created - reflecting the fact that the name has been altered.
A more complex example, where we assign a geospatial location (in this case a point) to an asset:
asset.set :id 69776160 :geoshape < :point < :latitude -31.95 :longitude 115.86 :elevation 10.0 > > version = 3 { changed-or-created=true stime=74452768 }
This last case is an example of specifying an XML document that details the xpath + value for metadata items.
It is equivalent to an XML metadata document that looks like this:
<geoshape> <point> <latitude> -31.95 </latitude> <longitude> 115.86 </longitude> <elevation> 10.0 </elevation> </point> </geoshape>
Slight digression - metadata templates
The doc type shows the metadata that can be queried and specified.
Custom doc types can be made.
Asset is a special "first-order" type attached to everything.
Custom doc types get added under the <meta> element.
asset.doc.type.describe :type asset
Things such as the asset name and the geoshape and other items in the asset template are first order metadata items and treated as described above in terms of asset.get and asset.set.
Custom metadata templates are slightly different, as they sit under the <meta> element of the asset, rather than at the top level.
pawsey:/projects/Demo/sean>asset.doc.type.describe :type csiro:seismic ... definition = None element = None { type=string min-occurs=0 name=name } element = None { type=string min-occurs=0 name=geometry } element = None { type=string min-occurs=0 name=basin } element = None { type=string min-occurs=0 name=sub-basin } element = None { type=string min-occurs=0 name=data-type } element = None { type=string min-occurs=0 name=vertical-scale } element = None { type=string min-occurs=0 name=project }
Applying this template to a piece of data could be achieved as follows:
pawsey:/projects/Demo/sean>asset.set :id 69776160 :meta < :csiro:seismic < :name "Perth" :geometry "sprawling" > > version = 5 { changed-or-created=true stime=74452772 } pawsey:/projects/Demo/sean>asset.get :id 69776160 :xpath meta/csiro:seismic value = Perth value = sprawling
Querying the metadata
The simple form of an asset query is:
asset.query :where "LOGICAL-EXPRESSION" asset.query :where "namespace='/projects/Demo'" asset.query :where "namespace>='/projects/Demo' and name='*.jpg'" asset.query :where "namespace>='/projects/Data Team/sean' and (name='*.JPG' or name='*.PNG')"
Simple actions on the results
asset.query :where "namespace>='/projects/Data Team/sean' and name='*.jpg'" :action get-value :xpath -ename name name
Exercises
Exercise 1 - upload a file and then inspect the metadata in the system.
Exercise - querying metadata - geospatial search? TODO - import some geoshape attached