This change brings a host of stability changes to improve deployment on Postgres and Elasticsearch, changes to the underlying database architecture to improve metadata analytic querying.
It also, includes two new additions from the ISO 11179 specification - Organizations and Identifiers. Organization will allow us to build new and better extensions to support data management where information isn’t standardised - our first test caase will be the upgrade of the Aristotle Dataset Extensions to support the W3C DCAT data registry format, but we’ve already seen a bunch of interest in using Organizations in publishing workflows.
By implementing 11179 identifiers, we’re building capability for metadata records to have multiple identifiers with fully qualified namespaces. This will improve how Aristotle instances are federated, and also improve how Aristotle and other metadata systems and formats will be able to communicate using common identifiers. For users, this also provides more assurance around accessing information without having to expose internal database identifiers, and will prevent clashes between systems.
For example, the Aristotle open registry includes a lot of content from the AIHW METeOR Metadata Registry.
With the addition of Organizations and Identifiers, ‘
meteor’ is now a fully qualified namespace in the Aristotle open registry, which means a user can browse to
/identifier/meteor/349510 on the open registry, and every other Aristotle system that was
federating METeOR content and get the exact same metadata - every time - and link it back to the authoritative source on METeOR.
This represents a big step forward for federated standards-based government metadata systems!
Also included is a new research project in the Aristotle labs - a Data Dictionary CSV Uplaoder. This project is still in development, but we’ve been collaborating with metadata custodians to get feedback on a minimum specification for Data Dictionaries to help new users quickly build metadata for import into a registry, without requiring extensive training or specialised tools. This format is based on ISO 11179 and the fields from AS4590 and allows a user to build a data dictionary within a standard spreadsheet tool and upload a CSV to help populate a registry with new metadata - and by tying this format into the Aristotle Metadata Registry, we can perform intelligent heuristics to improve metadata reuse and identification within the registry to improve data matching efforts down stream.
Below is a sample of the data dictionary format, made using Excel:
When imported using the new uploader, fields can be matched to records that are already in the database:
You can provide feedback on the data dictionary tool on its GitHub page, or by emailing us with the details below.
Due to the underlying changes to the database architecture, its advised to check out the Aristotle wiki for details on how to upgrade to version 1.4.