This is a working document. A final version will be posted when discussions are completed.
To Do:
- Better define all proposed dcterms, provide scope notes and examples (what is allowable and what is not)
- As a practice if an element is CORE we will attempt to enforce conformity of values to a certain standard. If Not Quite Core, we will not attempt to enforce conformity to any standard.
- Determine if it is possible to “re-map” Dome or use the new terms going forward
- Ask Carl if reports are possible (asked on 1/26 – he’s looking into it) - Received 02/8
Research use of dcterms:mediator
To Discuss:
Question 1 for Carl: Can/How do we move the contents of a field into another field? How do we do it en masse? See question 3 for Carl.
Question 2 for Carl: Is it significantly different or more difficult to map just field at a time rather than multiple fields?
Question 3 for Carl: What is the difference in changing fields within a namespace versus migrating all the fields to a new namespace? We found that you can move metadata fields from one schema to the other within the dome metadata registry editing module.
Good Question 1 How will our recommended changes to the DOME metadata registry affect indexing in DOME
Good Question 2: How will our changes affect metadata harvesting, either through OAI-PMH, Google, or other harvesters.
It sounds like this is a good candidate for a MIT Libraries Project Proposal.
It also sounds like we need to put together a plan of attack. What changes to make how and when.
We should add some fields from VRA core in dome-test and put together a metadata record that contains metadata from multiple schemas to see how it looks in DOME.
We may want to identify pieces of this list of DOME adjustments that we can fast track to assist new collections going into DOME.
dc.contributor.attribute
Considering mapping all custom contributor fields to dcterms:contributor and creating an agent display field that would display names with associated roles
Custom Contributor Fields
- .advisor = 1 use (can be mapped to dcterms:contributor with no problems)
- .author = 4488 uses (Question: will mapping to dcterms:contributor create duplicate names in that field? Can we dedup this field after mapping?)
- .editor = 0 uses
- .illustrator = 0 uses
- .other = 99 uses (can be mapped to dcterms:contributor with no problems)
Even if we do not decide to go to dcterms:contributor we should:
- immediately cease using custom contributor fields
- map custom contributor fields to existing dc.contributor.none field (Question: Is it significantly different or more difficult to map just a few fields rather than the whole set)
We want to go ahead and recommend the addition of a custom name display field to aggregate all names and roles. This field would need to be established in our custom "dome" namespace. This field would be included in the brief record in DOME. While consolidating contributor fields can be accomplished programmatically, it may prove difficult to generate display fields for items already deposited in DSpace. All items from Rotch will already have a display field (currently in the dc.creator field -- we may want to migrate this metadata to the new display field before we start consolidating contributor fields). Archives has information in their METS export that could be used to produce the display name field. This information is currently not being put in DOME. For existing items, it would need to be concatenated and added to the records.
dcterms: available vs. dc.date.accessioned
both are DSpace system dates. Is this duplication necessary? Do we care? DSpace will automatically populate both fields unless the filed is already populated prior to deposit. dc.date.accessioned is the date the system took in the item. dc.date.available is intend for use with embargoed items.
dcterms: datecopyrighted vs. dcterms: issued
issued will be used for the publication date of formally published items. Do we need to know when something was copyrighted if it is different from the date issued?
dcterms:created for non-published items
dcterms:issued for published items
both dcterms:created and dcterms:issued are indexed
We will use dcterms:datecopyrighted for items that different copyright and publication dates. dc.date.datecopyrighted has never been used in DOME. While possible, it will be likely seldom used. We do not think that it is enough that an item only has a dcterms:datecopyrighted field. It must have either a dcterms:created or dcterms:issued field, the dcterms:datecopyrighted is strictly supplemental.
dcterms: date vs. dcterms:created and dcterms:issued vs. dcterms:temporal
Do we want to use this field as a display field for all the various dates that can be associated with an item?
We want a date display field(s) for various kinds of date expressions and items with multiple associated dates that need to be concatenated. Candidates are dcterms:date, dcterms:temporal, dcterms:created, dcterms:issued and dcterms:date We want to allow various collections to follow their own cataloging rules in choosing the date format.
- When describing the coverage of an item (the date as the subject) use dcterms:temporal.
- When concatenating a bunch of dates use a dome:dates field (to be added to metadata registry in custom dome namespace, the namespace is also used of displaying multiple authors and roles).
- When including multiple kinds of dates (for example, the date a photograph was taken and the date the content of the photograph was created) choose the most important date and put in either dcterms:created or dcterms:issued. Put other dates in dcterms:date or in dcterms:temporal.
We need to determine how prescriptive we need to be in controlling the syntax of the various date fields. THIS IS THE MOST IMPORTANT QUESTION We need to ask Carl how best to use date ranges and other kinds of date expressions (for example, ca. 1976) in indexed date fields. Carl thinks this field is indexed as a string of text. We are leaning towards unconcern over the indexing of date fields and the allowance of what ever kind of date expression is most appropriate for the content being described. We are also considering restricting dcterms:created and dcterms:issued to single dates and ranges, reserving dome:dates for ca., approx., etc.
We need a solution that does not require significant editing of metadata export from source repositories, nor significant programming/scripting/processing by the deposit mechanism/process.
DomeCORE “Notes” elements (No further action needed)
Use dcterms:abstract for those things that are formally abstracts, dcterms:tableOfContents for tables of contents, dcterms:description for all else. At least one of dcterms:abstract, dcterms:description, or dcterms:tableOfContents is required to satisfy the DOME Core MD Requirement. Most items will have descriptions.
dcterms:identifier and dcterms:identifier.other
We are discussing using two identifier fields, one for an official identifier (not assigned by DOME), and identifier.other for all other types of identifiers. Should this field remain qualified after metadata table update? We think it should, but are unsure of how to accomplish that. We choose to keep the practice of using separate fields for primary identifiers, secondary identifiers and the DOME handle. What we want to do is move the secondary and handle fields from our to a custom dome namespace/schema.
dc.identifier.attribute
Map all the various custom identifier fields to dome:otherIdentifier (except for dc.identifier.uri)
dc.identifier.uri
This is where the handle goes. Map to dome:URI
dc.identifier.vendorcode
Used for visual items from IRIS. Can this be mapped into dome:otherIdentifier? Yes.
We need to be able to distinguish between the DOME assigned handle, the primary identifier assigned previous to deposit in DOME, and all other identifiers assigned previous to deposit. Currently no secondary identifier needs its own custom field. This may change in the future.
The dcterms namespace contains just one flavor of identifer, dcterms:identifiers. Any qualification or customizations would need to be made in our custom dome namespace. dcterms:identifier is indexed for search
Use dc.identifier for the primary identifier assigned previous to deposit in DOME. This identifier is usually assigned by the system used to catalog the items prior to deposit or by the cataloger according to the particular policy of the collection being cataloged.
For the DOME assigned Handle use dome.URI defined in our custom dome namespace. Question for Carl: if we move Handles to our custom dome namespace will this affect the harvesting of metadata from DOME via the OAI-PMH or other protocols.
For all other identifiers assigned previous to deposit in DOME use dome:otherIdentifier defined in our custom dome namespace.
---- Stopped here 2011-02-08 ----
dcterms:language
(asking Carl for usage report) If it does not cause any major issues, we recommend merging dc.language.iso with dc.language.
dc.language is never used. We will map dc.language.iso to dcterms:language.
dc.relation.ispartofseries
(asking Carl for usage report) recommend merging this with dc.realation.ispartof = dcterms:isPartOf
dc.relation.ispartof is used 236 times
dc.relation.ispartofseries is used 4271 times
as far as we know only in the 'MIT Whirlwind' and 'Communications Forum' communities. Items in these communities are often assigned multiple dc.relation.ispartofseries.
There should be no problem in migrating dc.relation.ispartofseries to dcterms:isPartOf along with dc.relation.ispartof
dc.relation.type
map to VRA relation type instead of a dcterm
This field has not been used in DOME and it is misnamed. The intended element comes from the VRA Core schema and should be named '[whatever the vra namespace prefix is]:worktype'. We will add this element in a new VRA schema to be added to the metadata registry in DOME.
dc.relation.type should be removed from the dc schema in DOME
dcterms:accessRights
Core or Not Quiet Core? Answer: Not Quite Core pending further information from legal counsel.
dc.rights.uri vs. dcterms:rights
Primarily used for creative commons licenses. Do we really need a separate field for this or can we just put the URIs in the regular rights field.
dc.rights.uri is not used in DOME, we would like to remove that field. Any future copyright URIs can go into dcterms:rights.
dc.subject.attribute
map all custom subject fields to dcterms:subject
dc.subject has 37 uses
dc.subject.lcsh has 304909 uses
dc.subject.classification has 0 uses
dc.subject.ddc has 0 uses
dc.subject.lcc has 0 uses
dc.subject.mesh has 0 uses
dc.subject.other has 111788 uses. This field contains AAT terms, some LCSH headings, College and University Thesaurus terms. Since it is already a catch all and not every lcsh heading is included in the dc.subject.lcsh field, the custom DOME qualifications carry little meaning. We would like to collapse them all to dcterms:subject.
dcterms:type
We need to revise the current type vocabulary. We would like to adopt a practice of mandatory assignment of value from our controlled vocabulary and optional assignment from other vocabularies. Should the optional type values be entered in a separate field?
Todo: Identify the one vocabulary that we will use for the DOME Core type field. This vocabulary will be mapped to dcterms:type.
When a type vocabulary is defined within a specificaiton (for example, VRA Core type) use the identified field within that schema (Adding the schema and field to the Dome metadata registry).
For type vocabularies that are not defined within a metadata specification (for example, local type vocabularies) we will use a 'localType' (need to find a better name) element in our homegrown DOME metadata schema.
Todo: Separate all the different vocabularies from the dc.type field. This will be messy.
---- Stopped here 2011-02-15 ----
dcterms:publisher.institution
added to identify MIT Libraries/Archives as the content holder or distributor (could also be an MIT DOME scheme?)
also looking into the use of dcterms:mediator as a possible option
dc.format.support
use VRA
For Carl:
Usage reports for:
- dc.descripiton.sponsorship & dc.description.statementof responsibility (we are leaning toward removing these fields)
- dc.description.uri
- dc.format.mimetype (Is this an internal DSpace field?)
- dc.language.iso (system use?)
- dc.relation.isbasedon (We recommend renaming this element to the dcterms standard “dcterms:source”. This may conflict with another md field “dc.source.uri”.)
- dc.relation.ispartofseries (We recommend merging this with dc.relation.ispartof.)
Other:
- Why are 55 and 56 both dc.source.uri & are these fields being used?
Metadata Terms:
(List does not include elements listed above in the questions “to discuss”)
DomeCORE:
dcterms:contributor
dcterms:creator
dcterms:description
dcterms:abstract
dcterms:tableofcontents
dcterms:identifier
dcterms:issued
dcterms:created
dcterms:rights
dcterms:title
dcterms:type
NotQuiteCORE:
dcterms:spatial
dcterms:temporal
dcterms:provenance
dcterms:extent
dcterms:medium
dcterms:format
dcterms:bibliographicCitation
dcterms:language
dcterms:publisher
dcterms:hasPart
dcterms:hasVersion
dcterms:isFormatOf
dcterms:isPartOf
dcterms:source
dcterms:isReferencedBy
dcterms:isReplacedBy
dcterms:isVersionOf
dcterms:replaces
dcterms:requires
dcterms:relation
dcterms:subject
dcterms:alternative
New NotQuiteCore:
dcterms:conformsTo
dcterms:hasFormat
dcterms:isRequiredBy
dcterms:references