This chapter addresses the description of an encoded item so that the musical text, as well as its sources, encoding, and revisions are all thoroughly documented. Such documentation is necessary for scholars using the texts, for software processing them, and for catalogers in libraries and archives. Together these descriptions and declarations provide an electronic analog to the title page attached to a printed work. They also constitute an equivalent for the content of the code books or introductory manuals customarily accompanying electronic data sets.
Every MEI-conformant text not embedded in another XML carrier that provides for capturing metadata, such as TEI or METS, must carry a set of descriptions, prefixed to it and encoded as described in this chapter. This set is known as the MEI header, tagged
meiHead, and has five major parts:
one or more alternative identifiers, tagged with
altId, each of which provides an identifying name or number associated with the file.
a file description, tagged
fileDesc, containing a full bibliographic description of the computer file itself, from which a user of the text could derive a proper bibliographic citation, or which a librarian or archivist could use in creating a catalog entry recording its presence within a library or archive. The term computer file here is to be understood as referring to the whole intellectual entity or document described by the header, even when this is stored in multiple physical operating system files. The file description also includes information about the source or sources from which the electronic document was derived. The MEI elements used to encode the file description are described in section
File Description below.
an encoding description, tagged
encodingDesc, which describes the relationship between an electronic text and its source or sources. It allows for detailed description of whether (or how) the text was normalized during transcription, how the encoder resolved ambiguities in the source, what levels of encoding or analysis were applied, and similar matters. The MEI elements used to encode the encoding description are described in section
Encoding Description below.
a work description, tagged
workDesc, containing classification and contextual information about the work, such as its subject matter, the situation in which it was produced, the individuals described by or participating in producing it, and so forth. Such a work profile is of particular use in highly structured composite texts such as corpora or language collections, where it is often highly desirable to enforce a controlled descriptive vocabulary or to perform retrievals from a body of text in terms of text type or origin. The work description may however be of use in any form of automatic text processing. The MEI elements used to encode the work description are described in section
Work Description below.
a revision history, tagged
revisionDesc, which allows the encoder to provide a history of changes made during the development of the electronic text. The revision history is important for version control and for resolving questions about the history of a file. The MEI elements used to encode the revision description are described in section
Revision Description below.
2.1. File Description
The structure of the bibliographic description of a machine-readable or digital musical text resembles that of a book, an article, or other kinds of textual objects. The file description element of the MEI header has therefore been closely modelled on existing standards in library cataloging; it should thus provide enough information to allow users to give standard bibliographic references to the electronic text, and to allow catalogers to catalog it. Bibliographic citations occurring elsewhere in the header, and in the text itself, are derived from the same model.
The bibliographic description of an electronic musical text should be supplied by the mandatory
fileDesc element:
(file description) – Contains a full bibliographic description of the MEI file.
The
fileDesc element contains two mandatory and six optional elements, each of which is described in more detail below. These elements are listed below in the order in which they must occur within the
fileDesc element.
Used to express size in terms other than physical dimensions, such as number of pages,
number of records in file, number of bytes, performance duration for music, audio
recordings
and visual projections, etc.
(publication statement) – Container for information regarding the publication or
distribution of a bibliographic item, including the publisher's name and address,
the date
of publication, and other relevant details.
(title statement) – Container for title and responsibility meta-data.
The title statement contains the title given to the electronic work, together with one or more optional statements of responsibility which identify the encoder, editor, author, compiler, or other parties responsible for it:
A person or organization who transcribes a musical composition, usually for a different
medium from that of the original; in an arrangement the musical substance remains
essentially unchanged.
Names of individuals, institutions, or organizations responsible for funding. Funders
provide financial support for a project; they are distinct from sponsors, who provide
intellectual support and authority.
Names of sponsoring individuals, organizations or institutions. Sponsors give their
intellectual authority to a project; they are to be distinguished from funders, who
provide
the funding but do not necessarily take intellectual responsibility.
(responsibility statement) – Names one or more individuals, groups, or in rare cases,
mechanical processes, responsible for creation or realization of the intellectual
or
artistic content.
The
title element contains the chief name of the electronic work. Its content takes the form considered appropriate by its creator. The element may be repeated, if the work has more than one title (perhaps in different languages). Where the electronic work is derived from an existing source text, it is strongly recommended that the title for the former should be derived from the latter, but clearly distinguishable from it, for example by the addition of a phrase such as ‘: an electronic transcription’ or ‘a digital edition’. This will distinguish the electronic work from the source text in citations and in catalogs, which contain descriptions of both types of material.
Other alternative titles or subtitles may be encoded in additional title elements with values in the @type attribute that distinguish them from the chief title. Sample values for the @type attribute include: main (main title), subordinate (subtitle, title of part), abbreviated (abbreviated form of title), alternative (alternate title by which the work is also known), translated (translated form of title), uniform (collective title).
The @type attribute is provided for convenience in analyzing titles and processing them according to their type; where such specialized processing is not necessary, there is no need for such analysis, and the entire title, including subtitles and any parallel titles, may be enclosed within a single
title element, as in the following example:
The electronic work will also have an external name (its ‘filename’ or ‘data set name’) or reference number on the computer system where it resides at any time. This name is likely to change frequently, as new copies of the file are made on the computer system. Its form is entirely dependent on the particular computer system in use and thus cannot always easily be transferred from one system to another. Moreover, a given work may be composed of many files. For these reasons, these Guidelines strongly recommend that such names should not be used as the title for any electronic work.
Helpful guidance on the formulation of useful descriptive titles in difficult cases may be found in the Anglo-American Cataloguing Rules (Gorman and Winkler, 1978, chapter 25) or in equivalent national-level bibliographical documentation.
At a minimum, the creator of the musical text and the creator of the file should be identified. If the bibliographic description is for a corpus, identify the creator of the corpus. Optionally also include the names of others involved in the transcription or elaboration of the text, sponsors, and funding agencies. The name of the person responsible for physical data input need not normally be recorded, unless that person is also intellectually responsible for some aspect of the creation of the file.
In traditional bibliographic practice, those with primary creative responsibility are given special prominence. MEI accommodates this approach by providing responsibility-role elements. For example:
Secondary intellectual responsibility in this case is encoded using
respStmt. The
respStmt element has two subcomponents: a
name element identifying a responsible individual or organization, and a
resp element indicating the nature of the responsibility. All names should be stated in the form in which the persons or bodies wish to be publicly cited. This will usually be the fullest form of the name, including first names. No specific recommendations are made at this time as to appropriate content for
resp. However, it should make clear the nature of the responsibility.
This method of encoding facilitates exchange of bibliographic data with library catalogs and bibliographic databases as well as applications whose handling of bibliographic data is restricted to traditional responsibility roles. Additional information regarding these responsibility-role elements can be found in chapter
Bibliographic Citations and References.
When the MEI.namesdates module is enabled, two additional elements are also permitted within
respStmt:
(personal name) – Designation for an individual, including any or all of that
individual's forenames, surnames, honorific titles, and added names.
These elements allow for more precise identification of the entity associated with the name than is permitted by the simpler
name element. The following example shows how a precise date range can be associated with a personal or corporate name.
For additional information about corporate and personal names, see chapter
Names and Dates.
In addition to, or instead of the
resp element, the @role attribute on
name,
persName, and
corpName may be used to capture the nature of responsibility. While
resp accommodates capturing the wide variety of text that may occur in responsibility statements, use of the @role attribute provides the possibility of recording a controlled value independently of the textual content of
resp.
Where it is necessary to group responsibilities and names, multiple responsibility statements may be used. For example:
It is often desirable to mix primary and secondary intellectual responsibility information. Treating all intellectual roles the same way can allow literal transcription of existing responsibility statements and simplify programmatic processing. The following example demonstrates how a responsibility statement may be transcribed using interleaved
resp and
persName elements:
However, eliminating explanatory text and relying on standardized values for @role, as in the following example, allows data creation and processing tools of the greatest simplicity.
2.1.2. Edition Statement
The
editionStmt element is the second component of the
fileDesc element. It is optional but recommended when applicable.
(edition designation) – A word or text phrase that indicates a difference in either
content or form between the item being described and a related item previously issued
by the
same publisher/distributor (e.g. 2nd edition, version 2.0, etc.), or simultaneously
issued
by either the same publisher/distributor or another publisher/distributor (e.g. large
print
edition, British edition, etc.).
(responsibility statement) – Names one or more individuals, groups, or in rare cases,
mechanical processes, responsible for creation or realization of the intellectual
or
artistic content.
For printed texts, the term ‘edition’ applies to the set of all the identical copies of an item produced from one master copy and issued by a particular publishing agency or a group of such agencies. A change in the identity of the distributing body or bodies does not normally constitute a change of edition, while a change in the master copy does.
For electronic texts, the notion of a master copy is not entirely appropriate, since they are far more easily copied and modified than printed ones; nonetheless, the term edition may be used for a particular state of a machine-readable text at which substantive changes are made and fixed. Synonymous terms used in these Guidelines are version, level, and release. The words revision and update, by contrast, are used for minor changes to a file which do not amount to a new edition.
No simple rule can specify how substantive changes have to be before they are regarded as producing a new edition, rather than a simple update. The general principle proposed here is that the production of a new edition entails a significant change in the intellectual content of the file, rather than its encoding or appearance. The addition of analytic coding to a text would thus constitute a new edition, while automatic conversion from one coded representation to another would not. Changes relating to the character code or physical storage details, corrections of misspellings, simple changes in the arrangement of the contents and changes in the output format do not normally constitute a new edition, whereas the addition of new information (e.g., annotations, sound or images, links to external data) almost always does.
Clearly, there will always be borderline cases and the matter is somewhat arbitrary. The simplest rule is: if you think that your file is a new edition, then call it such. An edition statement is optional for the first release of a computer file; it is mandatory for each later release, though this requirement cannot be enforced.
Note that all changes in a file, whether or not they are regarded as constituting a new edition or simply a revision, should be independently noted in the revision description section of the file header (see section
Revision Description).
The edition element should contain phrases describing the edition or version, including the word ‘edition’, ‘version’, or an equivalent term, together with a number or date, or terms indicating difference from other editions such as ‘new edition’, ‘revised edition’, etc. Any dates that occur within the edition statement should be marked with the
date element. The @n attribute of the edition element may be used as elsewhere to supply any formal identification (such as a version number) for the edition.
One or more
respStmt elements may also be used to supply statements of responsibility for the edition in question. These may refer to individuals or corporate bodies and can indicate functions such as that of a reviser, or can name the person or body responsible for the provision of supplementary matter, of appendices, etc., in a new edition.
Some examples follow:
2.1.3. Physical Description of the File
The third component of the fileDesc is a description of the physical qualities of the file. The
extent element is provided for this purpose.
Used to express size in terms other than physical dimensions, such as number of pages,
number of records in file, number of bytes, performance duration for music, audio
recordings
and visual projections, etc.
The
extent element describes the approximate size of a text as stored on some carrier medium, whether digital or non-digital, specified in any convenient units.
For printed books, information about the carrier, such as the kind of medium used and its size, are of great importance in cataloging procedures. The print-oriented rules for bibliographic description of an item’s medium and extent need some re-interpretation when applied to electronic media. An electronic file exists as a distinct entity quite independently of its carrier and remains the same intellectual object whether it is stored as file on a hard disc drive, a CD-ROM, a set of USB devices, or in the internet. Since, moreover, these Guidelines are specifically aimed at facilitating transparent document storage and interchange, any purely machine-dependent information should be irrelevant as far as the file header is concerned.
This is particularly true of information about file-type although library-oriented rules for cataloging often distinguish two types of computer file: ‘data’ and ‘programs’. This distinction is quite difficult to draw in some cases, for example, hypermedia or texts with built-in search and retrieval software.
Although it is equally system-dependent, some measure of the size of the computer file may be of use for cataloging and other practical purposes. Because the measurement and expression of file size is fraught with difficulties, only very general recommendations are possible; the element
extent should contain a phrase indicating the size or approximate size of the computer file in one of the following ways:
in bytes of a specified length (e.g. ‘4000 bytes’)
as falling within a range of values, for example:
less than 1 Mb
between 1 Mb and 5 Mb
between 6 Mb and 10 Mb
over 10 Mb
in terms of any convenient logical units (for example, words or sentences, citations, paragraphs)
in terms of any convenient physical units (for example, compact discs, removable hard drives, DVDs)
The use of standard abbreviations for units of quantity is recommended where applicable, here as elsewhere (see http://physics.nist.gov/cuu/Units/binary.html).
For ease of processability, the use of the @unit attribute is recommended, as in the following example:
2.1.4. Publication, Distribution, etc.
The
pubStmt element is the fourth component of the
fileDesc element and is mandatory.
(publication statement) – Container for information regarding the publication or
distribution of a bibliographic item, including the publisher's name and address,
the date
of publication, and other relevant details.
It may contain either a single
unpub element, indicating that the file has yet to be published, or in the case of published material, one or more elements from the
model.pubStmtPart class. The following elements may be used to provide details regarding the file’s publication and distribution:
(responsibility statement) – Names one or more individuals, groups, or in rare cases,
mechanical processes, responsible for creation or realization of the intellectual
or
artistic content.
The publisher is the person or institution by whose authority a given edition of the file is made public. The distributor is the person or institution from whom copies of the text may be obtained. Use
respStmt to identify other responsible persons or corporate bodies.
The sub-elements of
availability should be used to provide detailed information regarding access to the MEI file.
(system requirements) – System requirements for using the electronic item.
Give any other useful information (e.g., dates of collection of data) in an annotation within the notes statement, which is described below.
Here, as in the description of intellectual responsibility described above, the
respStmt element may be used to contain all statements of responsibility regarding publication and distribution when uniformity is desired regardless of the role of participants in the publication process:
2.1.5. Series Statement
The
seriesStmt element is the fifth component of the
fileDesc element and is optional.
(series statement) – Groups information about the series, if any, to which a publication
belongs.
A series may be defined in one of the following ways:
A group of separate items related to one another by the fact that each item bears, in addition to its own title proper, a collective title applying to the group as a whole. The individual items may or may not be numbered.
Each of two or more volumes of essays, lectures, articles, or other items, similar in character and issued in sequence.
A separately numbered sequence of volumes within a series or serial.
The
seriesStmt element may contain one or more of the following more specific elements:
(responsibility statement) – Names one or more individuals, groups, or in rare cases,
mechanical processes, responsible for creation or realization of the intellectual
or
artistic content.
(series statement) – Groups information about the series, if any, to which a publication
belongs.
The
title,
editor and
identifier elements have the same function described above: identification of the item, in this case the series, and the individuals or groups responsible for its creation. The
title element is required within
seriesStmt.
The
identifier element may be used to supply any identifying number associated with the series, including both standard numbers such as an ISSN and particular issue numbers. Its @type attribute is used to categorize the number further, taking the value ‘ISSN’ for an ISSN, for example.
The contents of the series may be enumerated using the
contents element. Use of this element should be determined by the complexity of the resource and whether or not the information is readily available. The
contents element may consist of a single paragraph when unstructured information is sufficient.
Alternatively,
contentItem elements may be used to provide structure for the content description.
Finally, using the @target attribute, a link to an external table of contents may be supplied in lieu of or in addition to the child elements of
contents.
The
seriesStmt element is allowed to nest within itself in order to accommodate a series within a series.
2.1.6. Notes Statement
The
notesStmt element is the sixth component of the
fileDesc element and is optional. If used, it contains one or more
annot elements, each containing a single piece of descriptive information of the kind treated as ‘general notes’ in traditional bibliographic descriptions.
(notes statement)– Collects any notes providing information about a text additional
to
that recorded in other parts of the bibliographic description.
Some information found in the notes area in conventional bibliography has been assigned specific elements in these Guidelines; in particular the following items should be tagged as indicated, rather than as general notes:
the nature, scope, artistic form, or purpose of the work; also the genre or other intellectual category to which it may belong. These should be formally described within the
workDesc element (section
Work Description).
bibliographic details relating to the source or sources of an electronic text: e.g., ‘Transcribed from a facsimile of the 1743 publication’. These should be formally described in the
sourceDesc element (section
Source Description).
further information relating to publication, distribution, or release of the text, including sources from which the text may be obtained, any restrictions on its use or formal terms on its availability. These should be placed in the appropriate division of the
pubStmt element (section
Publication, Distribution, etc.).
publicly documented numbers associated with the file should be placed in an
altId element within the
meiHead element. International Standard Serial Numbers (ISSN), International Standard Book Numbers (ISBN), and other internationally agreed upon standard numbers that uniquely identify an item, should be treated in the same way, rather than as specialized bibliographic notes. As described elsewhere, identifiers for sources of the file should be recorded within the
sourceDesc.
Nevertheless, the
notesStmt element may be used to record potentially significant details about the file and its features, for example:
dates, when they are relevant to the content or condition of the computer file: e.g. ‘manual dated 2010’, ‘file validated Apr 2011’
names of persons or bodies connected with the technical production, administration, or consulting functions of the effort which produced the file, if these are not named in statements of responsibility in the title or edition statements of the file description: e.g. ‘Historical commentary provided by members of the Big Symphony Orchestra’
availability of the file in an additional medium or information not already recorded about the availability of documentation: e.g. ‘User manual is loose-leaf in eleven paginated sections’
language of work and abstract, if not encoded in the
langUsage element, e.g. ‘Text in English with stage directions in French and German’
Each such item of information may be tagged using the general-purpose
annot element. Groups of annotations are contained within the
notesStmt element, as in the following example:
There are advantages, however, to encoding such information with more precise elements elsewhere in the MEI header, when such elements are available. For example, the notes above might be encoded as follows:
2.1.7. Source Description
The
sourceDesc element is the seventh and final component of the
fileDesc element. In MEI,
sourceDesc is a grouping element containing one or more
source elements, each of which records details of a source from which the computer file is derived. This might be a printed text or manuscript, another computer file, an audio or video recording, or a combination of these. An electronic file may also have no source, if what is being cataloged is an original text created in electronic form.
A bibliographic description of a source used in the creation of the electronic
file.
The content model of the
source element is similar to that of the
fileDesc and
work elements. The list below reflects the order in which the optional components of
source must occur.
(publication statement) – Container for information regarding the publication or
distribution of a bibliographic item, including the publisher's name and address,
the date
of publication, and other relevant details.
(physical description) – Container for information about the appearance, construction,
or handling of physical materials, such as their dimension, quantity, color, style,
and
technique of creation.
(physical location) – Groups information about the current physical location of a
bibliographic item, such as the repository in which it is located and its shelf mark(s),
and
its previous locations.
Provides a container for information about the history of a resource. To facilitate
efficient data interchange, basic information about the circumstances surrounding
the
creation of bibliographic resources should be recorded within the creation element.
Groups information which describes the nature or topic of an entity.
When the MEI.frbr module is available, the following elements may also appear after the classification element. Additional information regarding FRBR (Functional Requirements for Bibliographic Records) can be found at
Functional Requirements for Bibliographic Records (FRBR).
In the simplest case, the
source element may contain nothing more than a notes statement giving a simple prose description or a brief note stating that the document has no physical source:
Alternatively, it may contain a basic bibliographic citation, also in an annotation:
However, more structured bibliographic data, such as that in the example below, facilitates better machine-processing:
A description of more precise capture of dates and date ranges is provided in chapter
Names and Dates.
The
identifier element is provided within
source in order to accommodate identifying strings which cannot be captured by the @xml:id attribute, such as numbers or strings requiring XML markup.
If a source of the file is an unpublished manuscript, it is recommended that the
unpub element be used as the only content of the source’s
pubStmt element. Other identifying information for the manuscript may be collected in the
notesStmt element, as described in section
Notes Statement.
2.1.7.1. Associating Metadata and Data
In the MEI header, the @data attribute may be used to associate metadata with related notational elements.
Similarly, in the body of the MEI document, the @decls attribute may be used to associate parts of the encoded text with related metadata.
The most useful associations of this type are between the bibliographic description of a source and the material taken from it.
2.2. Encoding Description
The
encodingDesc element is the second major subdivision of the MEI header. It specifies the methods and editorial principles which governed the transcription or encoding of the source material. Though not formally required, its use is highly recommended.
(encoding description) – Documents the relationship between an electronic file and
the
source or sources from which it was derived as well as applications used in the
encoding/editing process.
The encoding description may contain elements taken from the model.encodingPart class. By default, this class makes available the following elements:
(project description) – Project-level meta-data describing the aim or purpose for
which
the electronic file was encoded, funding agencies, etc. together with any other relevant
information concerning the process by which it was assembled or collected.
(sampling declaration) – Contains a prose description of the rationale and methods
used
in sampling texts in the creation of a corpus or collection.
Each of these elements is further described in the appropriate section below.
2.2.1. Application Information
It is sometimes convenient to store information relating to the processing of an encoded resource within its header. Typical uses for such information might be:
to allow an application to discover that it has previously opened or edited a file, and what version of itself was used to do that;
to show (through a date) which application last edited the file to allow for diagnosis of any problems that might have been caused by that application;
to allow users to discover information about an application used to edit the file
to allow the application to declare an interest in elements of the file which it has edited, so that other applications or human editors may be more wary of making changes to those sections of the file.
Supplies a version number for an application, independent of its identifier or
display name.
Each
application element identifies the current state of one software application with regard to the current file. This element is a member of the att.datable class, which provides a variety of attributes for associating this state with a date and time, or a temporal range. The @xml:id and @version attributes should be used to uniquely identify the application and its major version number (for example, ‘Music Markup Tool 1.5’). It is not intended that a software application should add a new
application element each time it touches the file.
The following example shows how these elements might be used to record the fact that version 1.5 of an application called ‘Music Markup Tool’ has an interest in two parts of a document. The parts concerned are accessible at the URLs given as targets of the two
ptr elements. When used on
application, the @date attribute specifies when the application was employed, in this case June 6, 2011. Version information for the application should be placed in @version.
2.2.2. Declaration of Editorial Principles
The
editorialDecl element is used to provide details of the editorial practices applied during the encoding of a musical text.
It may contain a prose description only, or one or more of a set of specialized elements; that is, members of the MEI model.editorialDeclPart class.
Some of these policy elements carry attributes to support automated processing of certain well-defined editorial decisions; all of them contain a prose description of the editorial principles adopted with respect to the particular feature concerned. Examples of the kinds of questions which these descriptions are intended to answer are given in the list below.
correction: correctionStates how and under what circumstances corrections have been made in the text. corrlevelIndicates the degree of correction applied to the text. methodIndicates the method employed to mark corrections and normalizations. Was the text corrected during or after data capture? If so, were corrections made silently or are they marked using the tags described in chapter 11 Editorial Markup? What principles have been adopted with respect to omissions, truncations, dubious corrections, alternate readings, false starts, repetitions, etc.?
interpretation: interpretationDescribes the scope of any analytic or interpretive information added to the transcription of the music. Has any analytic or ‘interpretive’ information been provided — that is, information which is felt to be non-obvious, or potentially contentious? If so, how was it generated? How was it encoded?
normalization: normalizationIndicates the extent of normalization or regularization of the original source carried out in converting it to electronic form. methodIndicates the method employed to mark corrections and normalizations. Was the text normalized, for example by regularizing any non-standard enharmonic spellings, etc.? If so, were normalizations performed silently or are they marked using the tags described in chapter 11 Editorial Markup ? What authority was used for the regularization? Also, what principles were used when normalizing numbers to provide the standard values for the value attribute described in section 1.3.4 Names, Dates, Numbers, Abbreviations, and Addresses and what format is used for them?
segmentation: segmentationDescribes the principles according to which the musical text has been segmented, for example into movements, sections, etc. How is the musical text segmented? If mdiv and/or section elements have been used to partition the music for analysis, how are they marked and how was the segmentation arrived at?
standard values: stdVals(standard values) – Specifies the format used when standardized date or number values are supplied. In most cases, attributes bearing standardized values should conform to a defined datatype. In cases where this is not appropriate, this element may be used to describe the standardization methods underlying the values supplied.
Experience shows that a full record should be kept of decisions relating to editorial principles and encoding practice, both for future users of the text and for the project which produced the text in the first instance. Any information about the editorial principles applied not falling under one of the above headings may be recorded as additional prose following the special-use elements.
An editorial practices declaration which applies to more than one text or division of a text need not be repeated in the header of each text or division. Instead, the @decls attribute of each text (or subdivision of the text) to which it applies may be used to supply a cross-reference to a single declaration encoded in the header.
(project description) – Project-level meta-data describing the aim or purpose for
which
the electronic file was encoded, funding agencies, etc. together with any other relevant
information concerning the process by which it was assembled or collected.
The
projectDesc element may be used to describe, in prose, the purpose for which a digital resource was created, together with any other relevant information concerning the process by which it was assembled or collected. This is of particular importance for corpora or miscellaneous collections, but may be of use for any text, for example to explain why one kind of encoding practice has been followed rather than another.
For example:
2.2.4. Sampling Declaration
The samplingDecl element holds a prose description of the rationale and methods used in selecting texts, or parts of text, for inclusion in the resource.
(sampling declaration) – Contains a prose description of the rationale and methods
used
in sampling texts in the creation of a corpus or collection.
The
samplingDecl element should include information about such matters as:
the size of individual samples
the method or methods by which they were selected
the underlying population being sampled
the object of the sampling procedure used but is not restricted to these.
It may also include a simple description of any parts of the source text included or excluded:
A sampling declaration which applies to more than one text or division of a text need not be repeated in the header of each such text. Instead, the @decls attribute of each text (or subdivision of the text) to which the sampling declaration applies may be used to supply a cross-reference to it, as further described in section
Associating Metadata and Data.
2.3. Work Description
The
workDesc element is the third major subdivision of the MEI Header. It is an optional element, the purpose of which is to enable the recording of information characterizing various descriptive aspects of the abstract work.
Provides a detailed description of a work, specifically its history, language use,
and
high-level musical attributes: key, tempo, meter, medium of performance, and intended
duration.
All the components of
work are optional, but they must occur in the following order:
identifier
titleStmt
incip
key
mensuration
meter
tempo
otherChar
history
langUsage
perfMedium
audience
contents
context
biblList
notesStmt
classification
These work description components may be classed into two groups based on their function:
identification of the work: identifier, titleStmt, incip, key, mensuration, meter, tempo, otherChar
contextual information for the work: history, langUsage, perfMedium, audience, contents, context, biblList, notesStmt, classification
2.3.1. Work Identification
The following elements provide minimal identifying information for the intellectual work:
(title statement) – Container for title and responsibility meta-data.
The identifier and title values recorded here may or may not be the same as those assigned to published versions of the work. Fuller details regarding the use of
titleStmt are available in section
Title Statement.
(incipit) – The opening music and/or words of a composition.
The first few notes and/or words of a piece of music are often used for identification purposes, especially when the piece has only a generic title, such as “Sonata no. 3”. They appear in catalogs of music and in tables of contents of printed music that include multiple works.
The following elements are provided for the inclusion of incipits:
The elements
incipCode and
incipText are available for the inclusion of coded incipits of music notation and textual incipits, respectively. The
incipText element should contain only the initial performed text of the work, while
incipCode may contain both words and music, depending on the capabilities of the scheme used to encode it. When both music and text are provided in
incipCode, it may be helpful to repeat the text in
incipText in order to provide easier access to only the text, for example, for indexing of the text without having to extract it from the coded incipit.
Both
incipCode and
incipText allow reference to an external file location via the @target attribute and specification of the internet media type of the external file via the @mimetype attribute. It is a semantic error not to include one of these attributes.
An MEI-encoded incipit may be captured in the
score element.
In addition,
graphic may be used to include an image of an incipit.
2.3.3. Key, Tempo, and Meter
The attributes key, tempo, and meter are often helpful for identifying a musical work when it does not have a distinctive title.
Text and symbols descriptive of tempo, mood, or style, e.g., "allarg.", "a tempo",
"cantabile", "Moderato", "♩=60", "Moderato ♩ =60").
The
key element is used exclusively within bibliographic descriptions. Do not confuse this element with
keySig, which is used within the body of an MEI file to record this data for musical notation. Likewise,
meter should not be confused with the attributes used by staffDef and scoreDef to record meter-related data for notated music. The
tempo element can be used here as well as in the body of an MEI document; however, its attributes other than @xml:id, @label, @n, @base, and @lang are meaningless in the MEI header context, and therefore should be avoided within a work description. The
mensuration element is available for the description of works in the mensural repertoire. When a work uses meter and mensural signs, both
mensuration and
meter elements may be used.
2.3.4. Other Identifying Characteristics
Additional information that aids the identification of the work may be encoded using
otherChar.
(other distinguishing characteristic) – Any characteristic that serves to differentiate
a work or expression from another.
The following components provide detailed information about the work’s context, including the circumstances of its creation, the languages used within it, high-level musical attributes, performing forces, etc.
2.3.5. Work History
The following elements are provided to capture the history of a musical work:
Non-bibliographic details of the creation of an intellectual entity, in narrative
form,
such as the date, place, and circumstances of its composition. More detailed information
may
be captured within the history element.
Provides a container for information about the history of a resource. To facilitate
efficient data interchange, basic information about the circumstances surrounding
the
creation of bibliographic resources should be recorded within the creation element.
The
creation element is intended to contain a brief, machine-processable statement of the circumstances of the work’s creation. Its content is limited to text and the
date and
geogName elements.
The
history element is a container for additional non-bibliographic details relating to a work. It may use the
eventList element to provide a list of key events in the creation and performance history of the work. The
eventList element is comprised of
event elements containing a brief description of the associated event, including dates and locations where the event took place. An event list may use the @type attribute to distinguish between multiple event lists with different functions, such as a list of events in the compositional process and a list of performance dates.
Event lists and other text components, such as paragraphs, tables, lists, and text divisions (
div) may be interleaved when an ‘essay-like’ work history is desired.
The
event element permits either a text-centric or a data-centric model. The text-centric model is provided for prose descriptions, while the data-centric model accommodates event descriptions that consist of a collection of descriptive phrases. In the text-centric model, paragraphs, tables, and lists may be used. In the data-centric model, however, only certain phrase-level elements, may appear.
2.3.6. Language Usage
The
langUsage element is used within the
workDesc element to describe the languages, sublanguages, dialects, etc. represented within a work. It contains one or more
language elements, each of which provides information about a single language.
A
language element may be supplied for each different language used in a document. If used, its @xml:id attribute should specify an appropriate language identifier. This is particularly important if extended language identifiers have been used as the value of @xml:lang attributes elsewhere in the document.
Here is an example of the use of this element:
2.3.7. Performance Medium
The following elements are available for description of a composition’s performing forces:
The
perfMedium element provides the possibility of describing a work in terms of its medium of performance; that is, the performing forces required. In the case of a dramatic work, the dramatis personae and associated voice qualities may be enumerated using
castList. The
perfResList element describes the necessary instrumental and vocal resources.
2.3.7.1. Cast Lists
A cast list is a specialized form of list, conventionally found at the start or end of a dramatic work, usually listing all the speaking/singing and non-speaking/singing roles in the play, often with additional description (‘Cataplasma, a maker of Periwigges and Attires’) or the name of an actor or actress (‘Old Lady Squeamish. Mrs Rutter’).
(performance resource) – Name of an instrument on which a performer plays, a performer's
voice range, or a standard performing ensemble designation.
In the following example,
role provides the name of the dramatic character and
roleDesc contains a brief description of the role. The
perfRes element is used to describe the voice range of the role.
The vocal qualities and associated roles for Beethoven’s opera Fidelio may be encoded as:
However, this element is unlikely to be useful in the context of a work description. It may be used here, however, for the very rare occasion when a work was conceived for and is only performable by a single person or group, as for certain “performance art” works.
It is common to find some roles presented in groups or sublists. Roles are also often grouped together by their function. To accommodate these situations, the
castGrp element is provided as a component of
castList. It may contain any combination of
castItem,
castGrp, and
roleDesc elements.
2.3.7.2. Instrumentation
The
perfResList element is used to capture the solo and ensemble instrumental and vocal resources of a composition. For example, a work for a standard ensemble may be indicated thus:
The detailed make-up of standard and non-standard ensembles may also be enumerated:
Where multiple instruments of the same kind are used, the @count attribute on
perfRes may be used to encode the exact number of players called for.
Instrument or voice specifications may be grouped using the
perfResList element and a label assigned to the group with
head. For example:
The preceding example also demonstrates how instrumental doublings can be accommodate through the use of nested
perfRes elements. Only the outer-most perfRes element should use the @count attribute. Its value should reflect the total number of performers, not the number of instruments played.
The
perfRes element provides the @codedval attribute, which can be used to record a coded value that represents the string value stored as the element’s content. It is recommended that coded values be taken from a standardized list, such as the International Association of Music Libraries’ Medium of performance Codes List or the MARC Instruments and Voices Code List.
Solo parts may be marked with the @solo attribute of
perfRes, like so:
Music for a single player should, however, never use the @solo attribute.
Defines the class of user for which the work is intended, as defined by age group
(e.g.,
children, young adults, adults, etc.), educational level (e.g., primary, secondary,
etc.),
or other categorization.
The historical, social, intellectual, artistic, or other context within which the
work
was originally conceived (e.g., the 17th century restoration of the monarchy in England,
the
aesthetic movement of the late 19th century, etc.) or the historical, social, intellectual,
artistic, or other context within which the expression was realized.
The intended audience for the work and additional information about context for the work that is not captured in more specific elements elsewhere, such as
history and its sub-components, may be recorded in the
audience and
context elements.
Contains a single entry within a content description element.
Often, it is helpful to identify an entity by listing its constituent parts. A simple description of the work’s content, such as may be found in a bibliographic record, can be given in single paragraph element:
Alternatively, a structured list of contents may be constructed using the
contentItem element:
The
biblList element allows citation of bibliographic evidence supporting assertions made within other sub-components of the work description.
2.3.11. Notes Statement
The
notesStmt element may be used within the description of the musical work to capture information not accounted for by the other elements of the description.
2.3.12. Classification
The next component of the core
workDesc element is the
classification element. This element is used to classify a musical text according to one or more of the following methods:
by reference to a recognized international classification scheme such as the Dewey Decimal Classification, the Universal Decimal Classification, the Colon Classification, the Library of Congress Classification, or any other system widely used in library and documentation work
by providing a set of keywords, as provided, for example, by British Library or Library of Congress Cataloguing in Publication data.
The following elements are provided for this purpose:
(classification code) – Holds a citation to the source of controlled-vocabulary terms
used in the <termList> element; for example, Library of Congress Subject Headings
(LCSH), Library of Congress Classification (LCC), Library of Congress Name Authority
File
(LCNAF), or other thesaurus or ontology.
The
termList element categorizes an individual text by supplying a set of terms which may describe its topic or subject matter, its physical or intellectual form, date, etc. Each term is indicated by a
term element. In some schemes, the order of items in the list is significant, for example, from major topic to minor; in others, the list has an organized substructure of its own. No recommendations are made here as to which method is to be preferred. Wherever possible, such terms should be taken from a recognized source.
The classCode element offers the possibility of capturing a bibliographic citation and/or a URI at which the classification scheme or information about it may be found.
The @classcode attribute may be used on each term element to make reference, by means of an identifier, to the classification scheme from which it is drawn.
Alternatively, @classcode may be used on
termList when all the contained terms come from the same source.
2.3.13. Work Relationships
When the FRBR (Functional Requirements for Bibliographic Records) module is available, the following elements may be used within
work to describe relationships between the work being described and other works or between the work and expressions of it:
The final sub-element of the MEI header, the
revisionDesc element, provides a detailed change log in which each change made to a text may be recorded. Its use is optional but highly recommended. It provides essential information for the administration of large numbers of files which are being updated, corrected, or otherwise modified as well as extremely useful documentation for files being passed from researcher to researcher or system to system. Without change logs, it is easy to confuse different versions of a file, or to remain unaware of small but important changes made in the file by some earlier link in the chain of distribution. No change should be made in any MEI-conformant file without corresponding entries being made in the change log.
Individual change within the revision description.
The main purpose of the revision description is to record changes in the text to which a header is prefixed. However, it is recommended practice to include entries also for significant changes in the header itself (other than the revision description itself, of course). At the very least, an entry should be supplied indicating the date of creation of the header.
The log consists of a list of
change elements, each of which contains a detailed description of the changes made. If a number is to be associated with one or more changes (for example, a revision number), the @n attribute may be used to indicate it. The person responsible for the change and the date of the change may be indicated by the
respStmt and
date elements. The description of the change itself is contained within the
changeDesc element, which can hold one or more paragraphs.
It is recommended to give changes in reverse chronological order, most recent first.
For example:
A slightly shorter form for recording changes is also available when a the date of the change can be described by a single date in a standard ISO form and when the name of the agent(s) responsible for the change, encoded elsewhere in the header, can be referred to by one or more URIs given in the @resp attribute. For example:
2.5. Minimal and Recommended Header Information
The MEI header allows for the provision of a very large amount of information concerning the text itself, its source, its encodings, and revisions of it, as well as a wealth of descriptive information, such as the languages it uses and the situation(s) in which it was produced, together with the setting and identity of participants within it. This diversity and richness reflects the diversity of uses to which it is envisaged that electronic texts conforming to these Guidelines will be put. It is emphatically not intended that all of the elements described above should be present in every MEI Header.
The amount of encoding in a header will depend both on the nature and the intended use of the text. At one extreme, an encoder may expect that the header will be needed only to provide a bibliographic identification of the text adequate to local needs. At the other, wishing to ensure that their texts can be used for the widest range of applications, encoders will want to document as explicitly as possible both bibliographic and descriptive information, in such a way that no prior or ancillary knowledge about the text is needed in order to process it. The header in such a case will be very full, approximating the kind of documentation often supplied in the form of a manual. Most texts will lie somewhere between these extremes; textual corpora in particular will tend more to the latter extreme. In the remainder of this section we demonstrate first the minimal, and then a commonly recommended, level of encoding for the bibliographic information held by the MEI header.
Supplying only the level of encoding required, the MEI header of a single text will look like the following example:
The only mandatory component of the MEI Header is the
fileDesc element. Within this element,
titleStmt and
pubStmt are required constituents. Within the title statement, a title is required. Within the
pubStmt, a publisher, distributor, or other agency responsible for the file is required.
While not formally required, additional information is recommended for a minimally effective header. For example, it is recommended that the person or corporate entity responsible for the creation of the encoding should be specified using
respStmt within the
titleStmt element. It is also recommended that information about the source, or sources, of the encoding be included. Each
source element should contain at the least a loosely structured bibliographic citation that identifies the source used to construct the MEI file.
Furthermore, If the electronic transcription is a member of a series of publications, the series title and publisher should be included using the
seriesStmt element. It is also common for cataloging records to include genre and/or form information, here represented by the MEI
classification element.
We now present the same example header, expanded to include additionally recommended information, adequate for most bibliographic purposes, in particular to allow for the creation of an AACR2-conformant bibliographic record.
2.6. Independent Headers
Many libraries, repositories, research sites and related institutions collect bibliographic and documentary information about machine readable music documents without necessarily collecting the music documents themselves. Such institutions may thus want access to the header of an MEI document without its attached text in order to build catalogs, indexes and databases that can be used to locate relevant texts at remote locations, obtain full documentation about those texts, and learn how to obtain them. This section describes a set of practices by which the metadata headers of MEI documents can be encoded separately from those documents and exchanged as freestanding MEI documents. Headers exchanged independently of the documents they describe are called independent headers.
2.6.1. Definition and Principles for Encoders
An independent header is an MEI metadata header that can be exchanged as an independent document between libraries, archives, collections, projects, and individuals.
The structure of an independent header is exactly the same as that of an header attached to a document. This means that an
meiHead can be extracted from an MEI document and sent to a receiving institution with little or no change.
When deciding which information to include in the independent header, and the format or structure of that information, the following should be kept in mind:
The independent header should provide full bibliographic information about the encoded text, its sources, where the text can be located, and any restrictions governing its use.
The independent header should contain useful information about the encoding of the text itself. In this regard, it is highly recommended that the encoding description be as complete as possible. The Guidelines do not require that the encoding description be included in the header (since some simple transcriptions of small items may not require it), but in practice the use of a header without an encoding description would be severely limited.
The independent header should be amenable to automatic processing, particularly for loading into databases and for the creation of publications, indexes, and finding aids, without undue editorial intervention on the part of the receiving institution. For this reason, two recommendations are made regarding the format or structure of the header: first, where there is a choice between a prose content model and one that contains a formal series of specialized elements, wherever possible and appropriate the specialized elements should be preferred to unstructured prose. Second, with respect to corpora, information about each of the texts within a corpus should be included in the overall corpus-level
meiHead. That is, source information, editorial practices, encoding descriptions, and the like should be included in the relevant sections of the corpus
meiHead, with pointers to them from the headers of the individual texts included in the corpus. There are three reasons for this recommendation: first, the corpus-level header will contain the full array of bibliographic and documentary information for each of the texts in a corpus, and thus be of great benefit to remote users, who may have access only to the independent header; second, such a layout is easier for the coder to maintain than searching for information throughout a text; and third, generally speaking, this practice results in greater overall consistency, especially with respect to bibliographic citations.
2.6.2. Header Elements and their Relationship to Other Bibliographic Standards
Mapping elements from the MEI metadata header to another descriptive system may help a repository harvest selected data from the MEI file to build a basic catalog record. For this purpose, the following attribute is provided on most meiHead elements:
Contains a reference to a field or element in another descriptive encoding system
to
which this MEI element is comparable.
The encoding system to which fields are mapped must be specified in @analog. When possible, subfields as well as fields should be specified, e.g., subfields within MARC fields.
2.7. RelatedItem vs. FRBR
MEI offers two related concepts for capturing relations between bibliographic items. The model of
relatedItem, as described in chapter
Related Items of these Guidelines, is derived from MODS v3.4 (see documentation here). Its purpose in MEI is to encode bibliographic references between mostly “secondary” material, like reviews, articles, and so on. It may be used to provide cross-references between information encoded in different places of the header.
However,
relatedItem is less ideal for describing the relations between works, differing versions of these works, the sources in which those versions are transmitted, and where applicable the individual copies of a print. For these situations, it is strongly recommended to use the
Functional Requirements for Bibliographic Records (FRBR) instead. This module is based on the Functional Requirements for Bibliographic Records, as specified by the IFLA. It allows a much finer description of relationships between such “primary” entities. For compatibility reasons, both models should not be confused or mixed under any circumstances.