This chapter describes how to encode words and syllables in vocal notation. This text is typically written under a staff to indicate the text to be vocally performed. As such, this text should not be confused with other text on the score, for which see chapter 9 Text Encoding.
These guidelines suggest two methods for encoding text in vocal notation: encoding syllables as 8.3 Vocally Performed Text Encoded Within Notes and encoding performed text as 8.4 Vocally Performed Text Encoded Separately after the notes (and other staff events) either within layer elements or within measure elements when available (for example in a Common Music Notation context). Each method may be more convenient depending on the source text and on the textual phenomena that the encoding intends to record.
Both methods eventually rely on the syl element, which is part of the ‘shared’ module and is therefore available in all MEI files. The following sections will begin by introducing the general use of syl and then show in detail the two different encoding methods.
By ‘lyric syllable’, these guidelines mean a word or portion of a word that is to be performed vocally. Each syllable is encoded with the syl element, with which it is also possible to specify the position of the syllable in a word, the type of connectors between syllables, alignment adjustments, and the formatting for each syllable. These are the key components:
The attribute wordpos is used to specify the position of the marked-up lyric syllable in a word. It allows the following values:
i: Indicates that the current syllable's position is initial; that is, at the beginning of a word;
m: Indicates that the current syllable is in the middle of a word;
t: Indicates that the syllable's position is terminal; that is, at the end of a word.
When a syllable is at the beginning or in the middle of a word (in which case it will have the wordpos attribute set to ‘i’ or ‘m’), it is recommended to specify the type of connector written between the current and the following syllable. This is expressed with the con attribute, which takes the following values:
s: A space is used as a connector between syllables;
d: A dash is used as a connector between syllables;
u: An underscore sign (indicating prologation of the syllable) is used as a connector between syllables;
t: A tilde is used to indicate elision with the following syllable. This is typically rendered as a small curved line between the syllables.
Occasionally, a word or a final syllable needs to be extended across multiple notes. In this case an ‘extender’ is provided. An extender is a continuous line drawn at the text's baseline from the end of the syllable associated with the first note until the last note to be sung with the syllable.
The use of syl described in this section is common to CMN and other notation systems, such as mensural notation. Other uses specific to certain types of notation and repertoires are addressed in other chapters. See for example 6 Repertoire: Neume Notation.
Using the syl attribute on notes is the simplest way of encoding vocally performed text and is recommended only for simple situations or for those encodings which do not focus on vocally performed text.
The following example from Handel's Messiah (HWV 56) shows the use of syl:
The following example from Handel's Messiah (HWV 56) shows the use of verse:
As it is common practice in written text, it is assumed that a space separates words. Many vocal texts, however, introduce elisions and connect two syllables into one unit. For example, the vocal text from Mozart's Don Giovanni sung by Don Giovanni in Finale II, Ho fermo il core in petto introduces an elision between the word fermo and il and between core and in. An elision can be indicated by placing both syllables within the same note and setting the syl element's con attribute value to 't':
When there is more than one line of text, more than one verse element can be used. The following example from a piano reduction of Wagner's Rheingold has two lines of text, with an English translation on the second line. Note the use of the xml:lang attribute to differentiate the two languages:
Finally, the rhythm attribute can be used to specify a rhythm for the syllable that differs from that of the notes on the staff.
Vocally performed text may also be encoded separately from the notes with the lg element. These are the main components:
Since this element is separated from the encoding of the notes, it must be associated with a staff that will provide rhythm information when required for automated processing. The staff attribute gives the associated staff and if there is more than one layer on that staff, the layer attribute may be used to indicate the layer from which the rhythm should be taken. If there is any divergence between the rhythm of the vocally performed text and the notes, the rhythm attribute on verse may be used to specify the text's rhythm.
The following example from Carl Maria von Weber's Der Freischütz illustrates this encoding method:
In this encoding style, a syl element with its con attribute set to 't' and the following syllable are presumed to be associated with a single note. In the following example, the first two syllables occur on the first note and the third syllable occurs on the second note.
This section is supposed to explain stage directions and speeches in MEI drama.