Tagging Figures, Graphics, and Multimedia

This Tag Set provides several elements related to the still graphic images and the multimedia objects contained within a book or book part such as a chapter. The graphic and multimedia elements do not contain the image or object; they point to an external file that contains the image or object. The primary ones are:
Inline Graphic
The simplest graphic object is the <inline-graphic>, which is used when the image should appear inline with surrounding text. Inline graphics may contain no labels, titles, or captions. A typical inline graphic is an empty element, using an attribute to point to a logo or to a special math or chemistry symbol that is used inside a paragraph or a title. (Best Practice: If the image represents a special private character, the <private-char> element should be used instead of <inline-graphic>.)
Graphic
Still images that do not appear inline are tagged with the <graphic> element, typically inside a figure (<fig>), which holds associated information such as a figure number, title, caption, etc. (see below). When an image in a book has no associated information (title, figure number, etc.), a <graphic> element should not be embedded in a <fig> element.
Most <graphic> elements are empty, used only to point to their external file. A <graphic> element may contain a <label> or <caption>, but <graphic> elements that are block-level objects in the text rather than inside a larger object such as a figure rarely do. When a <graphic> is within a figure (<fig>), the associated <label> or <caption> is typically within the <fig>, not within the <graphic>. A <label> within a <graphic> is typically used only when a single figure (<fig><label>Fig 4.</label>...</fig>) contains 3 or 4 graphics, each with a separate label such as:
    ...<graphic ...><label>Part A</label>...</graphic>...
or
    ...<graphic ...><label>Part B</label>...</graphic>...
Accessibility: A <graphic> that is not inside a figure or a table should contain several sub-elements, including <alt-text> and <long-desc>, which are especially important to enable access for visually impaired users. For graphics inside another element (such as a <fig>), the containing element typically holds the accessibility information, although in rare cases the accessibility elements may need to be in both places.
Decorations: If a graphic is purely decorative and is not part of the narrative of the book (such as a background image of a wheat field on the first page of a chapter about crop grains or a section symbol between sections within a chapter), the <graphic> element may be marked with an empty @alt attribute to indicate that it is purely decorative, and thus can be ignored by screen readers and other accessibility aids.
Media:
Objects such as an MP3 sound clip accompanying a book are tagged with the <media> element, typically inside a figure (<fig>), which holds associated information such as a figure number, title, caption, etc. (see below). When a media object in a book has no associated information (title, figure number, etc.), a <media> element should be not be embedded in a <fig> element.
Accessibility: A <media> object that is not inside a figure or table should contain several sub-elements, including <alt-text> and <long-desc>, which are especially important to enable access for visually impaired users. For objects inside another element (such as a <fig>), the containing element typically holds the accessibility information, although in rare cases the accessibility elements may need to be in both places.
Figures and Figure Groups:
This Tag Set also contains two elements typically used for associating a graphic or media object with related information (such as a caption, label, title, or credits and other attribution) or with other graphic, media, or textual objects.
  • Figure (<fig>) — typically, a labeled wrapper for images and media and their metadata
  • Figure Group (<fig-group>) — A grouping of several <fig> elements
Inside a figure (<fig>), the most commonly occurring related information is a figure number (<label>) and a caption (<caption>). Some publishers consider the display of a figure number definitive for deciding when to use a <fig> element; others depend more on whether the image ought to be listed in a List of Figures. Usually the two criteria match up, but there may be exceptions where numbered figures do not appear in the list, or vice versa.
Like tables, figures typically contain titles and captions. In this Tag Set, the <title> element is included inside the <caption> element. A <caption> may contain many kinds of text associated with its figure, typically a <title> (which acts as a heading for the figure), one or more descriptive paragraphs (<p>), or a legend (tagged, for example, as a list within a <p>).
Figures do not always contain pictures. While many <fig> elements contain graphic objects (<graphic> or <media>), a <fig> element may also contain a variety of display and textual objects, such as formulas, chemical structures, poetry, tables, bulleted or numbered lists, simple paragraphs, definition lists, or a mixture of any of these items as well as one or more graphic objects.
A Figure Group element (<fig-group>) contains one or more Figure elements (<fig>), placed in one container for editorial or production convenience. Specifically, a <fig-group> contains two or more labeled, possibly captioned <fig> elements that form a logical group, perhaps with an overall <label>, <caption>, or other information on the <fig-group> element that applies to all of the figures inside it.
Positioning Graphic Objects:
In common with other block-level display objects, figures, graphics, and media objects may take the @position attribute, which is used to indicate where the object should be displayed. The values for this attribute are slightly overloaded, as two values indicate the position of the object (such as a graphic) in relation to the text:
anchor
Object should appear exactly where it is found in the narrative flow.
float
Object should appear near its position in the narrative flow, but it may float to the top of a page or column, or into another window.
and two indicate the relationship between the object (such as a graphic) and the page:
background
Object, typically a decorative graphic, should appear behind the narrative. This does not displace the narrative; ideally, both the graphic and the narrative are visible, and the narrative is legible through the graphic. Accessibility: Best practice is to mark all purely decorative graphics with an empty @alt attribute, so that screen readers and other accessibility devices know that they may be ignored.
margin
Object should appear beside the text that contains it in the narrative flow, as though it were in the margin of a printed page. Accessibility: Best practice is to mark all purely decorative graphics with an empty @alt attribute, so that screen readers and other accessibility devices know that they may be ignored.
Figure and Graphic Examples
Here is a typical figure with a label, caption, and graphic inside. As is usually the case, the <graphic> element is empty, only used to point to the external file.
...
<fig id="fg-012">
 <label>Figure 12. </label>
 <caption>
  <title>My Dog</title>
  <p>He is so sweet! This is his best pose.</p>
 </caption>
 <graphic xlink:href="frontView.png"/>
</fig>
...
Here is a figure that contains no graphics at all, but encloses some bulleted items in a <list>. Note: The prefix word attribute has been used in the example below to generate a more complicated label; each label will have the prefix “Pet ” (“Pet 1. Poodle”, “Pet 2. Persian Cat”, etc.).
<fig id="fg-012">
 <label>Figure 3. </label>
 <caption><title>Pet Show and Tell Order</title></caption>
 <list list-type="order" prefix-word="Pet">
  <list-item><label>1.</label><p>Poodle</p></list-item>
  <list-item><label>2.</label><p>Persian Cat</p></list-item>
  <list-item><label>3.</label><p>Weaver Finches</p></list-item>
  <list-item><label>4.</label><p>Gecko</p></list-item>
 </list>
</fig>
Here is a figure with three graphics inside. The <fig> element also contains a <label> and <title> that apply to all the graphics. In this example, the <graphic> elements are not empty; each <graphic> has its own <label> and <caption>.
...
<fig id="fg-012">
 <label>Figure 12. </label>
 <caption><title>Three Perspectives on My Dog</title></caption>
 <graphic xlink:href="frontView.png">
  <label>a.</label>
  <caption><p>View A: From the Front, Laughing</p></caption>
 </graphic>
    
 <graphic xlink:href="sideView.png">
  <label>b.</label>
  <caption><p>View B: From the Side, Best Profile</p></caption>
 </graphic>
    
 <graphic xlink:href="motionView.png">
  <label>c.</label>
  <caption><p>View C: In Motion, A Blur on Feet</p></caption>
 </graphic>
</fig>
...
Here is a figure group, with three figures inside, each of which contains a graphic. The <fig-group> also contains a <label> and a <caption> (which has a <title> inside) that apply to all the <fig> elements in the group. Each figure also has its own <label> and <caption>.
...
<fig-group id="dogpix4">
 <label>Figures 12-14 Bonnie Lassie</label>
 <caption><title>Three perspectives on My Dog</title></caption>
 <fig id="fg-12">
  <label>Figure 12.</label>
  <caption><p>View A: From the Front, Laughing</p></caption>
  <graphic xlink:href="frontView.png"/>
 </fig>
 <fig id="fg-13">
  <label>Figure 13.</label>
  <caption><p>View B: From the Side, Best Profile</p></caption>
  <graphic xlink:href="sideView.png"/>
 </fig>
 <fig id="fg-14">
  <label>Figure 14.</label>
  <caption><p>View C: In Motion, A Blur on Feet</p></caption>
  <graphic xlink:href="motionView.png"/>
 </fig>
</fig-group>
...
Other Display Objects
Several other block-level elements may also contain graphic objects, for example, see <boxed-text> and <disp-formula>. For specifics, see the Context Table for where the specific graphic/media objects (<graphic>, <inline-graphic>, <media>, and <inline-media>) may be used.
Additional (Non-integral) Display Objects
The elements for describing supplementary material that is associated with a book or book part but not part of the book (<inline-supplementary-material> and <supplementary-material>) may also contain graphics or media as part of their description of the additional material. Neither of the supplementary elements contains the supplementary object; they contain descriptions of the material. Such descriptions may incorporate <graphic> or <media> elements.
The external supplementary material itself may be tagged using BITS and contain <graphic> or <media> elements. These objects may be pointed to by the narrative text, but are not part of it.