CAIG Center for Entrepreneurship

"We Turn Ideas Into Business"

Wednesday, December 25, 2019

HTML Content Models - CONTENT

Understanding HTML5 Content Models

Earlier this week we looked at the new text-level and structural semantic elements html5 provides. Today I want to continue and talk about content models in html5, specifically the new outline algorithm for creating hierarchy.

Note: Unfortunately no browser or user-agent ever implemented the HTML5 document outline or likely ever will. You might prefer a series I wrote more recently about HTML5 and the document outline. Here’s the series wrap up, which links to all the other posts in the series.

Once again much of the content below comes to me via Jeremy Keith‘s book HTML5 for Web Designers, which I highly recommend.

Unfortunately some of what we’ll look at below isn’t yet supported by browsers. Some of it will be, but not all. Still I think what’s here is important to understand with an eye toward the future.

Venn diagram of html5 content models

Content Models

Before html5 we had two categories of elements, inline and block. With html5 we now have a more fine-grained set of categories with their own content models.
  • Text-level semantics — what were previously inline tags
  • Grouping content — block level elements like paragraphs, lists, and divs
  • Forms — everything inside form tags
  • Embedded content — images, video, audio, and canvas
  • Sectioning content — the new structural tags described in my previous post
Currently to create a hierarchical outline of our content we use a set of h1–h6 tags. They work for the most part, but can break down at times. Consider the following:
  1. <h1>Web Design</h1>
  2.     <p>Some general info about web design</p>
  3.     <h2>Layout</h2>
  4.     <p>Info about layouts</p>
  5.         <h3>Grids</h3>
  6.         <p>Info about grids</p> 
  7.     <h2>Typography</h2>
  8.     <p>Info about typography</p>
  9.     <h2>Color</h2>
  10.     <p>Info about color</p>
  11.     <h2>Design Principles</h2>
  12.     <ul>
  13.         <li>List of</lI>
  14.         <li>several different</lI>
  15.         <li>design principles</lI>
  16.     </ul>
  17. <p>Where in the outline does this paragraph belong?</p>
The above would produce the following outline based on the headings.
  • web design
  • layout
  • grids
  • typography
  • color
  • design principles
In general each paragraph below a heading belongs under that heading in the outline in the hierarchy, but do they have to?

Where in the outline does the very last paragraph belong? Is it under the Design Principles or does it belong under Web Design?

You can tell my intention based on the indentation, but a machine isn’t going to see that with the whitespace stripped and there’s no reason the code needed to be indented the way it is above.

Visually that last paragraph will look just like the one above it as well. Reading you wouldn’t really know which section it belongs to.

HTML5 helps solve the problem above.

I've seen the future. It's in my browser. HTML5

Sectioning Content Model

The first tool html5 provides is the section tag we discussed last time. Using the section element we can rewrite the above as
  1. <h1>Web Design</h1>
  2.     <p>Some general info about web design</p>
  3.     <section>
  4.         <h2>Layout</h2>
  5.         <p>Info about layouts</p>
  6.             <h3>Grids</h3>
  7.             <p>Info about grids</p> 
  8.         <h2>Typography</h2>
  9.         <p>Info about typography</p>
  10.         <h2>Color</h2>
  11.         <p>Info about color</p>
  12.         <h2>Design Principles</h2>
  13.         <ul>
  14.             <li>List of</lI>
  15.             <li>several different</lI>
  16.             <li>design principles</lI>
  17.         </ul>
  18.     </section>
  19. <p>Where in the outline does this paragraph belong?</p>
Once again the outline produced is the same as we saw above, but now it’s much clearer where the last paragraph belongs. We can do better though. Let’s mix in the header element and better define the different sections of the document.
  1. <h1>Web Design</h1>
  2.     <p>Some general info about web design</p>
  3.     <section>
  4.         <header>
  5.             <h2>Layout</h2>
  6.         </header>
  7.         <p>Info about layouts</p>
  8.         <section>
  9.             <header>
  10.                 <h3>Grids</h3>
  11.             </header>
  12.             <p>Info about grids</p>
  13.         </section>
  14.     </section>
  15.     <section>
  16.         <header>
  17.             <h2>Typography</h2>
  18.         </header>
  19.         <p>Info about typography</p>
  20.     </section>
  21.     <section>
  22.         <header>
  23.             <h2>Color</h2>
  24.         </header>
  25.         <p>Info about color</p>
  26.     </section>
  27.     <section>
  28.         <header>
  29.             <h2>Design Principles</h2>
  30.         </header>
  31.         <ul>
  32.             <li>List of</lI>
  33.             <li>several different</lI>
  34.             <li>design principles</lI>
  35.         </ul>
  36.     </section>
  37. <p>Where in the outline does this paragraph belong?</p>
Once again the above html produces the same outline. So far not much is really new other than the addition of some new tags. We could have done the same thing by using divs instead of section and header.

So where’s the new stuff?

CSS outline in Tinderbox application

HTML5 Outline Algorithm

In html5 each sectioning element has its own self-contained outline. What that means is we can start each section with an h1 tag and the algorithm will figure out the overall outline.
  1. <h1>Web Design</h1>
  2.     <p>Some general info about web design</p>
  3.     <section>
  4.         <header>
  5.             <h1>Layout</h1>
  6.         </header>
  7.         <p>Info about layouts</p>
  8.         <section>
  9.             <header>
  10.                 <h1>Grids</h1>
  11.             </header>
  12.             <p>Info about grids</p>
  13.         </section>
  14.     </section>
  15.     <section>
  16.         <header>
  17.             <h1>Typography</h1>
  18.         </header>
  19.         <p>Info about typography</p>
  20.     </section>
  21.     <section>
  22.         <header>
  23.             <h1>Color</h1>
  24.         </header>
  25.         <p>Info about color</p>
  26.     </section>
  27.     <section>
  28.         <header>
  29.             <h1>Design Principles</h1>
  30.         </header>
  31.         <ul>
  32.             <li>List of</lI>
  33.             <li>several different</lI>
  34.             <li>design principles</lI>
  35.         </ul>
  36.     </section>
  37. <p>Where in the outline does this paragraph belong?</p>
Believe it or not the above html where every heading is an h1 still produces the same outline in html5.
  • web design
    • layout
      • grids
    • typography
    • color
    • design principles
Under html 4 the outline would be
  • web design
  • layout
  • grids
  • typography
  • color
  • design principles
Quite a difference. It might seem somewhat strange to have every heading be an h1 tag, but it does have advantages. You won’t have to keep track of your overall hierarchy, only the hierarchy within a section.

Maybe not such a big deal with a single document, but it does allow our content to be more modular and portable, which will get to momentarily.

Other Sectioning Elements

Above I mentioned that the sectioning content model includes all the structural tags we talked about last time. It’s not only the section tag that creates its own self-contained outline.

Tags like aside, article, and nav also do the same.

While it wouldn’t be appropriate had I used article tags instead of section tags in the above code the same outline would have been produced.

group of lego people

The hgroup Element

Note: The W3C removed the hgroup element from the html5 spec in the spring of 2013 citing little real world use. It’s no longer recommended for use.

Sometimes you may want to use headings so you can better show and style visual hierarchy, but you don’t want the heading to be part of the document outline.

hgroup allows us to do just that. For example say you have the following markup:
  1. <hgroup>
  2.     <h1>Main heading</h1>
  3.     <h2>Tagline</h2>
  4. </hgroup>
Only the h1 above would be included in document outline. The h2 wouldn’t be included. Only the first heading, regardless of how many are there would be included in the outline.

The hgroup element can only contain h1–h6 tags and it’s meant to be used for subtitles, alternative titles, and tag lines.

Do we need hgroup? The above could have been coded as:
  1. <h1>Main heading</h1>
  2. <p class="tagline">Tagline<p>
This would produce the same outline and allow for the same visual styles, however the hgroup probably adds more semantic meaning and certainly uses a bit less code.

In addition to using hgroup to hide some headings from the document outline there are a few elements that by default are invisible to the document element and are called sectioning roots.
  • blockquote
  • fieldset
  • td
Even if you use headings inside the above elements those headings won’t be part of the document outline under html5.

Modular lego building

Modular Content

The new outline algorithm helps us create content that is more modular. The idea of not needing to keep track of your hierarchy might not seem like such a big deal until you consider what happens when you move a piece of content around.

For example typical of many blogs is to display the title and a short paragraph of several posts on the main blog page. In the individual posts the headings would be marked up with an h1. On the main blog page you might have an h1 for the page and then have each of the blog post titles as an h2.

With the new outline algorithm you can move the post titles back and forth with the same h1 heading and let the outline algorithm figure out the hierarchy.

This makes any section of content more portable as we can mix it in with other content without worrying that it might break the hierarchy of the page.

While you’ll probably never have need you can now also structure a document with more than 6 levels. Ultimately we can now create an infinite amount of levels using the same h1–h6 elements in nested sections.

Scoped Styles
A new problem is created in being able to move content around from document to document and that’s in the styles that get applied to that content.

Our modular content will inherit the styles of the parent document, which may not be what we want. html5 offers a solution with the boolean scoped attribute that can be applied to the style element as seen below.
  1. <article>
  2.     <style scoped>
  3.         h1 {styles here}
  4.     </style>
  5.     <h1></h1>
  6.     <p></p>
  7. </article>
In the above code the h1 of our article will be the scoped styles regardless of where the article is displayed. This allows us to move not only content, but the styles associated with that content easily.

 Collage of browser logos

Browser Support

In order to use the new semantic elements we defined those elements in our stylesheet as display: block to ensure they won’t break our layouts. We should now add the hgroup element.
  1. section, article, header, footer, nav, aside, hgroup {
  2.     display: block;
  3. }
We’d of course need to create the element for IE as we did with the other elements or include the html5shiv script.
  1. document.createElement('hgroup');
Now for the bad news. Browser support for the html5 outline algorithm is currently not good.

However the good news is you don’t have to use an h1 to start each new section. You can continue to use h2 and h3 tags inside sections to produce the outline you want.

We’ll lose the portability benefits until browsers are supporting the new algorithm, but we can start preparing for when they do offer support.

For now it’s probably better to stick with using headings as you always have, though it is safe to enclose your headings in the new semantic elements.

html5 logo

Summary

HTML5’s sectioning content model gives us greater control over the hierarchy of our documents. The new outline algorithm provides for an unlimited number of heading levels and helps make our content more modular and portable.

At the moment there’s limited browser support the the html5 outline algorithm, but we can still prepare for it while using h1–h6 tags as we do now.

We won’t be able to take advantage of some of the benefits the new outline algorithm will gives us, but we can prepare our documents for when browser support is more robust.

It will probably feel a little strange to markup a document with multiple h1 tags and leave it to the browser to sort out the hierarchy, but hopefully you can see the advantages in such an approach.

SOURCE: https://vanseodesign.com/web-design/html5-content-models/
_______________

HTML Content Models

HTML document is represented with a Document object (DOM) and it has his own address called URL. HTML document consists of a series of HTML elements.
The HTML elements have theirs own content model: a description of the element’s expected contents. The contents of an element are its children in DOM. That means that every HTML element must have a content that abides established rules. As a result there are several content categories into which the HTML elements fall.

There are seven major content models:
  • Metadata Content
  • Flow Content
  • Sectioning Content
  • Phrasing Content
  • Heading Content
  • Embedded Content
  • Interactive Content
The HTML element may be located in one or more categories or may be not included to any category. For example, all the HTML header elements and the HTML sectioning elements also belong to the HTML flow elements, some the HTML phrasing element belong to the flow elements.

Metadata content

Metadata content defines the HTML elements that establish the presentation and behaviour of the rest of the document’s content or set up the relationship between the document and other external documents.
Metadata content elements contains information about the page: styles, scripts and all data that are used to render the web page.
Majority of these elements are placed in the HTML <head> element of the document.

<base>, <link>, <meta>, <nonscript>, <script>, <style>, <template>, <title>

Flow content

Flow content defines most element that are used in body of documents.

<a>, <abbr>, <address>, <area> (if it is a descendant of a map element) <article>, <aside>, <audio>, <b>, <bdi>, <bdo>, <blockquote>, <br>, <button>, <canvas>, <cite>, <code>, <data>, <datalist>, <del>, <details>, <dfn>, <div>, <dl>, <em>, <embed>, <fieldset>, <figure>, <footer>, <form>, <h1>, <h2>, <h3>, <h4>, <h5>, <h6>, <header>, <hr>, <i>, <iframe>, <img>, <input>, <ins>, <kbd>, <keygen>, <label>, <main>, <map>, <mark>, <math>, <menu>, <meter>, <nav>, <noscript>, <object>, <ol>, <output>, <p>, <picture>, <pre>, <progress>, <q>, <ruby>, <s>, <samp>, <script>, <section>, <select <small <span <strong <sub <sup <svg <table <template <textarea <time <u <ul <var <video <wbr>

Sectioning content

Sectioning content defines elements that allow to organize document into logical structure. The HTML sectioning content elements create outline that defines the scope of <header> and <footer> elements, and heading content(<h1-h6>.

<article>, <aside>, <nav>, <section>

Phrasing content

Phrasing content defines elements that content is text of the document or elements that mark up that text at the intra-paragraph level. The HTML elements that belong to this category are the same that used to belong to the HTML inline elements.

<a>, <abbr>, <area> (if it is a descendant of a map element) <audio>, <b>, <bdi>, <bdo>, <br>, <button>, <canvas>, <cite>, <code>, <data>, <datalist>, <del>, <dfn>, <em>, <embed>, <i>, <iframe>, <img>, <input>, <ins>, <kbd>, <keygen>, <label>, <map>, <mark>, <math>, <meter>, <noscript>, <object>, <output>, <picture>, <progress>, <q>, <ruby>, <s>, <samp>, <script>, <select>, <small>, <span>, <strong>, <sub>, <sup>, <svg>, <template>, <textarea>, <time>, <u>, <var>, <video>, <wbr>

Heading content

Heading content defines the HTML elements that define headers:

<h1>, <h2>, <h3>, <h4>, <h5>, <h6>

Embedded content

Embedded content defines elements that their content is imported from other resources.
Some embedded content elements can have fallback content in case if external reosurces can not be used.

<audio>, <canvas>, <embed>, <iframe>, <img>, <math>, <object>, <picture>, <svg>, <video>

Interactive content

Interactive content defines elements that are designed for users interaction.

<a> ( if the href attribute is present), <audio> (if the controls attribute is present), <button>, <details>, <embed>, <ifreame>, <img> (if the usemap attribure is present), <input> (if the type attribute is not in the hidden state), <keygen>, <label>, <select>, <textarea>, <video> (if the controls attribute is present)

SOURCE: http://majadc.com/html-content-models

CAIG Center For Entrepreneurship

Author & Editor

Has laoreet percipitur ad. Vide interesset in mei, no his legimus verterem. Et nostrum imperdiet appellantur usu, mnesarchum referrentur id vim.

0 comments:

Post a Comment