Skip to main content

Home/ DITA XML/ Group items tagged semantic

Rss Feed Group items tagged

Jason Owen

Subject classification with DITA and SKOS - 4 views

  • In a topic-oriented architecture such as DITA, content is authored in small, independent units that are assembled to provide help systems, books, courses, and other deliverables. Each unit of information answers a single question for a specific purpose. That is, each topic has specific, independent subject matter
  • Because each topic has a specific meaning, DITA topics are tailor-made for semantic processing. However, current semantic processors can't read the text of a topic to find out what it means. What's missing is a formal declaration of the topic's subject matter that a semantic processor can understand
  • Simple Knowledge Organization System (SKOS) provides a standard for indicating the subject matter of content. SKOS lets you define the subjects for a particular subject matter area (organizing these subjects as a taxonomy if desired) and then classify each piece of content to indicate its subject. For instance, using SKOS, you could define configuration and security as subjects, and classify the three example topics that relate to those subjects so that users could browse the subjects to find the content regardless of whether the words "configuration" or "security" actually appear in the text.
  • ...16 more annotations...
  • SKOS is expressed with Resource Description Framework (RDF), the fundamental language of the Semantic Web.
  • DITA has a natural fit with SKOS in solutions where DITA topics are classified with subjects that are expressed in SKOS for runtime processing.
  • Formal subjects are often defined by glossary topics or other topics that already exist within the published information set.
  • Even if you don't include the subject definitions in your published information, you can use your standard content tools for your subject definitions. For instance, you can author the subject definitions with your XML editor, and archive and version the subject definitions along with your content in your content management or version control system.
  • Subject classification is as much a part of the information architecture of your content as the navigational organization.
  • The DITA topic specifies the subject with a specialized section element that includes the following kinds of information:Default labels, including synonyms and denotative imagesNotes on the definition and on the scope of coverage for the subjectListing 1 shows an example of the definition for the Configuring subject:
  • Because the meaning of a formal topic should never vary based on its use, these fields should be part of the topic.
  • You can have multiple schemes for the same subjects. For instance, different audiences might be interested in a different subset of the taxonomy.This approach of imposing alternative organizational structures on subjects fits well with the standard use of DITA maps for separating context from content, allowing different organizations to be imposed on the same content. That is, the scheme can be considered a special kind of context for subject definition topics.
  • Schemes can use non-DITA subject definitions (such as publicly-defined SKOS, OWL, or TopicMaps subjects). You cite the public identifier of the subject with the subjectdef element and identify the subject definition format with the format attribute. This allows you to incorporate publicly-defined subjects into your schemes, or to integrate a formal ontology maintained by your organization with concepts that are specific to your content.
  • To classify content, another map specialization associates formal subjects with topics (see Figure 4).
  • Inside the topicref element that references and contains references to the classified content, you nest a topicsubject element to specify the subjects of the content. You can identify a primary subject with the href attribute of the topicsubject element, which also contains subjectref elements for the secondary subjects. If no subject is primary, the topicsubject element should be a container without the href attribute.
  • In the same way that subject schemes can cite public non-DITA subjects, you can classify DITA content with SKOS, OWL, or TopicMaps subjects by citing the public URI identifiers with the subjectref element and setting the format attribute.
  • The central circle represents a conceptual topic (such as Security), which:Has a broader relationship to a subject (perhaps System Concerns) within a schemeIs classified by two other subjects (perhaps the Background type and the Novice User role)Contributes to the classification of one topic (such as Web Security)Occupies the second position in a navigation sequence (perhaps under a Glossary heading)
  • Because subjects are defined by special topics, you can include the subject definition in the content and use it for classification. For instance, the subject topic for Security can both classify content about security and describe security within the Web site or help system content. Figure 5 illustrates this scenario:
  • Because the classification map is distinct from the scheme map, you can apply multiple schemes to the same classification without requiring changes to the classification. To combine the scheme and classification maps for a deliverable, a higher-level map can refer to both maps using a DITA map reference (see Figure 6).
  • You might process a single map to generate both an HTML representation of the content and a SKOS representation of the subjects and classification.
  •  
    Use a DITA specialization to manage the subject matter of your document content -- that is, identify and process your content based on what each topic is about. With the approach outlined in this article, you can take advantage of the technologies of the
Jason Owen

Survey: Semantics in Information Standards for Technical Communication - 2 views

You are invited to participate in a survey study about the importance of semantics in information standards for technical communication. Participation in this study should take about five minutes o...

dita xml docbook s1000d standards

started by Jason Owen on 01 Nov 10 no follow-up yet
Jason Owen

Using Dublin Core - 0 views

  • The linkage between a metadata record and the resource it describes may take one of two forms: elements may be contained in a record separate from the item, as in the case of the library's catalog record; or the metadata may be embedded in the resource itself.
  • The Dublin Core metadata standard is a simple yet effective element set for describing a wide range of networked resources. The Dublin Core standard includes two levels: Simple and Qualified. Simple Dublin Core comprises fifteen elements; Qualified Dublin Core includes three additional elements (Audience, Provenance and RightsHolder), as well as a group of element refinements (also called qualifiers) that refine the semantics of the elements in ways that may be useful in resource discovery.
  • Another way to look at Dublin Core is as a "small language for making a particular class of statements about resources". In this language, there are two classes of terms -- elements (nouns) and qualifiers (adjectives) -- which can be arranged into a simple pattern of statements.
  • ...4 more annotations...
  • Dublin Core can be seen as a "metadata pidgin for digital tourists": easily grasped, but not necessarily up to the task of expressing complex relationships or concepts.
  • Each element is optional and may be repeated.
  • In general Dublin Core metadata describes one manifestation or version of a resource, rather than assuming that manifestations stand in for one another. For instance, a jpeg image of the Mona Lisa has much in common with the original painting, but it is not the same as the painting.
  • Dublin Core was originally developed with an eye to describing document-like objects
1 - 7 of 7
Showing 20 items per page