Popis: |
Most IT professionals and business professionals think of metadata as existing in and relating to the world of structured information. The world of data and associated metadata is divided into two major types of technology: structured and unstructured. This chapter discusses the importance of unstructured business metadata and describes the major benefits that managing it in an organized, systematic way can bring to the enterprise. Unstructured technology consists generally of text. In order to make sense of the text, a distillation process occurs, which encompasses the following steps: words are separated into business-relevant words and non-business-relevant words; extraneous words are removed; words are reduced to a common stem; themes of words are created for a document for the purpose of understanding what the document is about; and industrial recognition of the words occurs by classifying documents according to their content. In the unstructured world, unlike the structured world, there is no format, no structure. Anything can be said in any way in any language. Typical unstructured applications include e-mail, telephone transcriptions, spreadsheets, text file documents, and file reports. The recognition of the world of unstructured data has spawned a new class of technologies aimed at capturing and managing the unstructured data and associated business metadata in the environment. These technologies can add significant value to the business metadata and provide increasing opportunities to expand our business capabilities. |