Knowledge Builders

what is the principle of controlled indexing language

by Llewellyn Anderson Published 3 years ago Updated 2 years ago

What is the principle of controlled indexing language?

  • Best Answer. Controlled vocabularies solve the problems of homographs, synonyms and polysemes by a bijection between concepts and authorized terms.
  • Video Answer
  • Similar Questions. What is the process of indexing? Indexing is regarded as the process of describing and identifying documents in terms of their subject contents.

Full Answer

What is vocabulary control in indexing?

Vocabulary Control. The term ‘vocabulary control’ refers to a limited set of terms that must be used to index documents, and to search for these documents, in a particular system. It may be defined as a list of terms showing their relationships and used to represent the specific subject of the document.

What are the components of indexing language?

Like natural language, an indexing language consists of three elements: (a) Vocabulary (not free vocabulary, but controlled vocabulary), (b) Syntax, and (c) Semantics. All the structured indexing languages are based upon careful subject analysis.

What is the difference between controlled and free indexing language?

Controlled indexing language – only approved terms can be used by the indexer to describe the document. Natural language indexing language – any term from the document in question can be used to describe the document. Free indexing language – any term (not only from the document) can be used to describe the document.

What are the rules of syntax for indexing?

These rules of syntax will vary from one indexing language to another. Coordination of concepts is carried out by the indexer at the time of indexing (i.e. at input stage) in pre-coordinate indexing and by the searcher at the time searching (i.e. at output stage) in post-coordinate indexing.

What is controlled indexing language?

Controlled Indexing Language Controlled indexing languages are indexing languages in which the terms used to represent subjects and the process by which terms are assigned to a particular document are controlled or executed by a person.

What is the purpose of controlled vocabulary?

Controlled vocabularies provide a way to organize knowledge for subsequent retrieval. They are used in subject indexing schemes, subject headings, thesauri, taxonomies, and other knowledge organization systems.

What are the examples of indexing language?

A controlled vocabulary and formalized structure are features of an artificial indexing language. The extreme example of an artificial indexing is the notation of a classification scheme; instead of natural language terms, heat treatment of aluminum, or the more formalized aluminum-heat treatment, we use 669.71. 04.

What is importance of indexing language?

Indexing language is developed and used for a special purpose, i.e. for the representation of the thought content of the documents as well as queries of the users. A natural language is a free language and there is no control of synonyms and homographs.

What is controlled vocabulary example?

Examples of controlled vocabularies include subject headings, thesauri, ontologies, and taxonomies. Using a controlled vocabulary will aid in searching and finding your data and will make your data more shareable with researchers in the same discipline.

Why is there need to use controlled vocabulary in indexing periodicals?

The use of controlled vocabularies ensures consistent description of resources and their attributes and enables effective information retrieval and resource discovery.

How many types of indexing languages are there?

There are three main types of indexing languages.

What is the nature of indexing language?

Indexing languages are a subset of natural languages. These languages, used to describe documents, are part of the information science techniques. The goal is to represent information in such a way as to improve the retrieval of relevant documents.

What is the indexing language that uses the language of the author?

Derived Term of IL: When an indexer prepares an index by using the author is known as a derived term of indexing language. All the descriptions are taken from the documents itself. Sometimes it is called natural language or free text indexing. As far examples; author index, citation index.

What are the types of indexing?

There are primarily three methods of indexing:Clustered Indexing.Non-Clustered or Secondary Indexing.Multilevel Indexing.

What is a subject indexing language for information retrieval?

Indexing language is the language used to describe a subject or other aspects. of information in a library catalogue or an index. Indexing language is defined. as- 'a list of terms or notations that might be used as access points in an index.' An indexing language may also be referred to as a retrieval language.

What do you call a component of indexing language which means grammatical structure?

SyntaxVocabulary is a list of terms/words used in a particular natural language. Syntax comprises a grammatical structure or a set of rules that govern the sequence of occurrence of terms/words in a sentence.

What is indexing language?

An index serves as an intermediary between the information contained in a document and their potential users. The language which serves as the medium of communication between the two is the indexing language [12]. In other words, the language used in expressing the concepts representing the contents of a document in an index is the indexing language. Indexing language is an artificial language, which uses the vocabulary of a natural language, but is different from that language in many ways. A detailed discussion on indexing language may be found in the next chapter.

What is controlled vocabulary?

A controlled vocabulary is defined as “an organized arrangement of words and phrases used to index content and/or to retrieve content through browsing or searching” (Harpring, 2010 ). Basically, a controlled vocabulary consists of a list of predetermined terms or descriptors that are assigned to records in databases by human indexers to describe the content of those information items. If you think this sounds similar to tagging items online, you are on the right track, except that this kind of indexing is carried out by trained information professionals, rather than being user-generated. The aim of controlled vocabularies is to organize records in electronic systems in a consistent and standardized fashion so that items that are about similar topics are organized under the same headings and will, therefore, be easier for searchers to retrieve. Popular controlled vocabularies that are widely used in database indexing include the Library of Congress Subject Headings and MeSH (Medical Subject Headings). For instance, imagine you have two items with very different titles, but each effectively deals with the same subject, such as the following two items:

What is a HILCC?

The HILCC (Hierarchical Interface to Library of Congress Classification) project at Columbia University Library generated a structured, hierarchical menuing system for subject access to resources in the library’s electronic collections by using the Library of Congress Classification numbers as provided in standard catalogue records. Developed jointly by the library’s systems, cataloguing and reference staff, this classification mapping table linked each Library of Congress Classification range with entry vocabulary in a three-level subject tree. Classification numbers and other metadata elements were extracted from catalogue records in the OPAC on a weekly basis and matched against the HILCC mapping table and then used to create browsable subject category menus to guide users to electronic resource subject content ( Davis,2002 ). Frank and Paynter (2004) provide another example of an interoperability system. They developed an interoperability system which automatically assigned Library of Congress class numbers to a work given its set of Library of Congress subject headings. The system organised the Library of Congress Classification in tree structure, i.e. the root node comprised all possible topics and the leaf node comprised the most specialised topics. This interoperability system used machine learning techniques and training data from a large library catalogue to learn a model that maps from sets of Library of Congress List of Subject Headings to classification from the Library of Congress Classification tree. Mapping is one of the methods of interoperability. Jianbo and Hanqing (2005) analysed the feasibility mapping of enumerative classification, discussed the mode of mapping and the principles of an automatic mapping system. The authors constructed four semantic mapping relationships based on the differences of class meaning between the Chinese Library Classification and the Dewey Decimal Classification. When one has to choose between a general classification and special classification as a switching language, one should prefer general classifications as they provide access to collections nationally and internationally ( Mai, 2004c ).

Why is it so difficult to express complex relationships?

Difficulty in expressing complex relationships, due to the absence of well-defined rules.

Who proposed the faceted classification system?

Louise Spiteri analyzed the rules laid down by S.R. Ranganathan and the CRG for designing faceted classification systems, and proposed her own, simpler set.

Can competing views on genealogical classification be satisfactorily represented in a single diagram?

competing views on genealogical classification cannot be satisfactorily represented in a single diagram ( Section 23.5.3.2 );

What are the different types of indexing languages?

There are three main types of indexing languages. Controlled indexing language – only approved terms can be used by the indexer to describe the document. Natural language indexing language – any term from the document in question can be used to describe the document.

Why use controlled vocabulary?

Controlled vocabularies are often claimed to improve the accuracy of free text searching, such as to reduce irrelevant items in the retrieval list. These irrelevant items ( false positives) are often caused by the inherent ambiguity of natural language. Take the English word football for example. Football is the name given to a number of different team sports. Worldwide the most popular of these team sports is association football, which also happens to be called soccer in several countries. The word football is also applied to rugby football ( rugby union and rugby league ), American football, Australian rules football, Gaelic football, and Canadian football. A search for football therefore will retrieve documents that are about several completely different sports. Controlled vocabulary solves this problem by tagging the documents in such a way that the ambiguities are eliminated.

What is XFML in semantic web?

To create a Semantic Web, it may be necessary to draw from two or more metadata systems to describe a Web page's contents. The eXchangeable Faceted Metadata Language (XFML) is designed to enable controlled vocabulary creators to publish and share metadata systems. XFML is designed on faceted classification principles.

What is a controlled vocabulary scheme?

Controlled vocabulary schemes mandate the use of predefined, authorised terms that have been preselected by the designers of the schemes, in contrast to natural language vocabularies, which have no such restriction.

How do controlled vocabularies solve homographs?

Controlled vocabularies solve the problems of homographs, synonyms and polysemes by a bijection between concepts and authorized terms. In short, controlled vocabularies reduce ambiguity inherent in normal human languages where the same concept can be given different names and ensure consistency.

What is subject heading in the Library of Congress?

For example, in the Library of Congress Subject Headings (a subject heading system that uses a controlled vocabulary), authorized terms—subject headings in this case—have to be chosen to handle choices between variant spellings of the same word (American versus British), choice among scientific and popular terms ( cockroach versus Periplaneta americana ), and choices between synonyms ( automobile versus car ), among other difficult issues.

When did government agencies start developing controlled vocabularies?

In the 1950s , government agencies began to develop controlled vocabularies for the burgeoning journal literature in specialized fields; an example is the Medical Subject Headings (MeSH) developed by the U.S. National Library of Medicine.

What is vocabulary control?

Vocabulary Control refers to the process of creating, maintaining, and using a controlled vocabulary, where a limited set of terms must be used to index documents, and to search for these documents, in a particular system. It may be defined as a list of terms showing their relationships and used to represent the specific subject of the document. ...

Why is vocabulary control important?

In short, vocabulary control helps in overcoming problems that occur due to natural language of the document’s subject. Hence, if vocabulary control is not exercised different indexers or the same indexer might use different terms for ...

How does an information system help the user?

An information system may help the user by explicitly assigning index terms (that is, words or notations) to the documents and controlling, at least in the case of alphabetical (word) systems, the semantic and often the syntatic relationships between these index terms the words (which may be subject headings or descriptors) are assigned from recognized subject heading lists or thesauri, and the notations from recognized classification schedules, and thus use controlled vocabulary. A controlled vocabulary is one in which there is only one term or notation in the vocabulary for any one concept. The Library of Congress List of Subject Headings is an example of a controlled alphabetical vocabulary, and the Dewey Decimal Classification is an example of a notational vocabulary (By definition, all notational vocabularies must be controlled).

Why is the entry vocabulary important?

The entry vocabulary now becomes doubly important, because not only does it contain all the non-preferred terms as well as the preferred terms, but being arranged alphabetically-it also forms our only means of access to the systematic arrangement, via the voice vocabulary. We need to look up terms in which we are interested in the entry vocabulary, which will tell us what codes have been used to denote them:

How to show semantic relationships?

The above discussion on showing semantic relationships is related to one method of arrangement, the alphabetical. We can also show relationships by juxtaposition, that is, grouping related concepts together in a systematic arrangement to form a classification scheme. Such an arrangement will show hierarchical relationships as well as coordinate relationships, and may well also show others such as instruments and materials. In this way, a substantial part of the cross-reference structure required by an alphabetical arrangement is eliminated, because the relationships are shown by the way that the concepts are grouped. We normally arrange books on the shelves of a library in this way in order to help the user, who will find the books they are interested in shelves in the same area.

What is specific and direct entry?

Specific and Direct Entry - The principles of specific and direct entry require that a document be assigned directly under the most specific subject that accurately and precisely represents its subject content. Common Usage - This principle states that the word (s) used to express a subject must represent common usage.

Is thesauri a vocabulary control device?

Both thesauri and subject headings list are vocabulary control devices, but they are used in different situation. The essential characteristics which differentiate them are considered in the following sections:

1.Indexing Languages - an overview | ScienceDirect Topics

Url:https://www.sciencedirect.com/topics/social-sciences/indexing-languages

32 hours ago  · An indexing language is ‘artificial’ in the sense that it may depend upon the vocabulary of natural language, though not always, but its syntax, semantics, word forms, etc. …

2.Controlled indexing languages (Chapter 10) - Essential …

Url:https://www.cambridge.org/core/books/essential-classification/controlled-indexing-languages/DF2DF712094359FB8D4EA341872358FC

24 hours ago  · An indexing language is a controlled language. There is a restriction in using the words/terms in indexing language. Synonyms and homographs are controlled. There is …

3.Indexing language concept types and characteristics

Url:https://www.slideshare.net/DrUtpalDas/indexing-language-concept-types-and-characteristics

23 hours ago I.7 Indexing Language. An index serves as an intermediary between the information contained in a document and their potential users. The language which serves as the medium of …

4.Controlled vocabulary - Wikipedia

Url:https://en.wikipedia.org/wiki/Controlled_vocabulary

1 hours ago We use cookies to distinguish you from other users and to provide you with a better experience on our websites. Close this message to accept cookies or find out how to manage your cookie …

5.2. What Are Controlled Vocabularies? - getty.edu

Url:http://getty.edu/research/publications/electronic_publications/intro_controlled_vocab/what.pdf

27 hours ago  · Controlled indexing language: • Controlled indexing language refers to the indexing language in which only approved terms are allowed to be used to describe the document. …

6.Vocabulary Control - LIBRARIANSHIP STUDIES

Url:https://www.librarianshipstudies.com/2017/03/vocabulary-control.html

16 hours ago Controlled vocabularies provide a way to organize knowledge for subsequent retrieval. They are used in subject indexing schemes, subject headings, thesauri, taxonomies and other …

A B C D E F G H I J K L M N O P Q R S T U V W X Y Z 1 2 3 4 5 6 7 8 9