User Tools

Site Tools


formalized_languages

Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Both sides previous revision Previous revision
Next revision
Previous revision
formalized_languages [2017/11/16 13:34]
andries
formalized_languages [2018/11/12 16:23]
andries [3. Categories of kinds of relations]
Line 1: Line 1:
-====== ​Formalized ​languages ======+====== ​Formalization of natural ​languages ======
  
-The Gellish family of Formalized Languages ​is based on language independent ​semantics and syntax, whereas it is nevertheless ​close to natural languages. ​This is realized by the recognition that concepts are basically language independent things, although they are denoted in different languages by different termsand by the discovery that it is possible to express ideas in a language ​natural independent way as collections of basically binary relations that are arranged in a network of relations. ​Such expressions can be unambiguously ​be interpreted when they conform to patterns in the Gellish universal basic semantic ​structureFor this reason, each concept is represented in the Gellish family by a natural language independent unique identifier (UID) and each expression is part of a pattern that is derived from the basic semantic ​structure. Thus the languages of the Gellish family are based a universal basic semantic structure ​and a generic syntax, as will be described in this wiki. \\+The Gellish family of Formalized Languages ​use the vocabulary of natural languages, whereas its expressions are based on semantics ​(meaning) ​and syntax ​(expression structure) that is universal and independent of any particular language. As a result Gellish expressions are close to natural languages. ​Therefore, they are also called Controlled Natural Languages (CNLs). The formalization ​is realized by building on two observations:​ 1) The recognition that concepts ​and the meanings of expressions ​are basically language independent things, although they are denoted in different languages by different terms and phrases and 2) By the discovery that it is possible to express ideas in a natural ​language ​independent way as collections of basically binary relations that are arranged in a network of relations. ​ \\ 
 +The languages of the Gellish family are called //formal// languages, because they are formally defined, so that computers can unambiguously interpret the meaning from the expressions. Computers cannot do that with natural languages, neither they can do that with databases without precise knowledge about the dedicated data models of those databases. Information that is expressed in a Gellish formal language is computer interpretable and application system independent. It can be generated when exporting data from systems for data exchanged between systems and parties in [[:​gellish_messages|messages in Gellish]] and can be imported, interpreted and stored in [[:​gellish_databases|database systems]] and files. Queries can be expressed in Gellish formal languages as well. The language has a native standard Gellish Expression format (syntax), although it can also be stored in other formats. 
 +The expressions can be unambiguously ​interpreted when they conform to the Gellish universal basic semantic ​patternsTo enable natural language independent interpretations, each concept is represented in the Gellish family by a natural language independent unique identifier (UID) and each expression is part of a pattern that complies with the basic semantic ​patterns and uses concepts that are selected from or added to the Gellish Dictionary. Thus the languages of the Gellish family are based a universal basic semantic structurea generic syntax ​and a standardized vocabulary, as is described in this wiki and described in more detail in the book '​Semantic Information Modeling Methodology'​. \\
  
-The Gellish ​family includes natural specific language variants by adding a dictionary that relates ​the UIDs of the concepts ​to the terms in a natural language ​and possibly to synonym termscodes or abbreviations ​in a particular language community within that languageIn this way the family currently includes Formal English[[:​gellish_nederlands|Formal Dutch]] ​and parts of other formal language variantsThese formal ​languages ​thus become [[http://​www.gellish.net/​index.php/​semantic-modeling.html|semantic modeling languages]] for the modeling of knowledgerequirements,​ definitions,​ queries as well as information about any individual things and individual processes with as objective enabling universal system independent ​data exchange ​and data integration\\ +The Gellish ​languages are intended for facilitating data integration and the export and import ​of data exchange messages from and to database systems. Therefore Gellish enabled software is not intended to become ​natural language ​interpretersbut only be interpreters of formalized languages ​in tabular formThis differs from conventional data definitions for databases and for data exchange and data interpretation. Conventionally information analystsdata modelers ​and programmers use 'data models'​ to define the structure ​of data and the meaning of '​instances'​Data models are usually not called 'languages' and input or output is usually not called '​expressions'​Howeverconventional ​data models define expressions structures ​and terminology,​ which together in fact define some form of dedicated languages. But for example database tables or data exchange files cannot easily be read as natural language expressions. The formal ​languages ​of the Gellish family ​are a step further towards natural ​languages. In a Gellish Expression table, the information is expressed as collections of expressions, ​each with a structure ​that is in essence ​binary relationThe core elements of each line in such a table can be read as (nearly) normal natural ​language ​expression. The definition of the Gellish ​family of languages ​can be considered equivalent to a very large and flexible data model (actually it is a further development of the generic data model of the ISO 15926-2 standard)The formalization however implies that Gellish expressions do not have the free form that natural languages have.\\
-The languages are called //​formal// ​languages, ​because they are formally defined, so that computers can unambiguously interpret the meaning from the expressions, ​Computers cannot do that with natural languages and without knowledge about the dedicated data models they cannot do that with databases either. Information ​that is expressed ​in a Gellish formal language is computer interpretable and application system independentIt can be generated when exporting data from systems for data exchanged between systems and parties ​in [[:​gellish_messages|Gellish Messages]] and can be imported, interpreted and stored in [[:​gellish_databases|database systems]] and files. Queries can also be expressed in Gellish formal ​language. The language has a native standard ​Gellish ​Expression format, although it can also be stored in other formats.+
  
-===== 1. Logic =====+===== 1. Simplifications of natural languages ===== 
 + 
 +Gellish can thus be described starting from a conventional data model and generalizing such a model towards the flexibility and capabilities of natural languages. But it can also be described starting from natural languages and adopting simplifications and constraints on the allowed expressions to a level that computer software is able to interpret the meaning of the expressions,​ apply logic and to act accordingly.\\ 
 +Starting from a natural language, Gellish adopted the following simplifications:​ 
 +  * Split expressions in collections of binary relations ('//​basic semantic units//'​),​ because in the target language usage (business and technical data processing) every idea can be expressed as a collection of one or more binary relations between related objects. 
 +  * Each binary relation shall be classified by a kind of relation that is predefined in the formal dictionary. The meaning of a kind of relation is independent of the sequence in which the related objects are arranged in the expression (at the left or at the right hand of the kind of relation). Because kinds of relations are usually denoted by a phrases in natural language, each expression has a reading direction. Because the inverse relation is equivalent, there are always two '​phrase types'':​ a base phrase and an inverse phrase for cases where the related objects appear in the reverse order. As a consequence,​ the kind of relation defines the first kind of role and the second kind of role, but allows for base phrases as well as inverse phrases for expressing an idea. (N.B. The phrase type UID represents this order in a language independent way).    
 +  * Thus the core of each binary relation consists of two related objects, a kind of relation, a phrase type and optionally the two roles or kinds of roles that are played by the related objects.  
 +  * Separate concepts from terminology. This means that each idea, relation, related object or role and each of their kinds is represented throughout the family of languages by a natural language independent unique identifier (UID), whereas multiple names, aliases and translations may denote those UIDs in various '​language communities'​. This enables the unambiguous use of synonyms as well as homonyms. 
 +  * Make intentions of expressions explicit by separating the theme of an expression from the explicit intention with which an idea is expressed. This means that grammatical variations in expressions between questions, statements, confirmations,​ denials, etc. are largely eliminated, while maintaining the variety of expressions. This supports computer dialogues instead of the common limitation to statements and facts (this follows from the '​Speech act theory'​ of John Searl). 
 +  * Replace past and future tenses by a uniform expression accompanied by explicit expression of time indications where necessary. These largely reduce that grammatical variety of expressions about was has been the case and what may become the case. 
 +  * Use only concepts that are selected from the taxonomic dictionary of the formal language, or add them and/or their aliases according to the rules for extension of the dictionary.  
 +  * Replace plural names, through using singular terms where possible in combination with explicit cardinalities and numbers of items in collections. This eliminates nearly all plural terms from the dictionary of the formal language.  
 +  * Add explicit contextual information to each basic semantic unit. This eliminates the problem of context dependence of interpretations.\\ 
 + 
 + 
 +The Gellish family includes natural specific language variants by adding a dictionary that relates the UIDs of the concepts to the terms in a natural language and possibly to synonym terms, codes or abbreviations in a particular language community within that language. In this way the family currently includes Formal English, [[:​gellish_nederlands|Formal Dutch]] and parts of French, German and Chinese language variants. These formal languages thus become [[http://​www.gellish.net/​index.php/​semantic-modeling.html|semantic modeling languages]] for the modeling of knowledge, requirements,​ definitions,​ queries as well as information about any individual things and individual processes with as objective enabling universal system independent data exchange and data integration. And with one of its advantages that information that is expressed in one natural language variant can be searched and presented in any other language variant for which a dictionary is available.\\ 
 + 
 +===== 2. Logic =====
  
 People implicitly make use of logic when interpreting natural languages. For example, we know that the concept car is a subtype of vehicle. Thus when we talk about a vehicle we know that that may be about a car. In order to enable software to draw similar conclusions when using formal languages it is necessary to include relations between concepts in the definition of the concepts in the formal language. Definitions of concepts therefore relate them to other concepts by standardized kinds of relations. In particular, each concept is defined as a subtype of one or more direct supertype concepts, thus forming a taxonomy of concepts. Gellish formal languages therefore includes an electronic [[:​gellish_english_dictionary|Taxonomic Dictionary]] that provide the concepts and terminology of the languages, as well as relations of particular kinds between the defined concepts, as far as required for the definition of the meaning of the concepts. Furthermore,​ every concept is identified by a natural language independent unique identifier (UID). Those UIDs represent the concepts independent of any language. Every concept can be denoted by various terms, names, codes or phrase, as are used by different language communities in different natural languages. Information that is expressed in one of the formal languages include the UIDs of the concepts. This enables that the expressions can be [[:​automated_translation|automatically translated]] from one formal language to another by Gellish enabled software. For example, queries in Formal Dutch, can be executed on databases that are in English, and the queries as well as the results can be presented to users in Dutch and vice versa. This is possible between any languages for which formal dictionaries are available. People implicitly make use of logic when interpreting natural languages. For example, we know that the concept car is a subtype of vehicle. Thus when we talk about a vehicle we know that that may be about a car. In order to enable software to draw similar conclusions when using formal languages it is necessary to include relations between concepts in the definition of the concepts in the formal language. Definitions of concepts therefore relate them to other concepts by standardized kinds of relations. In particular, each concept is defined as a subtype of one or more direct supertype concepts, thus forming a taxonomy of concepts. Gellish formal languages therefore includes an electronic [[:​gellish_english_dictionary|Taxonomic Dictionary]] that provide the concepts and terminology of the languages, as well as relations of particular kinds between the defined concepts, as far as required for the definition of the meaning of the concepts. Furthermore,​ every concept is identified by a natural language independent unique identifier (UID). Those UIDs represent the concepts independent of any language. Every concept can be denoted by various terms, names, codes or phrase, as are used by different language communities in different natural languages. Information that is expressed in one of the formal languages include the UIDs of the concepts. This enables that the expressions can be [[:​automated_translation|automatically translated]] from one formal language to another by Gellish enabled software. For example, queries in Formal Dutch, can be executed on databases that are in English, and the queries as well as the results can be presented to users in Dutch and vice versa. This is possible between any languages for which formal dictionaries are available.
  
-===== 2. Categories of kinds of relations ​(relation types) ​=====+===== 3. Categories of kinds of relations =====
  
-The expression power of formal languages is largely determined by the number and variety of kinds of relations that are available in the language definition. The Gellish formalized language definition includes ​about 1000 standard kinds of relations. They are defined in the upper ontology section of the Taxonomic Dictionary-Ontology. Their textual definitions and subtype-supertype hierarchy (taxonomy) is documented a computer readable file that itself is written in Gellish. The hierarchy is also available in printed form in the book [[http://​www.lulu.com/​shop/​andries-van-renssen/​taxonomic-dictionary-of-relations/​paperback/​product-22534783.html|Taxonomic Dictionary of Relations]] and in its Dutch equivalent [[http://​www.lulu.com/​shop/​andries-van-renssen/​taxonomisch-woordenboek-van-relaties/​paperback/​product-22534768.html|Taxonomisch Woordenboek van Relaties]]. The electronic version also includes ​definitions of the allowed roles and the allowed role players for the kinds of relations as well as a taxonomy of roles and of role players. Together that defines what are correct formal expressions and how expressions should be interpreted. The electronic language definition can be licensed, and can then be directly imported in Gellish enabled application systems to enable generation and computer ​interpretation of formal language expressions.+The expression power of formal languages is largely determined by the number and variety of kinds of relations ​(also called relation types) ​that are available in the language definition. The Gellish formalized language definition includes ​over 1000 standard kinds of relations. They are defined in the upper ontology section of the Taxonomic Dictionary-Ontology. Their textual definitions and subtype-supertype hierarchy (taxonomy) is documented a computer readable file that itself is written in Gellish. The hierarchy is also available in printed form in the book [[http://​www.lulu.com/​shop/​andries-van-renssen/​taxonomic-dictionary-of-relations/​paperback/​product-22534783.html|Taxonomic Dictionary of Relations]] and in its Dutch equivalent [[http://​www.lulu.com/​shop/​andries-van-renssen/​taxonomisch-woordenboek-van-relaties/​paperback/​product-22534768.html|Taxonomisch Woordenboek van Relaties]]. The definitions ​of kinds of relations also include specifications ​of the allowed roles and the allowed role players for the kinds of relations as well as a taxonomy of roles and of role players. Together that defines what are correct formal expressions and how expressions should be interpreted. ​\\ 
 +The electronic language definition can be directly imported in Gellish enabled application systems to enable ​searching for kinds of relations and for supporting the generation and interpretation of formal language expressions.
  
 +For creating high quality Gellish expressions skills are required in **'​Semantic Modeling'​**,​ which primarily includes expressing information in the form of collections of binary relations conform a consistent methodology. It also includes using the proper kinds of relations for expressing particular meanings. \\
 +The search for the proper kinds of relations is supported by two mechanisms: 1) The fact that kinds of relations are denoted by phrases that apply logical naming conventions and 2) The fact that the kinds of relations are arranged in a taxonomy. A method for finding the required kinds of relations is described further in [[http://​wiki.gellish.net/​gellish_english_dictionary|the taxonomic dictionary section of this wiki]].
  
-Each kind of relation ​(also called '​relation type') is identified by a unique identifier (Gellish UID). Furthermore,​ each relation type is denoted by at least one base phrase and by at least one inverse phrase. For example, a part-whole relation between two individual things is denoted in Formal English by the phrase <is a part of> and by the inverse phrases <has as part> and <is a whole for> ​and has UID 1260. This means that the same fact or idea can be expressed in either of the two ways. For example, the expression 'A <is a part of> B' has the same meaning as the expression 'B <has as part> A'. A relation ​type can also be denoted by alternative phrases, such as <is an assembly of>, or by phrases in other languages. Users may even define their own synonym phrases ​(provided that the UID remains the same) and they can indicate a '​language'​ and/or a '​language community'​ in which their phrase is preferred. For example: the Dutch (Nederlands) equivalent ​of the name of UID 1260 in the above example is: <is een deel van>.+=== 3.1 Phrases === 
 +Each kind of relation is denoted by at least one base phrase and by at least one inverse phrase. For example, a part-whole relation between two individual things is denoted in Formal English by the phrase <is a part of> and by the inverse phrases <has as part> and <is a whole for>. This means that the same statement ​or idea can be expressed in either of the two ways. For example, the expression 'A <is a part of> B' has the same meaning as the expression 'B <has as part> A'. A kind of relation can also be denoted by alternative phrases, such as <is an assembly of>, or by phrases in other languages. Users may even define their own synonym phrases and they can indicate a '​language'​ and/or a '​language community'​ in which their phrase is preferred. For example: the Dutch (Nederlands) equivalent ​base phrase for denoting ​the concept '​part-whole relation' ​in the above example is: <is een deel van>.
  
 +=== 3.2 Unique identifiers (UIDs) ===
 +Apart from the multiple phrases for denoting a kind of relation, each kind of relation is identified by a unique identifier (its Gellish UID). For example, the concept '​part-whole relation'​ has UID 1260. Thus all aliases, synonyms, abbreviations and translations of phrases that denote a particular kind of relation share the same UID. 
  
-The kinds of relations together form a taxonomy ​(a subtype-supertype hierarchy of kinds of relations) with the following branches:+=== 3.3 The taxonomy - the hierarchy of kinds of relations === 
 +The kinds of relations together form a taxonomy, being a subtype-supertype hierarchy of kinds of relations. This implies that all kinds of relations are subtypes of one concept, called **relation**. The first subtypes of '​relation'​ are the concepts //binary relation// and //higher order or variable order relation//. The binary relations have the following branches ​in the hierarchy:
  
  
-    - Kinds of relations between individual things +    - **Kinds of relations between individual things**. Expressions with such relations specify information about individual things. 
-    - Kinds of relations between an individual thing and a kind of thing +    - **Kinds of relations between an individual thing and a kind of thing**. Expressions with such relations mainly specify the nature or role of individual things, or they can be used for searching things of particular kinds. 
-    - Kinds of relations between kinds of things +    - **Kinds of relations between kinds of things**. Expressions with such relations specify knowledge about kinds, such as possibilities,​ definitions and requirements about kinds. 
-    - Kinds of relations between a single thing and a plurality +    - **Kinds of relations between a single thing and a collection**. Expressions with such relations specify for example which elements belong to which collections. 
-    - Kinds of relations between ​pluralities+    - **Kinds of relations between ​collections**. Expressions with such relations specify for example that a sub-collection is a partial collection of a larger collection.
  
  
-The kinds of relations include binary relations as well as higher ​order relations. Higher ​order relations include ​relations for modeling ​occurrences,​ such as processes, activities and events and relations for modeling ​correlations,​ such as for physical laws or geometric and mathematical formula.+Kinds of higher order relations include ​kinds that represent kinds of occurrences,​ such as processes, activities and events and kinds that represent ​correlations,​ such as for physical laws or geometric and mathematical formula. ​For specification of a higher order relation such as an occurrence, it is required to specify which and how objects are involved in the occurrences. This done by relating the occurrence to the involved objects by relations of kinds that are subtypes of binary //​involvement relation//. For example, a kind of activity is //​project//,​ and specifying an involvement of John as a manager of Project X is expressed as follows:​\\ 
 +  * John   //is manager of//   ​Project X
  
 +The definition of a [[http://​gellish.net/​index.php/​downloads/​file/​54-gellish-english.html|base collection of kinds of relations]] is free available in Gellish expression format in CSV via the download section of this website.
  
-For different application area's different kinds of relations are applicable.  +//Continue with:// ​[[http://wiki.gellish.net/​outline_of_gellish|Outline ​of Gellish]]
-The definition of a [[http://​gellish.net/​index.php/​downloads/​file/​54-gellish-english.html|core collection ​of kinds of relations]] is free available in Gellish expression format in CSV via the download section of this website.+
formalized_languages.txt · Last modified: 2018/11/12 16:23 by andries