User Tools

Site Tools


querying_a_gellish_english_database

This is an old revision of the document!


Querying a Gellish English database

1. Queries, questions and dialogues in Gellish English

Queries for conventional databases are usually expressed in a special query language, such as SQL.
Gellish does not require a special query language, because Gellish Queries are nearly identical to ordinary propositions. In a natural language there is no big difference between the structure of sentence that expresses a question and a sentence that expresses a proposition. That same principle is applied in Gellish English. If you know how to express facts in Gellish English, then you only need to know the few additional rules that are explained below in order to be able to write queries for a Gellish Database. As the response to a Gellish Query also consists of Gellish expressions, this implifies the creation of Software Agents and Distributed Gellish databases.

There are basically two kinds of questions:

  • Questions that ask for something that is unknown
  • Requests that ask for a response (a confirmation, a promise, etc.)

Both kinds of queries and the responses to them are discussed below.

2. Questions about unknown objects

A question about things that are unknown has as basic structure:

  • the basic question: which object(s) have a relation of a particular type with another object?
  • a specification of additional constraints that specify values for characteristics for the object(s) that are the result of the basic question.

For example, the basic question could be: ‘which objects are classified as a pump?’
In Gellish English such a question should be expressed as if it was a positive proposition about an individual object, while replacing the unknown object(s) name by 'what'. For example, the above query is expressed in Gellish English (ignoring the auxiliary facts) as is illustrated in Table 1:

Name of left hand objectName of relation typeName of right hand object
what is classified as a pump

Table 1, A simple Gellish English query
The example question asks for the object(s) that have a relation of type “is classified as a” with the concept “pump”. Instead of the term 'what' there are several other reserved terms to indicate an unknown object that is wanted to be identified and about which information is provided in one or more expressions. Gellish English has defined the following reserved terms:

  • what
  • who
  • which object
  • which aspect
  • which person
  • where
  • when

The use of any of these terms means that the expression should be interpreted by a computer as a question, whereas the term 'which aspect' means that not just a possible name of the aspect should be retrieved, but also it value and if applicable the scale for the value.
Full Gellish English expressions use unique identifiers (UID's) to identify objects. To identify objects (or collections of objects) that are unknown in full Gellish English the range of unique identifiers 1-100 is reserved. Thus, the use of a UID in that range means that the objects are unknown and are requested. The name of the unknown is then free, although the use of one or more of the above reserved terms is still recommended.

2.1 Querying results and inheritance

The query is in fact a template for the results. So the search engine should find all the facts that fit in the template. Thus, the result of a query will be a list of facts that satisfy the template.
If the query asks for objects that are classified by a concept that is defined in the Gellish English Dictionary, then the inheritance rules for the subtype-supertype hierarchy in the taxonomy imply that not only objects that are directly classified by that concept satisfy the question, but also objects that are classified by one of the subtypes of that concept. So, the question:
- what is classified as a pressure gauge?
means:
- what is classified as a pressure gauge or as one of its subtypes?
This means that the result of such a query is a list that includes not only things that are directly classified as pressure gauge, but also for example as a manometer, because according to the Gellish English Dictionary/Taxonomy a manometer is a subtype o a pressure gauge. So the result of the above query could be for example:

  • Pr-101 is classified as a pressure gauge
  • Pr-102 is classified as a manometer
  • Pr-103 is classified as a manometer

Software that enables Gellish English querying should therefore be able to automatically generate the answer as a list of things that are either classified by a concept or are classified as one of its subtypes (where the software should indicate when objects are classified more than once).

2.2 Specification of additional constraints

Additional constraints are expressed in a similar way as if it was an expression of the aspects of an individual object that is called ‘what’. For example:

Name of left hand objectName of relation typeName of right hand objectUoM
what is classified as a pump
what is classified as a capacity of pump
capacity of pump is classified as a capacity (volume flow rate)
capacity of pump has on scale a value grater than 30 dm3/s

Table 2, Example of a query with constraints

This example also illustrates that the query expresses a pattern that should be satisfied by the search engine, because if we would replace the term 'what' by the name of a pump, such as P-101, then the above expressions would have been a specification of information about pump P-101.

2.2.1 Denials

A specification may also include a denial. This means that things are search because they have some aspect, but because they do not have such an aspect. Such a denial can be expressed as a separate qualifier of the expression, called the ‘intention’ of the expression (see the paragraph on ‘Requests and Responses’ below) that indicates whether the statement is for the found things should be the case or not be the case. This is indicated by the intention ‘true’, ‘false’ (or ‘not’) or ‘questionable’ (or ‘maybe’).

2.2.2 Multiple unknowns

If more than one unknown object is included in a query, then the query should make a distinction between the different unknowns. This can be done by using one of the other reserved terms for unknown objects, but it is essentially done by giving them different unique identifiers (UID’s). The range of UID’s 1 – 100 are reserved for free usage for unknown queried objects in Gellish queries.

  • Note: Application software may hide the UID’s for users by adopting the convention to allow a dash and a sequence number to distinguish different objects and to translate that to a UID and a standard term.

For example, a query about the capacities of the found pumps would be identical to the above query, except that the name of the aspect ‘capacity of pump’ should be replaced by the term ‘which aspect’. However, in addition to that the unknown aspect shall also have a UID that differs from the UID of the unknown object. So the query would become:

UID of left hand objectName of left hand objectUID of factUID of relation typeName of relation typeUID of right hand objectName of right hand object
1 what 101 1225 is classified as a 130206 pump
1 what 102 1727 has aspect 2 which aspect
2 which aspect 103 1225 is classified as a 550318 capacity (volume fow rate)

Table 3, Example of a query with multiple unknowns

Such a query could result for example in the following answer:

  • P-1 is classified as a reciprocating pump
  • P-1 has aspect cap. of P-1
  • cap. of P-1 is classified as a capacity (volume flow rate)
  • cap. of P-1 has on scale a value equal to 45 dm3/s

2.2.3 Constraints about different kinds of objects

Some queries include constraints about different kinds of objects. For example, we may ask for pumps as well as compressors. In such a case the different unknown objects should be distinguished to enable to specify that one constraint is applicable to objects of one kind and the other constraint is applicable to objects of another kind. For example, if we ask for pumps as well as for compressors, then the result should be things that are classified either as a pump or as a compressor, and the query is not about objects that are classified both as a pump and as a compressor (such facts should not be present in a database). The distinction between the two objects can be made by giving them a different UID. The example query about pumps and compressors should therefore be expressed as follows (leaving away the other UID's):

UID of left hand objectName of left hand objectName of relation typeName of right hand object
1 what is classified as a pump
2 what is classified as a compressor

Table 4, Example of a query about different object types

3. Requests and responses

Questions sometimes have a structure that is difficult to distinguish from statements. In spoken language they are usually distinguished by the tone, the melody of the sentence. In written English, a question mark is then added. Gellish English supports such kinds of expressions by enabling the explicit expression of the ‘communication intent’ of the author of the expression. This enables to express various intentions, such as:

  • a question
  • a request
  • an answer
  • a confirmation
  • a denial
  • a promise
  • a declination
  • a statement
  • a probability
  • an acceptance
  • Note: The ‘allowed values’ for the intention can be found in the Gellish English dictionary as qualitative classes that are a qualification of ‘intention’ or one of its subtypes.

Such an ‘intention’ indicates whether or to what extent something is the case according to the author of the expression. In other words, it indicates the extent to which the relation expresses what is the case or it indicates the status of the process to become the case. This is called the “intention” of the proposition. So, an intention is a quality of an expression that thus can express not only that a proposition is a question, but it can also express that it is a confirmation, a probability, etc. For example, the flowing table contains a proposition with the intent to be a questions and the same proposition that is qualified as a confirmation:

Name of left hand objectIntentionName of relation typeName of right hand object
P-1 question is classified as a pump
P-1 confirmation is classified as a pump

Table 5, Examples of expressions about intentions of a proposition, without unknown objects
This means that basically the same proposition can be used for different purposes in a communication. This implies that the above two lines express two different opinions about the same fact.

4. Author, date and time of propositions

The ‘author’ and the moment of creation of a proposition such as a question or an answer can be added as auxiliary facts to a Gellish English expression. A Gellish Database table has dedicated columns for that: the date of creation of a fact (a date/time) and the originator (of the latest change). As there are many ways in which a date/time can be expressed, a Gellish Database uses the standard ‘[http://support.microsoft.com/kb/q180162/ 1900 date system]’ convention to make the date/time computer interpretable. A user interface may present it in any notation.
For example:

Name of left hand objectIntentionName of relation typeName of right hand objectDate of creation of fact (start of validity)Date of latest changeAuthor (of latest change)Source or reference
P-1 question is classified as a pump 20 Aug'07 20 Aug07 John Doe
P-1 confirmation is classified as a pump 20 Aug07 20 Aug}07 Andrew my database

Table 6, Example of a question and confirmation by an explit author at a particular date, without unknown objects

  • Note that these expressions are about the same fact. Therefore the fact identifier will be the same for these propositions, even if a statement is modified later, e.g. when at a later date the confirmation is changed into a denial.

Continue with Gellish Semantic Web

querying_a_gellish_english_database.1321643587.txt.gz · Last modified: 2017/11/15 11:05 (external edit)