Language is a set of terms (word forms, word groups, phrases) and rules that denote or encode objects, concepts and their transformations in time and space. Using these terms and grammatical rules the humans produce new codes - the natural (oral, written, or sign) speech to describe the surrounding reality to other humans.
The speech has two: the expression (roughly S-structure) and the content (roughly D-structure) planes. Content encodes the meaning. The speech content is described as a tree of word phrases - the Context Tree (syntactic or phrase structure tree).
This structure, which is universal for natural languages, encodes the meaning of speech. The universality is achieved not by using common abstract types in the Context Tree, but rather by including specific types from all languages. The Context Tree is not an abstract-universal, but rather a specific-cumulative structure. Such approach minimizes losses at representing the Content Tree build for a sentence in one language in another (translation) to a theoretical minimum.
The main goal of the book is to design data structures and algorithms for building the Context Tree. It touches also the difficulties of translation: encoding the meaning by Context Tree in one language and decoding it in another.
The design of the Context Tree that can represent natural speech in any language is examined using Eastern Armenian language. Morphological data and syntactic rules of the Eastern Armenian language described in tabular form and by algorithms: the wordforms generation, stemming, tagging, and lemmatization. These algorithms along with the dictionaries of morphemes allow delinearizing/linearizing sentences into/from Context Trees. They are the core for building various natural speech processing applications such as spell checking, translation, corpus texts tagging and indexing, etc.
The book is written in Eastern Armenian.
More detailed author's summary is available:
1. In English - https://aramhayr.wixsite.com/aram-hayrapetyan/en/post/on-syntactic-structure-representation
2. In Armenian - https://aramhayr.wixsite.com/aram-hayrapetyan/post/բնական-խոսքի-ընդհանրական-ներկայացման-մի-տարբերակի-մասին - contains list of typos
Details
- Publication Date
- Aug 19, 2022
- Language
- Armenian
- ISBN
- 9781387668359
- Category
- Science & Medicine
- Copyright
- All Rights Reserved - Standard Copyright License
- Contributors
- By (author): Aram Airapetian
Specifications
- Pages
- 324
- Binding Type
- Paperback Perfect Bound
- Interior Color
- Black & White
- Dimensions
- Executive (7 x 10 in / 178 x 254 mm)