download25.com
Your free TOP 25 download source!Carabao Language Kit 1.2.0.0
A customizable language construction framework
|
||||||||||||||||||||||
|
Review Carabao is a family of multipurpose linguistic tools. It provides the following capabilities: * Sense disambiguation * Detailed, sentence by sentence domain extraction * Deep morphological analysis and synthesis * Automatic linguistic profiling * Idiom extraction * Universal measure conversion * Transliteration between scripts * Machine readability evaluation of texts * Automatic translation between languages The most distinctive feature of Carabao is its complete abstraction from the linguistic point of view. All the linguistic logic resides in a database complete with a powerful GUI data editor. By removing the linguistic logic from the source code, a few goals are achieved: * Separation of tasks between software developers and linguists * Faster and more reliable development of new linguistic engines which does not require participation of IT people * Ease of programmatic use and customization |
||||||||||||||||||||||
|
Changelog: Version: 1.2.0.0(07 Sep 2008) Version 1.2.0.0 FIXED * Unknown patterns were translated as hypernyms * Regression: certain category-based sequences were omitted on second execution because of a malfunctioning guess scan caching mechanism * In analytical mode (Carabao DeepAnalyzer), there was a mismatch between word index number and an idiom member index, in sentences with attached tokens such as 'em, 'm * When copying a token with 1 rule units or less, the text is always reset to the original ADDED * Capability to match numbers as patterns * When a translation is not found, the engine tries to fall back to a matching hypernym instead * New methods to Carabao DeepAnalyzer that enable accessing the members of the detected idioms * New methods to Carabao CDA that enable accessing the unknown heuristics table * New sequences * Russian morphological exceptions IMPROVED * If an "unknown pattern" is forced to match a known word, it will not create a new guess if a guess with a same hypernym already exists. For example, if you force to check, whether a known word can be a city, a new record will not be created, if there is already a guess with a known city * Automatic input language switching in locator fields * Locator fields are pre-filled with the list of all existing languages in the database, eliminating the need to jump to the next language Version: 1.1.0.1(12 Mar 2008) Version 1.1.0.1 FIXED * Crash when using sequence extraction option (regression from 1.1.0.0) ADDED * Capability to import sequences by data entry directly from the Sequence Sheet * Capability to manually set sequence descriptions * Some sequences IMPROVED * Processing speed and memory consumption - further boost * Token GUI Version 1.1.0.0 FIXED * Volatility of newly assigned rule units in late sequences * Inconsistencies in the generation of inflected forms in design time ADDED * All (or nearly all) the Russian morphological exceptions - over a 1,000 of new prefixes * Friendly GUI of meta-rules such as lemmatized forms and generation of inflected forms * MorphoLogic now inspects the design time data generation meta-rules when generating inflected forms IMPROVED * Processing speed and memory consumption * Increased maximum length of the meta-rule content field * Increased some fields to accommodate large sequences and a lot of grammatical data Version 1.0.0.3 FIXED * Various tagging problems * A bug with mid-sentence sequences priority setting ADDED * A button to tag new entries morphologically * A handful of commonly used business entities (e.g., address, phone, fax, business hours) IMPROVED * Accuracy of sequences * Domains Version 1.0.0.2 FIXED * Inflection generation problems of TagLemma results (words not in the dictionary) in Carabao MorphoLogic ADDED * Capability to inspect other guesses. For example, in a sequence like "adverb" + "adverb", it is possible to quickly scrap the entire sequenec if the second adverb can be a preposition * Comprehensive morphology of Russian language IMPROVED * Removed description of negative constraint elements (those that do not have an identity) in sequence in order to make the descriptions less cluttered * Performance of sequence processing * Accuracy of sequences * Domains reviewed Version 1.0.0.1 FIXED * Various validation problems with attached tokens * Lookup windows are no longer maximized on opening * Incorrect tooltips after deletion in the dictionary table ADDED * GUI support for negative constraints in sequences * Handling of irregular 'smart quotes' in Translation Console * Manual disambiguation table in Carabao Linguist Edition * Style tags to the tooltips in the dictionary table IMPROVED * Supplied sequences * In the translation console, the original thesaurus article is suppressed when the word is part of an idiom - to prevent confusion Version 1.0.0.0 Carabao Language Kit has been released Version: 1.0.0.1 Tags: Machine Translation | Nlp | Morphological Analysis | Dictionary | Thesaurus |
||||||||||||||||||||||
|
|
||||||||||||||||||||||