download25.com


Carabao Language Kit 1.2.0.0

A customizable language construction framework
Publisher: Digital Sonata Pty Ltd
Category: Language
Version: 1.2.0.0
License: freeware
Cost: $0
Size: carabaoFree.exe 93.92 MB
Updated: 04 Oct 2008
Review
Carabao is a family of multipurpose linguistic tools. It provides the following capabilities:

* Sense disambiguation
* Detailed, sentence by sentence domain extraction
* Deep morphological analysis and synthesis
* Automatic linguistic profiling
* Idiom extraction
* Universal measure conversion
* Transliteration between scripts
* Machine readability evaluation of texts
* Automatic translation between languages

The most distinctive feature of Carabao is that its engine is completely abstracted from any particular language. All the linguistic logic resides in a database, which comes with a powerful GUI data editor. Removing the linguistic logic from the source code achieves several goals:

* Separation of tasks between software developers and linguists
* Faster and more reliable development of new linguistic engines, without requiring the participation of IT people
* Ease of programmatic use and customization

Changelog:
Version 1.2.0.0 (07 Sep 2008)
FIXED
* Unknown patterns were translated as hypernyms
* Regression: certain category-based sequences were omitted on second execution because of a malfunctioning guess scan caching mechanism
* In analytical mode (Carabao DeepAnalyzer), the word index number did not match the idiom member index in sentences with attached tokens such as 'em or 'm
* When copying a token with one rule unit or fewer, the text was always reset to the original

ADDED
* Capability to match numbers as patterns
* When a translation is not found, the engine tries to fall back to a matching hypernym instead
* New methods in Carabao DeepAnalyzer for accessing the members of detected idioms
* New methods in Carabao CDA for accessing the unknown heuristics table
* New sequences
* Russian morphological exceptions
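
The hypernym fallback mentioned in the list above can be illustrated with a minimal sketch. This is not Carabao's API or database schema: the `TRANSLATIONS` and `HYPERNYMS` tables, the function name `translate_with_fallback`, and the `max_depth` limit are all hypothetical, chosen only to show the general idea of walking up to a more general term when no direct translation exists.

```python
from typing import Optional

# Hypothetical toy data (English to German); not Carabao's actual database.
TRANSLATIONS = {"dog": "Hund", "vehicle": "Fahrzeug"}
HYPERNYMS = {"poodle": "dog", "dog": "animal", "sedan": "car", "car": "vehicle"}


def translate_with_fallback(word: str, max_depth: int = 5) -> Optional[str]:
    """Return a translation of `word`, or of its nearest translatable hypernym."""
    current = word
    # Check the word itself, then climb at most `max_depth` hypernym links.
    for _ in range(max_depth + 1):
        if current in TRANSLATIONS:
            return TRANSLATIONS[current]
        parent = HYPERNYMS.get(current)
        if parent is None:
            return None  # no more general term is known; give up
        current = parent
    return None  # depth limit reached without finding a translation
```

Under these assumptions, "poodle" has no direct entry and is rendered through its hypernym "dog", while a word with no entry and no hypernym yields no translation at all.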

IMPROVED
* If an "unknown pattern" is forced to match a known word, no new guess is created when a guess with the same hypernym already exists.
For example, if you force a check of whether a known word can be a city, no new record is created if there is already a guess with a known city
* Automatic input language switching in locator fields
* Locator fields are pre-filled with the list of all existing languages in the database, eliminating the need to jump to the next language
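
The guess de-duplication rule in the first item of the list above can be sketched roughly as follows. The `Token` class, its `guesses` field, and the `force_guess` function are hypothetical names invented for illustration; the source does not describe how Carabao stores guesses internally.

```python
from dataclasses import dataclass, field
from typing import List, Tuple


@dataclass
class Token:
    """Hypothetical token holding (hypernym, origin) guess records."""
    text: str
    guesses: List[Tuple[str, str]] = field(default_factory=list)


def force_guess(token: Token, hypernym: str) -> bool:
    """Force a guess on a token; return True only if a new record was created."""
    # Skip creation when any existing guess already carries this hypernym.
    if any(existing == hypernym for existing, _ in token.guesses):
        return False
    token.guesses.append((hypernym, "forced"))
    return True
```

In this sketch, forcing a "city" check on a token that already has a "city" guess leaves the guess list unchanged, whereas forcing a different hypernym appends one new record.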
Version 1.1.0.1 (12 Mar 2008)
FIXED
* Crash when using sequence extraction option (regression from 1.1.0.0)

ADDED
* Capability to import sequences by data entry directly from the Sequence Sheet
* Capability to manually set sequence descriptions
* Some sequences

IMPROVED
* Further improvements to processing speed and memory consumption
* Token GUI

Version 1.1.0.0
FIXED
* Volatility of newly assigned rule units in late sequences
* Inconsistencies in the generation of inflected forms at design time

ADDED
* All (or nearly all) the Russian morphological exceptions - over 1,000 new prefixes
* Friendly GUI of meta-rules such as lemmatized forms and generation of inflected forms
* MorphoLogic now inspects the design time data generation meta-rules when generating inflected forms

IMPROVED
* Processing speed and memory consumption
* Increased maximum length of the meta-rule content field
* Increased the size of some fields to accommodate large sequences and large amounts of grammatical data

Version 1.0.0.3
FIXED
* Various tagging problems
* A bug in the priority setting of mid-sentence sequences

ADDED
* A button to tag new entries morphologically
* A handful of commonly used business entities (e.g., address, phone, fax, business hours)

IMPROVED
* Accuracy of sequences
* Domains

Version 1.0.0.2
FIXED
* Inflection generation problems of TagLemma results (words not in the dictionary) in Carabao MorphoLogic

ADDED
* Capability to inspect other guesses. For example, in a sequence like "adverb" + "adverb", it is possible to quickly scrap the entire sequence if the second adverb can also be a preposition
* Comprehensive morphology of the Russian language

IMPROVED
* Removed descriptions of negative constraint elements (those that do not have an identity) in sequences in order to make the descriptions less cluttered
* Performance of sequence processing
* Accuracy of sequences
* Domains reviewed

Version 1.0.0.1
FIXED
* Various validation problems with attached tokens
* Lookup windows are no longer maximized on opening
* Incorrect tooltips after deletion in the dictionary table

ADDED
* GUI support for negative constraints in sequences
* Handling of irregular 'smart quotes' in Translation Console
* Manual disambiguation table in Carabao Linguist Edition
* Style tags to the tooltips in the dictionary table

IMPROVED
* Supplied sequences
* In the translation console, the original thesaurus article is suppressed when the word is part of an idiom - to prevent confusion

Version 1.0.0.0
Carabao Language Kit has been released
Version: 1.0.0.1
Tags: Machine Translation | Nlp | Morphological Analysis | Dictionary | Thesaurus
