Tool template
Jump to navigation
Jump to search
Contents
Synopsis
XAIRA : XML Aware Indexing and Retrieval Architecture
Features
- operates on large collections (1 to 1000 million tokens) of XML documents
- provides full range of linguistic analysis tools and methods
- platform independent indexer and server with documented API
- full-featured Microsoft windows client
- combines the structured search cabilities of e.g. exist or a rdbms with the free text capabilities of eg Lucene or a traditional concordancer
System requirements
Clients need a reasonable amount of memory (e.g. 640 Mb); indexer performs faster with more memory, but is primarily disk-bound. A corpus of x Mb will need 2x Mb of index, and 4x scratch space during indexing process.
Source code and licensing
GPL.
No fees.
Support for TEI
Supports TEI documents out of the box.
Language(s)
Written in C++, API is documented with dOxygen.
User manuals etc. in TEI (but still under development)
Tech support
Best endeavours... there is a mailing list on SourceForge
User community
Yes
Sample implementations
Current version number and date of release
1.23 Feb 2007
History of versions
(type in that information here)
How to download or buy
Visit http://xaira.sf.net