Tool template

Synopsis
XAIRA : XML Aware Indexing and Retrieval Architecture

Features

 * operates on large collections (1 to 1000 million tokens) of XML documents
 * provides full range of linguistic analysis tools and methods
 * platform independent indexer and server with documented API
 * full-featured Microsoft windows client
 * combines the structured search cabilities of e.g. exist or a rdbms with the free text capabilities of eg Lucene or a traditional concordancer

System requirements
Clients need a reasonable amount of memory (e.g. 640 Mb); indexer performs faster with more memory, but is primarily disk-bound. A corpus of x Mb will need 2x Mb of index, and 4x scratch space during indexing process.

Source code and licensing
GPL.

No fees.

Support for TEI
Supports TEI documents out of the box.

Language(s)
Written in C++, API is documented with dOxygen.

User manuals etc. in TEI (but still under development)

Tech support
Best endeavours... there is a mailing list on SourceForge

User community
Yes

Sample implementations
http://www.natcorp.ox.ac.uk

Current version number and date of release
1.23 Feb 2007

History of versions
(type in that information here)

How to download or buy
Visit http://xaira.sf.net