Tool template

From TEIWiki
Revision as of 17:30, 2 February 2007 by Lou (talk | contribs)
Jump to navigation Jump to search


Synopsis

XAIRA : XML Aware Indexing and Retrieval Architecture

Features

  • operates on large collections (1 to 1000 million tokens) of XML documents
  • provides full range of linguistic analysis tools and methods
  • platform independent indexer and server with documented API
  • full-featured Microsoft windows client
  • combines the structured search cabilities of e.g. exist or a rdbms with the free text capabilities of eg Lucene or a traditional concordancer

System requirements

Clients need a reasonable amount of memory (e.g. 640 Mb); indexer performs faster with more memory, but is primarily disk-bound. A corpus of x Mb will need 2x Mb of index, and 4x scratch space during indexing process.

Source code and licensing

GPL.

No fees.

Support for TEI

Supports TEI documents out of the box.


Language(s)

Written in C++, API is documented with dOxygen.

User manuals etc. in TEI (but still under development)

Tech support

Best endeavours... there is a mailing list on SourceForge

User community

Yes

Sample implementations

http://www.natcorp.ox.ac.uk


Current version number and date of release

1.23 Feb 2007

History of versions

(type in that information here)

How to download or buy

Visit http://xaira.sf.net


Additional notes