Difference between revisions of "Wiki2TEI"

From TEIWiki
Jump to navigation Jump to search
m (Links: whoops, + categories)
Line 1: Line 1:
<!-- Wikimedia wiki (which version?), or is there some standard for wiki syntax? -->
+
[[Category:Tools]]
'''Wiki2TEI''' is a tool that converts wiki pages to TEI format. It is written in PHP and (will be) released under the BSD License.
+
[[Category:Conversion and preprocessing tools]]
 +
 
 +
== Synopsis ==
 +
 
 +
The mediawiki format is used by wikimedia fundation wikis (Wikipedia, Wikibooks, Wikisource), and many other wikis using the mediawiki software. Large amounts of free hight-quality structured texts are available in this format. These texts are used more and more often in NLP (natural language processing) projects. However, the mediawiki parser is oriented towards rendition and the mediawiki syntax is complex and hard to parse.
 +
 +
The Wiki2Tei converter makes available the information contained in wiki syntax (structuration, highlighting, etc.), and allows to properly retrieve the plain text. This conversion is intended to preserve all the properties of the original text. Wiki2Tei is closely coupled with the mediawiki software, allowing to convert all the features of the mediawiki syntax.
 +
 
 +
== Features ==
 +
* Tools for converting from mediawiki database or from collection of files
 +
* Tools for checking well formedness and validation
 +
* documentation of the vocabulary used in an ODD document
 +
* work with mediawiki 1.5 software;
 +
 
 +
== System requirements ==
 +
 
 +
You need to set up a mediawiki software, that is:
 +
- a mysql server
 +
- a php 5 interpreter.
 +
- some third parties tools are needed for specific task (openjade, xsltproc)
 +
 
 +
== Source code and licensing ==
 +
 
 +
This software is released under a BSD licence.
 +
 
 +
== Support for TEI ==
 +
 
 +
 
 +
== Language(s) ==
 +
 
 +
This converter use PHP 5 language only.
 +
 
 +
== Documentation ==
 +
 
 +
Documentation is available online:
 +
 
 +
    http://wiki2tei.sourceforge.net/
 +
    http://wiki2tei.sourceforge.net/Wiki2TeiHelp.html
 +
 
 +
== Tech support ==
  
Currently (October '07) in the beta stage, no files released.
+
A technical support is provided through the following mailing list:
  
==== Links ====
+
    https://lists.sourceforge.net/lists/listinfo/wiki2tei-users
* Project home page: http://sourceforge.net/projects/wiki2tei/
 
* [http://wiki2tei.sourceforge.net/demo/ Online demo]
 
  
[[Category: Tools]]
+
== User community ==
[[Category:Conversion and preprocessing tools]]
+
 
{{stub}}
+
The community tools of the SourceForge web site may be used.
 +
 
 +
== Sample implementations ==
 +
 
 +
    http://wiki2tei.sourceforge.net/demo/
 +
 
 +
== Current version number and date of release ==
 +
 
 +
Version 1.0, release 10-10-2007.
 +
 
 +
== History of versions ==
 +
 
 +
 
 +
== How to download or buy ==
 +
 
 +
    http://sourceforge.net/project/showfiles.php?group_id=198407

Revision as of 13:54, 11 October 2007


Synopsis

The mediawiki format is used by wikimedia fundation wikis (Wikipedia, Wikibooks, Wikisource), and many other wikis using the mediawiki software. Large amounts of free hight-quality structured texts are available in this format. These texts are used more and more often in NLP (natural language processing) projects. However, the mediawiki parser is oriented towards rendition and the mediawiki syntax is complex and hard to parse.

The Wiki2Tei converter makes available the information contained in wiki syntax (structuration, highlighting, etc.), and allows to properly retrieve the plain text. This conversion is intended to preserve all the properties of the original text. Wiki2Tei is closely coupled with the mediawiki software, allowing to convert all the features of the mediawiki syntax.

Features

  • Tools for converting from mediawiki database or from collection of files
  • Tools for checking well formedness and validation
  • documentation of the vocabulary used in an ODD document
  • work with mediawiki 1.5 software;

System requirements

You need to set up a mediawiki software, that is: - a mysql server - a php 5 interpreter. - some third parties tools are needed for specific task (openjade, xsltproc)

Source code and licensing

This software is released under a BSD licence.

Support for TEI

Language(s)

This converter use PHP 5 language only.

Documentation

Documentation is available online:

   http://wiki2tei.sourceforge.net/
   http://wiki2tei.sourceforge.net/Wiki2TeiHelp.html

Tech support

A technical support is provided through the following mailing list:

   https://lists.sourceforge.net/lists/listinfo/wiki2tei-users

User community

The community tools of the SourceForge web site may be used.

Sample implementations

   http://wiki2tei.sourceforge.net/demo/

Current version number and date of release

Version 1.0, release 10-10-2007.

History of versions

How to download or buy

   http://sourceforge.net/project/showfiles.php?group_id=198407