NLGbAse is an information extraction system based on a structured information built from Wikipedia and wiki syntax. NLGbAse contains more than 2,7 millions multilingual entities. Those entities contains statistical and semantic informations. Those informations are exploited by information retrieval algorithms, virtually capable of unlimited facts extraction associated to each entity. The final objective of the project is to build a robust Natural Langage Generation system.
When ?
NLGbAse is in developpement since August 2008. First online version was launched on september 2008.
Who work on it?
eric charton : main author and conceptor of the project and its code
Ludovic Bonnefoy : Master Student, work on Q&A prototype (see here)
Romain Devaud : Master Student, work on Q&A prototype (see here)
Raphaël Rubino : Phd student, work on information extraction for medical translation applications
... And you ?
Who help us?
Scientific advisor and referee
Pr Juan Manuel Torres Moreno - Laboratoire Informatique d'Avignon University of Avignon
Infrastructure and organisation
Many thanks for
hosting and infrastructure help and specially to Dr
G.Linarès
Thanks for their contributions to students and members of following
organisations and institutions:
![]() |
![]() |
(c) Information :
Until to now, all NLGbAse parts (including tools and database) are free to use and redistribute, unless you keep (c) informations and cite the original provider (this site).
NLGbAse is built with mathematical algorythm's from xml files provided by Wikipedia the Free Encyclopedia.
There is no parts (texts, sentences, images) into generated redistribuable and downloadable NLGbAse content and tools concerned by the (c) policy of Wikipedia Fundation.
NLGbAse is (c) copyrighted by E. Charton, 2008, 2009.
Cache files that might be displayed on this site are extracted from the xml files provided by Wikipedia the Free Encyclopedia. Those texts are copyrighted and redistribuables under the following conditions.
The text contained in Wikipedia is copyrighted (automatically, under the Berne Convention) by Wikipedia contributors and licensed to the public under the GNU Free Documentation License (GFDL). The full text of this license is at Wikipedia:Text of the GNU Free Documentation License.
Website publisher:
This web site is hosted by courtesy in Laboratoire Informatique d'Avignon, from University of Avignon. Publisher of the site is Mr Eric Charton as individual. Applicable law for publication responsability is the french law.

