Cross-Language Knowledge Induction Workshop
International Workshop
held as part of the
EuroLAN 2005 Summer
School
August 2-4, 2005
Editors: Diana Inkpen
and Carlo Strapparava
Papers
Automatic Translation of WordNet Glosses
Jesus Gimenez, Lluís Màrquez, and German Rigau
PP Attachment for Basque Based on English parses
Eneko Agirre, Aitziber Atutxa, Koldo
Gojenola, Kepa Sarasola, and David Terrón
A Text Processing Tool for the Romanian Language
Oana Frunza, Diana Inkpen, and David Nadeau
To Cause Or Not To Cause: Cross-Lingual Semantic Matching for Paraphrase Modelling
Sebastian Padó and Katrin Erk
WSD Using English-Spanish Aligned Phrases over Comparable Corpora
David Fernández-Amorós
4LEX: a
Multilingual Lexical Resource
Cross-Language Transfer of Syntactic Relations Using Parallel Corpora
Verginica Barbu Mititelu and Radu Ion
Literality Based Sample Sorting for Syntax Projection
Bruno Cavestro and Nicola Cancedda
Knowledge of words and
text behavior in other languages has recently been used to help solving tasks
in a first language. An example of such a task is word sense disambiguation by
using translations in a second language. Another example is verb classification
by studying properties of verbs in several languages.
A second modality of
knowledge transfer across languages is to take advantage of resources already
built for English and for a few other resource-rich languages. These resources
have been used to induce knowledge in languages for which few linguistic
resources are available. This was made possible by the wider availability of
parallel corpora (better alignment methods at paragraph, sentence, and word
level). Examples of knowledge induction tasks are: learning morphology,
part-of-speech tags, and grammatical gender. The development of wordnets for
many languages used as a starting point knowledge transfer from the Princeton
WordNet.
This workshop provides a
forum for discussion between leading names in the field and researchers
involved in cross-language applications. Topics of interest include, but are
not limited to: applications that exploit parallel corpora (aligned at paragraph,
sentence, or word level); induction of knowledge from a language for which
resources are abundant to another language for which fewer resources are
available; using other languages to solve a task in a first language (word-sense
disambiguation by using translations in other languages, verb classification by
studying verb properties in several languages, and other tasks of this kind); identifying
and using cognate words between languages; building wordnets by knowledge
transfer; and exploiting multi-language wordnets for NLP applications.
We would like to thank
all the authors who submitted papers for the hard work that went behind their
submissions. We express our deepest gratitude to the committee members for
their thorough reviews. We also thank the EuroLAN 2005 organizers for their
help with administrative matters.
Diana Inkpen
Carlo Strapparava
Program Committee
Eneko
Agirre (University of the Basque Country, Donostia-San Sebastián,
Paul
Buitelaar (DFKI,
Silviu
Cucerzan (Microsoft
Mona Diab (Columbia University, US)
Lluís Màrquez (Universitat
Politècnica de Catalunya, Barcelona, Spain)
Joel
Martin (National Research Council of
Rada
Mihalcea (
Viviana
Nastase (
Ted
Pedersen (
Emanuele Pianta (ITC-IRST, Povo-Trento, Italy)
Philip
Resnik (
Laurent
Romary (LORIA,
Michel
Simard (Xerox Research Centre Europe,
Suzanne
Stevenson (
Doina
Tătar (
Amalia
Todiraşcu (Université Marc Bloch,
Dan
Tufiş (
Nikolai
Vazov (
Organizing committee
Diana
Inkpen (
Carlo Strapparava (ITC-IRST,
Povo-Trento, Italy)