Standards and best practices for the Multilingual Web

Workshop on Linked Data, Language Technologies and Multilingual Content Analytics

Co-Chairs: Jorge Gracia (Universidad Politécnica de Madrid), Felix Sasaki (DFKI / W3C Fellow)

Following the success of the first LIDER roadmapping workshop in Athens, the LIDER roadmapping workshop is the second public event of the W3C Linked Data for Language Technology Community Group. The workshop will be structured around a keynote and a series of panels with the common topic Linked Data, Language Technologies and Multilingual Content Analytics.

On afternoon 8 May, the focus will be on applied text analytics and the usefulness of Wikipedia and Dbpedia for multilingual text and content analytics.

On morning 9 May, the focus will be on language resources and aspects of converting selected types of language resources into RDF.

Who should attend:

  • The afternoon 8 May session is of targeting everybody interested in multilingual aspects of Wikipedia / DBpedia and content analytics applications. No deep technical knowledge is needed to follow or to contribute to the discussion. The session will be held in parallel with breakout sessions of the MultilingualWeb workshop.
  • The morning 9 May session is mainly targeting the language resources and linked data communities. Discussions will dive into both technical aspects and community building aspects of migrating language resources into RDF.


Session 1 (8th May afternoon, 13:30 - 17:45) "Content Analytics and Wikipedia"

  • 13:30 Welcome and introduction
  • 13:45 Keynote presentation by Seth Grimes: Text Analytics Applied
  • 14:15 Panel 1 on "Using Wikipedia for multilingual web content analytics across 287 languages". Participants:
    • Pau Giner (Moderator)
    • Amir Aharoni
    • Alolita Sharma
  • 15:00 coffee break
  • 15:30 Panel 2 on "Growing Wikipedia editing with intelligent multi-language suggestion lists for article translation as well as other techniques and tools". Participants:
    • Runa Bhattacharjee (Moderator)
    • Pau Giner
    • Santhosh Thottingal
  • 16:15 Requirements gathering about wikipedia/dbpedia-based content analytics
  • 17:00 Report to the main MultilingualWeb workshop audience

Session 2 (9th May morning, 9:00 - 12:45) "Linked Data and Language Technologies"

  • 9:00 - 9:10 Introduction and goals of the second day.
  • 9:10 - 9:45 Briefing of initiatives and community groups related to Linked Data and Language Technologies. Chair: Felix Sasaki. Speakers:
  • 9:45 - 10:45 Panel 3 on "Data and Metadata of Language Resources as Linked Data on the Web”. Chair: Jorge Gracia. Speakers:
    • Christian Chiarcos (Goethe-University Frankfurt am Main, Germany): statement
    • Philipp Cimiano (University of Bielefeld, Germany): statement
    • Roberto Navigli (Sapienza University of Rome, Italy): statement
    • Stelios Piperidis (ILSP-Athena RC, Greece): statement
    • Marta Villegas (Universitat Pompeu Fabra, Spain): statement
  • 10:45 - 11:10 Coffee break
  • 11:10 - 12:00 Panel 4 on "Multilingual Corpus transformation into Linked Data". Chair: Philipp Cimiano. Speakers:
    • Martin Brümmer (University of Leipzig, Germany): statement
    • Thierry Declerck (DFKI, Germany): statement
    • Roberto Navigli (Sapienza University of Rome, Italy): statement
    • Laurette Pretorius (University of South Africa): statement
    • Felix Sasaki (DFKI / W3C Fellow): statement
  • 12:00 - 12:30 Requirements gathering about language resources data/metadata transformation and use cases discussion
  • 12:30 - 12:45 Wrap-up

How Can I Participate and contribute?

Participation is free. Please register for the event. For the 9 May morning session we welcome position statements related to the topic of language resource RDF conversion. If you are interested in contributing a position statement please indicate this in the dedicated field in the registration form. The workshop organizers will come back to you with questions to answer in the position statement. We then will select statements and inform you by 28 April.

The event is supported by the LIDER EU project, the MultilingualWeb community, the NLP2RDF project, the Working Group for Open Data in Linguistics as well as the DBpedia Project.