Print

Abstract of Sebastian Hellmann's talk

Challenges, Approaches and Future Work for Linguistic Linked Open Data (LLOD)
While the Linguistic Linked Open Data (LLOD) Cloud (http://linguistic-lod.org/) has evolved beyond expectations - thanks to the effort of a vibrant community - overall progress has to be seen under a more scrutinizing light.
Initial challenges which have been formulated by Christian Chiarcos, Sebastian Nordhoff and me as early as 2011[1][2] have been discussed extensively in the LDL, MLODE and NLP & DBpedia workshop series and in several W3C community groups. In particular, the LIDER FP7 project (http://www.lider-project.eu/) - originally conceived to tackle these challenges and build a Linguistic Linked Open Data Cloud - rather gave them more shape and uncovered that there is yet quite a long road ahead to solve problems such as proper metadata, contextualisation of knowledge, data quality, hosting, open licensing and provenance, timely updated network links, knowledge integration and interoperability on the largest possible scale - the Web.
The invited talk attempts to give a full account of these abovementioned challenges and presents and critically evaluates pertinent efforts and approaches including evolving standards such as the NLP Interchange Format (NIF)[3][4], DataID[5], SHACL[6], lemon[7] and the LIDER guidelines[8] as well as practical services such as LingHub[9], LODVader[10], RDFUnit[11] (just to mention a few).
As a glimmer of hope, the talk will conclude with the recent efforts of the DBpedia community to coordinate the creation of a public data infrastructure for a large, multilingual, semantic knowledge graph, which is, of course, not a panacean golden hammer, but a potential step in the right direction to bridge the gap between language and knowledge.
________________
[1] Towards a Linguistic Linked Open Data cloud : The Open Linguistics Working Group (http://www.atala.org/IMG/pdf/Chiarcos-TAL52-3.pdf ) Christian Chiarcos, Sebastian Hellmann, and Sebastian Nordhoff. TAL 52(3):245 - 275 (2011)
[2] Linked Data in Linguistics. Representing Language Data and Metadata (http://www.springer.com/computer/ai/book/978-3-642-28248-5 ) Christian Chiarcos, Sebastian Nordhoff, and Sebastian Hellmann (Eds.). Springer, Heidelberg, (2012)
[3] http://persistence.uni-leipzig.org/nlp2rdf/ontologies/nif-core
[4] https://www.w3.org/community/ld4lt/
[5] http://wiki.dbpedia.org/projects/dbpedia-dataid
[6] http://w3c.github.io/data-shapes/shacl/
[7] https://www.w3.org/2016/05/ontolex/
[8] http://www.lider-project.eu/guidelines
[9] http://linghub.lider-project.eu/
[10] http://lodvader.aksw.org/
[11] http://aksw.org/Projects/RDFUnit


Last updated by: Hanne Erdman Thomsen 02/06/2016