Content on the web is predominantly written in English, making it inaccessible to those who only speak other languages. Knowledge graphs can store multilingual information, facilitate the creation of multilingual applications, and make content accessible to multiple language communities.
This book, Multilinguality in Knowledge Graphs, presents studies which assess and improve the state of labels and languages in knowledge graphs and the application of multilingual information. The author proposes ways of using multilingual knowledge graphs to reduce the gaps in coverage between languages, and the book explores the current state of language distribution in knowledge graphs by developing a framework based on existing standards, frameworks, and guidelines to measure label and language distribution in knowledge graphs. Applying this framework to a dataset representing the web of data, and to Wikidata, both a lack of labeling on the web and a bias towards a small set of languages were found. The book explores how a knowledge of labels and languages can be used in the domain of answering questions, and demonstrates how the framework can be applied to the task of ranking and selecting knowledge graphs for a set of user questions. Transliteration and translation of knowledge graph labels and aliases are also covered, as is the automatic classification of labels into one or the other to train a model for each task.
The book provides a wide range of information on working with data and knowledge graphs in less-resourced languages.