Embedding Knowledge Graphs with RDF2vec

Embedding Knowledge Graphs with RDF2vec PDF

Author: Heiko Paulheim

Publisher: Springer Nature

Published: 2023-06-03

Total Pages: 165

ISBN-13: 3031303873

DOWNLOAD EBOOK →

This book explains the ideas behind one of the most well-known methods for knowledge graph embedding of transformations to compute vector representations from a graph, known as RDF2vec. The authors describe its usage in practice, from reusing pre-trained knowledge graph embeddings to training tailored vectors for a knowledge graph at hand. They also demonstrate different extensions of RDF2vec and how they affect not only the downstream performance, but also the expressivity of the resulting vector representation, and analyze the resulting vector spaces and the semantic properties they encode.

Knowledge Graphs

Knowledge Graphs PDF

Author: Aidan Hogan

Publisher: Morgan & Claypool Publishers

Published: 2021-11-08

Total Pages: 257

ISBN-13: 1636392369

DOWNLOAD EBOOK →

This book provides a comprehensive and accessible introduction to knowledge graphs, which have recently garnered notable attention from both industry and academia. Knowledge graphs are founded on the principle of applying a graph-based abstraction to data, and are now broadly deployed in scenarios that require integrating and extracting value from multiple, diverse sources of data at large scale. The book defines knowledge graphs and provides a high-level overview of how they are used. It presents and contrasts popular graph models that are commonly used to represent data as graphs, and the languages by which they can be queried before describing how the resulting data graph can be enhanced with notions of schema, identity, and context. The book discusses how ontologies and rules can be used to encode knowledge as well as how inductive techniques—based on statistics, graph analytics, machine learning, etc.—can be used to encode and extract knowledge. It covers techniques for the creation, enrichment, assessment, and refinement of knowledge graphs and surveys recent open and enterprise knowledge graphs and the industries or applications within which they have been most widely adopted. The book closes by discussing the current limitations and future directions along which knowledge graphs are likely to evolve. This book is aimed at students, researchers, and practitioners who wish to learn more about knowledge graphs and how they facilitate extracting value from diverse data at large scale. To make the book accessible for newcomers, running examples and graphical notation are used throughout. Formal definitions and extensive references are also provided for those who opt to delve more deeply into specific topics.

The Semantic Web – ISWC 2016

The Semantic Web – ISWC 2016 PDF

Author: Paul Groth

Publisher: Springer

Published: 2016-10-05

Total Pages: 698

ISBN-13: 3319465236

DOWNLOAD EBOOK →

The two-volume set LNCS 9981 and 9982 constitutes the refereed proceedings of the 15th International Semantic Web Conference, ISWC 2016, which was held in Kobe, Japan, in October 2016. The 75 full papers presented in these proceedings were carefully reviewed and selected from 326 submissions. The International Semantic Web Conference is the premier forum for Semantic Web research, where cutting edge scientific results and technological innovations are presented, where problems and solutions are discussed, and where the future of this vision is being developed. It brings together specialists in fields such as artificial intelligence, databases, social networks, distributed computing, Web engineering, information systems, human-computer interaction, natural language processing, and the social sciences. The Research Track solicited novel and significant research contributions addressing theoretical, analytical, empirical, and practical aspects of the Semantic Web. The Applications Track solicited submissions exploring the benefits and challenges of applying semantic technologies in concrete, practical applications, in contexts ranging from industry to government and science. The newly introduced Resources Track sought submissions providing a concise and clear description of a resource and its (expected) usage. Traditional resources include ontologies, vocabularies, datasets, benchmarks and replication studies, services and software. Besides more established types of resources, the track solicited submissions of new types of resources such as ontology design patterns, crowdsourcing task designs, workflows, methodologies, and protocols and measures.

Entity Matching and Disambiguation Across Multiple Knowledge Graphs

Entity Matching and Disambiguation Across Multiple Knowledge Graphs PDF

Author: Michael Farag

Publisher:

Published: 2019

Total Pages:

ISBN-13:

DOWNLOAD EBOOK →

Knowledge graphs are considered an important representation that lie between free text on one hand and fully-structured relational data on the other. Knowledge graphs are a back-bone of many applications on the Web. With the rise of many large-scale open-domain knowledge graphs like Freebase, DBpedia, and Yago, various applications including document retrieval, question answering, and data integration have been relying on them. In this thesis, We are primarily interested in knowledge graphs from the perspective of integrating disparate heterogeneous sources, with an eye towards applications such as document retrieval and question answering. Integrating different knowledge graphs is very important for enriching the knowledge shared among them. The core part of this integration process is matching entities across the knowledge graphs. The biggest challenge to entity matching is the ambiguity. The obvious solution is to make use of the graph structure and entity neighbourhoods for matching and disambiguating entities. We formalize the entity matching problem and present the rst large-scale dataset, Ambiguous DBpedia-Wikidata, for this task based on exiting cross-ontology links between DBpedia and Wikidata, focused on several hundred thousand ambiguous entities. We propose an entity matching framework that is capable of disambiguating entities across different knowledge graphs. The framework consists of fuzzy string matcher and graph embedding-based matcher. Using a classifi cation-based approach, we find that a simple multi-layered perceptron based on representations derived from RDF2VEC graph embeddings of entities in each knowledge graph is sufficient to achieve high accuracy, with only limited training data. The contribution of our work is both a large dataset for examining this problem and strong baselines on which future work can be based. We also present SimpleDBpediaQA, a new benchmark dataset for simple question answering over knowledge graphs that was created by mapping SimpleQuestions entities and predicates from Freebase to DBpedia. We show how entity matching using manual annotations can be used for migrating datasets across knowledge graphs. Although this mapping is conceptually straightforward, there are a number of nuances that make the task non-trivial, owing to the different conceptual organizations of the two knowledge graphs. Finally, if manual annotations are scarce, we show how our entity matching framework can be used to generate free annotations to train our model and then use it for disambiguation. In that essence, we introduce SimpleQuestions++, a new question answering benchmark that have all questions linked to Freebase, DBpedia, and Wikidata.

Reproducing and Explaining Entity and Relation Embeddings for Link Prediction in Knowledge Graphs

Reproducing and Explaining Entity and Relation Embeddings for Link Prediction in Knowledge Graphs PDF

Author: Narayanan Asuri Krishnan

Publisher:

Published: 2021

Total Pages: 61

ISBN-13:

DOWNLOAD EBOOK →

"Embedding knowledge graphs is a common method used to encode information from the graph at hand projected in a low dimensional space. There are two shortcomings in the field of knowledge graph embeddings for link prediction. The first shortcoming is that, as far as we know, current software libraries to compute knowledge graph embeddings differ from the original papers proposing these embeddings. Certain implementations are faithful to the original papers, while others range from minute differences to significant variations. Due to these implementation variations, it is difficult to compare the same algorithm from multiple libraries and also affects our ability to reproduce results. In this report, we describe a new framework, AugmentedKGE (aKGE), to embed knowledge graphs. The library features multiple knowledge graph embedding algorithms, a rank-based evaluator, and is developed completely using Python and PyTorch. The second shortcoming is that, during the evaluation process of link prediction, the goal is to rank based on scores a positive triple over a (typically large) number of negative triples. Accuracy metrics used in the evaluation of link prediction are aggregations of the ranks of the positive triples under evaluation and do not typically provide enough details as to why a number of negative triples are ranked higher than their positive counterparts. Providing explanations to these triples aids in understanding the results of the link predictions based on knowledge graph embeddings. Current approaches mainly focus on explaining embeddings rather than predictions and single predictions rather than all the link predictions made by the embeddings of a certain knowledge graph. In this report, we present an approach to explain all these predictions by providing two metrics that serve to quantify and compare the explainability of different embeddings. From the results of evaluating aKGE, we observe that the accuracy metrics are better than the accuracy metrics obtained from the standard implementation of OpenKE. From the results of explainability, we observe that the horn rules obtained explain more than 50% of all the negative triples generated."--Abstract.

Exploiting Semantic Web Knowledge Graphs in Data Mining

Exploiting Semantic Web Knowledge Graphs in Data Mining PDF

Author: P. Ristoski

Publisher: IOS Press

Published: 2019-06-28

Total Pages: 246

ISBN-13: 1614999813

DOWNLOAD EBOOK →

Data Mining and Knowledge Discovery in Databases (KDD) is a research field concerned with deriving higher-level insights from data. The tasks performed in this field are knowledge intensive and can benefit from additional knowledge from various sources, so many approaches have been proposed that combine Semantic Web data with the data mining and knowledge discovery process. This book, Exploiting Semantic Web Knowledge Graphs in Data Mining, aims to show that Semantic Web knowledge graphs are useful for generating valuable data mining features that can be used in various data mining tasks. In Part I, Mining Semantic Web Knowledge Graphs, the author evaluates unsupervised feature generation strategies from types and relations in knowledge graphs used in different data mining tasks such as classification, regression, and outlier detection. Part II, Semantic Web Knowledge Graphs Embeddings, proposes an approach that circumvents the shortcomings introduced with the approaches in Part I, developing an approach that is able to embed complete Semantic Web knowledge graphs in a low dimensional feature space where each entity and relation in the knowledge graph is represented as a numerical vector. Finally, Part III, Applications of Semantic Web Knowledge Graphs, describes a list of applications that exploit Semantic Web knowledge graphs like classification and regression, showing that the approaches developed in Part I and Part II can be used in applications in various domains. The book will be of interest to all those working in the field of data mining and KDD.

Representation Learning Based Query Answering on Knowledge Graphs

Representation Learning Based Query Answering on Knowledge Graphs PDF

Author: Xuelu Chen

Publisher:

Published: 2021

Total Pages: 117

ISBN-13:

DOWNLOAD EBOOK →

Knowledge graphs provide structured representations of facts about real-world entities and relations, serving as a vital knowledge source for numerous artificial intelligence applications. This dissertation seeks to extend the scope and provide theoretical guidance for representation learning based query answering on knowledge graphs. The incompleteness of knowledge graphs has recently motivated the use of representation learning models in recent years to generalize from known facts and infer new knowledge for query answering. Despite advances in answering atomic queries by representing deterministic facts within a monolingual knowledge graph, existing models must overcome the following three challenges: (i) they must address the need to incorporate uncertainty information into query answering, which is critical to many knowledge-driven applications; (ii) they must effectively leverage complementary knowledge from knowledge graphs in different languages; (iii) they must be able to embed complex first-order logical queries.In this dissertation, we address the aforementioned challenges and extend the scope of query answering on knowledge graphs through contributions on the following three fronts: (i) To capture fact uncertainty and support reasoning under uncertainty, we propose two knowledge graph embedding models that are capable of encoding uncertain facts in the embedding space. Our proposed models thus learn entity and relation embeddings according to the confidence scores of uncertain facts. We introduce probabilistic soft logic to infer the confidence score to provide extra supervision for training. We also explore using box embeddings to embed uncertain knowledge graphs and imposing relation property constraints to enhance performance on sparse uncertain knowledge graphs. (ii) To effectively combine knowledge graphs in different languages, we introduce an ensemble learning framework that embeds all knowledge graphs in a shared embedding space, where the association of entities is captured based on self-learning. The framework performs ensemble inference to combine prediction results from embeddings of multiple language-specific knowledge graphs, for which multiple ensemble techniques are investigated. (iii) To support answering complex first-order logical queries, we present a query embedding framework based on fuzzy logic that allows us to define logical operators in a principled and learning-free manner, whereby learn- ing is only required for entity and relation embeddings. The proposed model can further benefit when complex logical queries are available for training. As a result of this research we were able to identify some of the desirable properties that embedding models ought to possess and analyze which of the existing models have these properties. Therefore, the results presented in this dissertation advance the state-of-the-art of query answering on knowledge graphs along different axes and provide conceptual guidance for future research in this field.

The Semantic Web: ESWC 2022 Satellite Events

The Semantic Web: ESWC 2022 Satellite Events PDF

Author: Paul Groth

Publisher: Springer Nature

Published: 2022-07-19

Total Pages: 332

ISBN-13: 3031116097

DOWNLOAD EBOOK →

This book constitutes the proceedings of the satellite events held at the 19th Extended Semantic Web Conference, ESWC 2022, during May—June in Hersonissos, Greece, 2022. The included satellite events are: the poster and demo session; the PhD symposium; industry track; project networking; workshops and tutorials. During ESWC 2022, the following ten workshops took place:10th Linked Data in Architecture and Construction Workshop (LDAC 2022); 5th International Workshop on Geospatial Linked Data (GeoLD 2022); 5th Workshop on Semantic Web solutions for large-scale biomedical data analytics (SeMWeBMeDA 2022); 7th Natural Language Interfaces for the Web of Data (NLIWOD+QALD 2022); International Workshop on Knowledge Graph Generation from Text (Text2KG 2022); 3rd International Workshop on Deep Learning meets Ontologies and Natural Language Processing (DeepOntoNLP 2022); 1st Workshop on Modular Knowledge (ModularK 2022); Third International Workshop On Knowledge Graph Construction (KGCW 2022); Third International Workshop On Semantic Digital Twins (SeDIT 2022); and the 1st International Workshop on Semantic Industrial Information Modelling (SemIIM 2022).

The Semantic Web: ESWC 2020 Satellite Events

The Semantic Web: ESWC 2020 Satellite Events PDF

Author: Andreas Harth

Publisher: Springer Nature

Published: 2020-11-10

Total Pages: 326

ISBN-13: 3030623270

DOWNLOAD EBOOK →

Chapter “ABECTO: An ABox Evaluation and Comparison Tool for Ontologies” is available open access under a Creative Commons Attribution 4.0 International License via link.springer.com.