Knowledge Graph Fusion for Language Model Fine-Tuning

Nimesh Bhana, Terence L. Van Zyl

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

1 Citation (Scopus)

Abstract

Language Models such as BERT (Bidirectional Encoder Representations from Transformers) have grown in popularity due to their ability to be pre-trained and perform robustly on a wide range of Natural Language Processing tasks. Often seen as an evolution over traditional word embedding techniques, they can produce semantic representations of text, useful for tasks such as semantic similarity. However, state-of-the-art models often have high computational requirements and lack global context or domain knowledge which is required for complete language understanding. To address these limitations, we investigate the benefits of knowledge incorporation into the fine-tuning stages of BERT. An existing K-BERT model, which enriches sentences with triplets from a Knowledge Graph, is adapted for the English language and extended to inject contextually relevant information into sentences. As a side-effect, changes made to K-BERT for accommodating the English language also extend to other word-based languages. Experiments conducted indicate that injected knowledge introduces noise. We see statistically significant improvements for knowledge-driven tasks when this noise is minimised. We show evidence that, given the appropriate task, modest injection with relevant, high-quality knowledge is most performant.

Original languageEnglish
Title of host publication2022 9th International Conference on Soft Computing and Machine Intelligence, ISCMI 2022
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages167-172
Number of pages6
ISBN (Electronic)9798350320886
DOIs
Publication statusPublished - 2022
Event9th International Conference on Soft Computing and Machine Intelligence, ISCMI 2022 - Toronto, Canada
Duration: 26 Nov 202227 Nov 2022

Publication series

Name2022 9th International Conference on Soft Computing and Machine Intelligence, ISCMI 2022

Conference

Conference9th International Conference on Soft Computing and Machine Intelligence, ISCMI 2022
Country/TerritoryCanada
CityToronto
Period26/11/2227/11/22

Keywords

  • BERT
  • Knowledge Graph
  • Language Model

ASJC Scopus subject areas

  • Artificial Intelligence
  • Computational Mathematics
  • Control and Optimization
  • Numerical Analysis

Fingerprint

Dive into the research topics of 'Knowledge Graph Fusion for Language Model Fine-Tuning'. Together they form a unique fingerprint.

Cite this