Repository logoCyprus University of Technology
Log In(current)
Ελληνικά
English
  1. Home
  2. Cyprus University of Technology (Research Output)
  3. Πτυχιακές Εργασίες/ Bachelor's Degree Theses
  4. An Evaluation of Automated Fact-Checking using Grounding Large Language Models with Knowledge Graphs
  • Details

An Evaluation of Automated Fact-Checking using Grounding Large Language Models with Knowledge Graphs

Date Issued
May 2025
Author(s)
Gypsiotis, Sotiris  
Advisor
Sirivianos, Michael  
Abstract
Automated fact-checking has become an essential tool in combating the widespread spread of misinformation
in the digital age. This thesis explores the integration of Large Language Models with Knowledge
Graphs to enhance fact-checking systems, for both structured and unstructured data, in order to
improve accuracy and scalability. LLMs, particularly Transformer-based architectures such as GPT and
DeepSeek, have demonstrated remarkable capabilities in understanding and generating natural language.
However, their sensitivity to hallucinations and reliance on probabilistic reasoning present challenges
in ensuring factual accuracy. In contrast, KGs provide structured, relational data that encodes factual
knowledge in a deterministic manner. By combining the contextual language understanding of LLMs
with the structured reasoning of KGs, automated fact-checking systems can achieve greater robustness
and reliability.
This thesis investigates the mechanisms of both LLMs and KGs, detailing their strengths and limitations
in the fact-checking pipeline. The research first provides an overview of AI, ML, and DL, focusing on
their applications in misinformation detection. It then examines the construction and representation of
KGs, including methods for entity recognition, relationship extraction, and integration with language
models. Furthermore, we explore the role of LLMs in claim parsing, evidence retrieval, and truthfulness
assessment. The integration of these technologies introduces challenges such as knowledge alignment,
consistency, computational efficiency, and understandability.
To evaluate the effectiveness of this integration, this work implements an automated fact-checking pipeline
using the Ollama framework and three DeepSeek R1 models: 1.5B, 7B, and 8B. A dataset of 200 factual
statements was sourced from the Wikidata KG and used to test the models’ ability to extract, validate,
and compare facts. Evaluation was conducted using classification metrics (Precision, Recall, F1-score)
and lexical overlap metrics (ROUGE score). The 7B model achieved the highest F1-score (0.58), outperforming
both the smaller 1.5B model and the larger 8B model. Notably, the 8B model showed signs
of overfitting and increased hallucination, despite its larger parameter count. These results suggest that
model size does not linearly correlate with performance, and that mid-sized models, like the 7B, may
provide better generalization and factual alignment in structured verification tasks.
The findings of this thesis contribute to the field of AI-driven misinformation detection, offering insights
into how the cooperation between LLMs and KGs can lead to more accurate, transparent, and scalable
fact-checking solutions. This thesis concludes by discussing future directions, including the potential
for real-time claim verification and ethical considerations in AI-driven fact-checking. By advancing the
integration of structured and unstructured data sources, this work lays the groundwork for more reliable
automated verification systems, ultimately for a more trustworthy information system.
Subjects

Automated Fact-Checki...

Large Language Models...

Misinformation

Knowledge Graph (KG)

File(s)
Thumbnail Image
Name

Gypsiotis_2025-BSC_abstract.pdf

Size

138.84 KB

Format

Adobe PDF

Checksum (MD5)

1c0ed10ab2243a5c9b3d092c954747a8

Explore by
  • Collections
  • Research Outputs
  • Researchers
  • Faculty & Departments
  • Theses
  • Patents
  • Projects
  • Journals
  • Conferences
Useful Links
  • Researcher Portfolio Guide
  • Researcher Profile
  • Create an ORCID ID
  • CUT Open Access Author Fund
  • ETDS Guide
Copyright Policies

Use Sherpa/Romeo to find publisher copyright policies

Go
Go
  • SPARC Author Addendum Engine
  • National Open Access Policy in Cyprus
Deposit your work to Ktisis
  • Self-archiving. Please sign in to Ktisis.
  • Email your work to:
    library.dspace@cut.ac.cy
  • Contact your subject librarian

Member of

OpenAIREre3dataOpenDOARCOREDART
Cyprus University of Technology
Library and
Information
Services

Copyright © 2022 - Library and Information Services Feedback - Built with DSpace-CRIS - 4Science

  • Accessibility settings
  • Privacy policy
  • End User Agreement
COAR NotifyCOAR Notify