Welcome to the official repository for KG-TRACES! 🚀 We're enhancing Large Language Models to reason with explainability, accuracy, and traceability by leveraging the power of Knowledge Graphs.
Vanilla LLMs are amazing, but when it comes to complex, multi-hop reasoning, they can sometimes...
- 🤯 Hallucinate facts
- ❓ Provide answers without clear justification
- 🚧 Hit a wall in scenarios demanding trustworthy, step-by-step explanations
This limits their use in critical domains. That's where KG-TRACES steps in!
*Figure 1: KG-TRACES (d) stands out by generating faithful, attributable responses, adapting to different KG access conditions.*

KG-TRACES is a novel framework that explicitly teaches LLMs how to reason by supervising their internal "thought process" with knowledge graph guidance. We guide them to:
- 🗺️ Chart the Course: Predict symbolic knowledge graph reasoning paths from question to answer.
- 📝 Show Their Work: Generate attribution-aware reasoning explanations that clearly indicate whether each step comes from the KG or the LLM's internal knowledge 🧠, and how effective it was!
- 🔍 Crystal-Clear Explanations: Understand why the LLM reached its conclusion.
- 🛡️ Trustworthy & Attributable: Know the evidence source of each reasoning step.
- 💪 Robust Performance: Excels even with limited or no direct KG access during inference.
- 🌍 Versatile: Shows strong generalization to specialized fields like medicine.
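To make the attribution idea concrete, here is a minimal sketch of how a source-tagged reasoning trace could be parsed. Note that the `[KG]`/`[LLM]` line format and the example trace are purely illustrative assumptions, not the actual output format of KG-TRACES:

```python
import re

def parse_trace(trace: str) -> list[dict]:
    """Split a reasoning trace into steps, each tagged with its evidence source.

    Assumes a hypothetical "[SOURCE] step text" line format, where SOURCE
    is either KG (retrieved from the knowledge graph) or LLM (the model's
    internal knowledge).
    """
    steps = []
    for line in trace.strip().splitlines():
        m = re.match(r"\[(KG|LLM)\]\s*(.*)", line.strip())
        if m:
            steps.append({"source": m.group(1), "step": m.group(2)})
    return steps

# Illustrative trace with one KG-grounded and one internal-knowledge step.
example = """
[KG] Jean Valjean -> character_in -> Les Miserables
[LLM] Les Miserables was written by Victor Hugo.
"""
parsed = parse_trace(example)
print(parsed)
```

Tagging every step this way is what makes each conclusion auditable: a reader can check KG-sourced steps against the graph and treat LLM-sourced steps with appropriate caution.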
[2025-06-04]: We open-source the KG-TRACES codebase and the training dataset of KG-TRACES.
[2025-06-03]: The KG-TRACES paper is live! Check it out on arXiv.
Ready to dive in? Here's how:
Make sure you have:
- Python 3.12+
- PyTorch 2.6.0+
- 🤗 Transformers & Datasets
- DeepSpeed 0.16+
git clone https://github.com/Edaizi/KG-TRACES.git
cd KG-TRACES
conda create -n kg_traces python=3.12
pip install -r requirements.txt

We've meticulously prepared augmented SFT datasets for WebQSP and CWQ, packed with reasoning paths and reasoning processes annotated with source attributions. Find them on Hugging Face:
Using the Datasets:
from datasets import load_dataset
webqsp_sft_data = load_dataset("Edaizi/KG-TRACES-WebQSP")
cwq_sft_data = load_dataset("Edaizi/KG-TRACES-CWQ")
print("Example WebQSP SFT instance:")
print(webqsp_sft_data['train'][0])  # Show an example

To train, just run scripts/train.sh:
bash scripts/train.sh

To run inference, just run scripts/predict.sh:
bash scripts/predict.sh

Don't want to train from scratch? Grab our fine-tuned KG-TRACES models from the Hugging Face Model Hub: KG-TRACES
from transformers import AutoModelForCausalLM, AutoTokenizer
model_hub_name = "Edaizi/KG-TRACES"
tokenizer = AutoTokenizer.from_pretrained(model_hub_name)
model = AutoModelForCausalLM.from_pretrained(model_hub_name)

For any questions or feedback, please:
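Once loaded, the checkpoint can be queried like any causal LM. The sketch below is a minimal usage example; the prompt template is an illustrative assumption (check the predict script for the exact format used during fine-tuning):

```python
def build_prompt(question: str) -> str:
    """Wrap a question in a simple instruction prompt (assumed template)."""
    return f"Question: {question}\nAnswer with your reasoning path:"

def generate_answer(question: str, model_hub_name: str = "Edaizi/KG-TRACES") -> str:
    # Imported here so build_prompt stays dependency-free.
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_hub_name)
    model = AutoModelForCausalLM.from_pretrained(model_hub_name)
    inputs = tokenizer(build_prompt(question), return_tensors="pt")
    output_ids = model.generate(**inputs, max_new_tokens=256)
    # Decode only the newly generated tokens, skipping the prompt.
    new_tokens = output_ids[0][inputs["input_ids"].shape[1]:]
    return tokenizer.decode(new_tokens, skip_special_tokens=True)

# Example (requires network access to download the checkpoint):
# print(generate_answer("What country is the Eiffel Tower in?"))
```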
- Open an issue in the GitHub repository
- Reach out to us at [email protected]
If KG-TRACES helps your research or project, we'd love a shout-out! Please cite:
@misc{wu2025kgtracesenhancinglargelanguage,
title={KG-TRACES: Enhancing Large Language Models with Knowledge Graph-constrained Trajectory Reasoning and Attribution Supervision},
author={Rong Wu and Pinlong Cai and Jianbiao Mei and Licheng Wen and Tao Hu and Xuemeng Yang and Daocheng Fu and Botian Shi},
year={2025},
eprint={2506.00783},
archivePrefix={arXiv},
primaryClass={cs.CL},
url={https://arxiv.org/abs/2506.00783},
}

We utilized the following repos during development: