Dissertation project: subgraph retrieval and knowledge-graph-augmented large language models


GRAG: Graph Retrieval-Augmented Generation

Naive Retrieval-Augmented Generation (RAG) methods are unaware of the topological structure of the retrieved documents or knowledge graphs; relying on text alone, they cannot handle graph question-answering scenarios.

We introduce GRAG, which retrieves relevant subgraphs rather than isolated entities. The LLM's generation is then conditioned jointly on the query and the retrieved textual subgraph.
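To make the contrast with entity-level retrieval concrete, here is a minimal sketch of retrieving a connected subgraph around seed entities via k-hop expansion. This is an illustrative stand-in, not the repo's actual retrieval code (GRAG scores and prunes subgraphs with learned embeddings); the function name and triple format are assumptions.

```python
from collections import deque

def retrieve_subgraph(edges, seed_nodes, k=1):
    """Illustrative k-hop subgraph retrieval around seed entities.

    edges: list of (head, relation, tail) triples.
    Returns the triples whose endpoints both lie within k hops
    of any seed node, i.e. a connected neighborhood rather than
    a bag of isolated entities.
    """
    # Build an undirected adjacency map over the triples.
    adj = {}
    for h, _, t in edges:
        adj.setdefault(h, set()).add(t)
        adj.setdefault(t, set()).add(h)

    # Breadth-first expansion outward from the seeds, up to k hops.
    visited = set(seed_nodes)
    frontier = deque((n, 0) for n in seed_nodes)
    while frontier:
        node, depth = frontier.popleft()
        if depth == k:
            continue
        for nbr in adj.get(node, ()):
            if nbr not in visited:
                visited.add(nbr)
                frontier.append((nbr, depth + 1))

    # Keep only triples fully inside the visited neighborhood.
    return [(h, r, t) for h, r, t in edges if h in visited and t in visited]
```

The returned triples preserve relations between neighbors, which is exactly the topological signal that entity-only retrieval discards.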

Try GRAG

1) Data Preprocessing

python -m src.dataset.preprocess.expla_graphs
python -m src.dataset.expla_graphs

2) Training

# GRAG with the frozen LLM
python train.py --dataset expla_graphs --model_name graph_llm

# GRAG with the LLM fine-tuned via LoRA
python train.py --dataset expla_graphs --model_name graph_llm --llm_frozen False
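Note that `--llm_frozen False` passes the literal string `False`. A hypothetical reconstruction of how such a flag is typically parsed is shown below; the real `train.py` may define its arguments differently. The string comparison avoids the well-known `argparse` pitfall that `type=bool` treats any non-empty string, including `"False"`, as truthy.

```python
import argparse

def build_parser():
    # Hypothetical reconstruction of the CLI shown above;
    # flag names mirror the commands, defaults are assumptions.
    parser = argparse.ArgumentParser(description="GRAG training (sketch)")
    parser.add_argument("--dataset", default="expla_graphs")
    parser.add_argument("--model_name", default="graph_llm")
    # Parsed as a string, not type=bool, since bool("False") is True.
    parser.add_argument("--llm_frozen", default="True",
                        choices=["True", "False"])
    return parser

args = build_parser().parse_args(["--llm_frozen", "False"])
# Explicit string comparison selects the LoRA fine-tuning path.
freeze_llm = args.llm_frozen == "True"
```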

Citation

@article{hu2024grag,
  title={GRAG: Graph Retrieval-Augmented Generation},
  author={Hu, Yuntong and Lei, Zhihan and Zhang, Zheng and Pan, Bo and Ling, Chen and Zhao, Liang},
  journal={arXiv preprint arXiv:2405.16506},
  year={2024}
}

Acknowledgements

  • Thanks to the gpt-fast project for its code.
  • Thanks to the G-Retriever contributors for their valuable work and open source contributions.