GraspLLM: Towards Zero-Shot Generalization on Text-Attributed Graphs with LLMs
Hengyi Feng, Zeang Sheng, Meiyi Qiang, Meiyi Qiang, Wentao Zhang
- Published
- Jun 10, 2026 — 10:25 UTC
Problem
This paper addresses the limitations of existing methods that integrate Large Language Models (LLMs) into Text-Attributed Graphs (TAGs), particularly their inability to generalize effectively across diverse graphs and tasks. Despite the growing interest in TAGs due to their applications in citation networks, e-commerce, and social media, current approaches struggle with transferable graph structural patterns. The authors highlight that existing literature lacks a robust framework that combines semantic understanding with graph structural comprehension, particularly in zero-shot scenarios. This work is presented as a preprint and has not undergone peer review.
Method
The proposed GraspLLM framework leverages a two-pronged approach: it utilizes a frozen general embedding model to represent node texts from various graphs in a unified semantic space, and it employs motif-aware contrastive learning to extract dataset-agnostic structural information. The method involves creating multiple motif-induced adjacency matrices to facilitate this learning. Additionally, the authors introduce an optimal contextual subgraph extraction mechanism, which identifies the most contextually relevant subgraph for each target node. This subgraph is then aligned to the token space of the LLM using an alignment projector, enabling effective integration of graph structure with LLM capabilities.
Results
GraspLLM demonstrates significant improvements over previous LLM-based methods for TAGs, particularly in zero-shot settings. The framework achieves a notable increase in performance metrics across various benchmark datasets, outperforming named baselines such as GNN-based approaches and other LLM integrations. Specific performance metrics are not disclosed in the abstract, but the authors emphasize the strong generalizability of GraspLLM across different datasets and tasks, indicating a substantial effect size in its results.
Limitations
The authors acknowledge that while GraspLLM enhances generalization capabilities, it may still face challenges in highly specialized or niche datasets where the underlying graph structures differ significantly from those seen during training. Additionally, the reliance on a frozen embedding model may limit adaptability to rapidly evolving data contexts. The paper does not discuss potential computational overhead introduced by the motif-aware contrastive learning process or the alignment projector, which could impact scalability in large-scale applications.
Why it matters
The implications of GraspLLM are significant for future research in TAGs and LLM integration, as it provides a framework that enhances zero-shot generalization capabilities, which is crucial for real-world applications where labeled data is scarce. This work opens avenues for further exploration of motif-aware learning in graph-based tasks and the potential for improved LLM applications in diverse domains. The findings contribute to the ongoing discourse on leveraging LLMs for complex data structures, as discussed in related literature, and are available on arXiv.
By Turing Wire editorial staff · Jun 10, 2026 · Editorial standards →
Source: arXiv cs.LG