Context-Augmented Code Generation Using Programming Knowledge Graphs

Shahd Seddik, Fahd Seddik, Iman Saberi et al.

January 28, 2026

Score: 8.6

Interest Score Breakdown

Seismic Impact (30%)

8.0/10

Industry-wide significance

Ecosystem Relevance (70%)

9.0/10

Applicable to your apps

Abstract

Large Language Models (LLMs) excel at code generation but struggle with complex problems. Retrieval-Augmented Generation (RAG) mitigates this issue by integrating external knowledge, yet retrieval models often miss relevant context, and generation models hallucinate when given irrelevant data. We propose the Programming Knowledge Graph (PKG) for semantic representation and fine-grained retrieval of code and text. Our approach enhances retrieval precision through tree pruning and mitigates hallucinations via a re-ranking mechanism that integrates non-RAG solutions. Structuring external data into finer-grained nodes improves retrieval granularity. Evaluations on HumanEval and MBPP show up to 20% pass@1 accuracy gains and a 34% improvement over baselines on MBPP. Our findings demonstrate that the proposed PKG approach, together with the re-ranker, effectively addresses complex problems while having minimal negative impact on solutions that are already correct without RAG. The replication package is published at https://github.com/iamshahd/ProgrammingKnowledgeGraph
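The abstract reports its results as pass@1 on HumanEval and MBPP. As a rough illustration of that metric, the sketch below computes the standard unbiased pass@k estimate that was introduced alongside HumanEval: for each problem, n solutions are sampled, c of them pass the unit tests, and the estimate is 1 - C(n-c, k) / C(n, k), which reduces to c/n for k = 1. The function names `pass_at_k` and `mean_pass_at_k` are illustrative, and whether this paper samples one or several solutions per problem is an assumption not stated in the abstract.

```python
from math import comb


def pass_at_k(n: int, c: int, k: int) -> float:
    """Unbiased pass@k estimate for one problem.

    n: number of sampled solutions, c: number that pass all tests,
    k: sample budget being evaluated.
    """
    if not 0 <= c <= n or not 1 <= k <= n:
        raise ValueError("need 0 <= c <= n and 1 <= k <= n")
    # comb(n - c, k) / comb(n, k) is the probability that a random
    # size-k subset of the n samples contains no passing solution.
    return 1.0 - comb(n - c, k) / comb(n, k)


def mean_pass_at_k(results: list[tuple[int, int]], k: int) -> float:
    """Average pass@k over a benchmark given (n, c) pairs per problem."""
    return sum(pass_at_k(n, c, k) for n, c in results) / len(results)


# Hypothetical counts for three problems, 10 samples each (not from the paper).
example = [(10, 3), (10, 0), (10, 10)]
print(mean_pass_at_k(example, k=1))
```

With a single greedy sample per problem (n = 1), pass@1 is simply the fraction of problems whose one generated solution passes.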

Source

arXiv ID: 2601.20810

