TODO list

待办事项（有生之年……）

待读论文：

LLM-Based Multi-Agent Systems for Software Engineering: Literature Review, Vision and the Road Ahead

Large Language Model for Vulnerability Detection and Repair: Literature Review and the Road Ahead

A Survey on RAG Meeting LLMs: Towards Retrieval-Augmented Large Language Models

Deep Learning Library Testing: Definition, Methods and Challenges

Towards an Understanding of Context Utilization in Code Intelligence

https://arxiv.org/pdf/2503.02951

https://arxiv.org/pdf/2508.05170

https://arxiv.org/pdf/2509.17325

采用 RL 的框架：https://github.com/volcengine/verl，分别跑 GRPO 和 DAPO 两个算法。

先跑论文数据集复现：https://huggingface.co/datasets/BytedTsinghua-SIA/DAPO-Math-17k

然后在新的数据集上测测： https://huggingface.co/datasets/zwhe99/DeepMath-103K