From ac5639251d1920f8be43c2eff134b5b5bb83eafa Mon Sep 17 00:00:00 2001
From: tpoisonooo
Date: Wed, 7 May 2025 18:09:09 +0800
Subject: [PATCH 1/4] Update README.md

---
 README.md | 4 ++++
 1 file changed, 4 insertions(+)

diff --git a/README.md b/README.md
index 5a504094f..3bce9b3d2 100644
--- a/README.md
+++ b/README.md
@@ -276,6 +276,10 @@ For more examples, please see [chat.md](./docs/en/user_guides/chat.md).
 
 - We recommend using [OpenCompass](https://github.com/InternLM/opencompass), a comprehensive and systematic LLM evaluation library, which currently supports 50+ datasets with about 300,000 questions.
 
+### Data Preparation
+
+- You can use **[GraphGen](https://github.com/open-sciencelab/GraphGen)** to create synthetic data for fine-tuning.
+
 ## 🤝 Contributing
 
 We appreciate all contributions to XTuner. Please refer to [CONTRIBUTING.md](.github/CONTRIBUTING.md) for the contributing guideline.

From 9121df6f8bdfdd4d8d3058f67c8a8ac1a5f5bc50 Mon Sep 17 00:00:00 2001
From: tpoisonooo
Date: Wed, 7 May 2025 18:11:23 +0800
Subject: [PATCH 2/4] Update README_zh-CN.md

---
 README_zh-CN.md | 4 ++++
 1 file changed, 4 insertions(+)

diff --git a/README_zh-CN.md b/README_zh-CN.md
index 7ddc7864e..6eff930d6 100644
--- a/README_zh-CN.md
+++ b/README_zh-CN.md
@@ -277,6 +277,10 @@ xtuner chat internlm/internlm2-chat-7b --prompt-template internlm2_chat
 
 - 推荐使用一站式平台 [OpenCompass](https://github.com/InternLM/opencompass) 来评测大语言模型,其目前已涵盖 50+ 数据集的约 30 万条题目。
 
+### 训练数据
+
+- 推荐使用 **[GraphGen](https://github.com/open-sciencelab/GraphGen)** 合成 SFT 所需训练数据,目前已在多个垂域验证效果。
+
 ## 🤝 贡献指南
 
 我们感谢所有的贡献者为改进和提升 XTuner 所作出的努力。请参考[贡献指南](.github/CONTRIBUTING.md)来了解参与项目贡献的相关指引。

From e3a068fdc76b5639d9e2b7172516a528a4479674 Mon Sep 17 00:00:00 2001
From: tpoisonooo
Date: Wed, 7 May 2025 18:14:08 +0800
Subject: [PATCH 3/4] Update README.md

---
 README.md | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/README.md b/README.md
index 3bce9b3d2..233c47592 100644
--- a/README.md
+++ b/README.md
@@ -278,7 +278,7 @@ For more examples, please see [chat.md](./docs/en/user_guides/chat.md).
 
 ### Data Preparation
 
-- You can use **[GraphGen](https://github.com/open-sciencelab/GraphGen)** to create synthetic data for fine-tuning.
+- You can use [GraphGen](https://github.com/open-sciencelab/GraphGen) to create synthetic data for fine-tuning.
 
 ## 🤝 Contributing
 

From 1e8a4f96fe65eb88877bb54474701a4a106013b2 Mon Sep 17 00:00:00 2001
From: tpoisonooo
Date: Wed, 7 May 2025 18:15:05 +0800
Subject: [PATCH 4/4] Update README_zh-CN.md

---
 README_zh-CN.md | 4 ++--
 1 file changed, 2 insertions(+), 2 deletions(-)

diff --git a/README_zh-CN.md b/README_zh-CN.md
index 6eff930d6..2f228e5a4 100644
--- a/README_zh-CN.md
+++ b/README_zh-CN.md
@@ -277,9 +277,9 @@ xtuner chat internlm/internlm2-chat-7b --prompt-template internlm2_chat
 
 - 推荐使用一站式平台 [OpenCompass](https://github.com/InternLM/opencompass) 来评测大语言模型,其目前已涵盖 50+ 数据集的约 30 万条题目。
 
-### 训练数据
+### 数据
 
-- 推荐使用 **[GraphGen](https://github.com/open-sciencelab/GraphGen)** 合成 SFT 所需训练数据,目前已在多个垂域验证效果。
+- 推荐使用 [GraphGen](https://github.com/open-sciencelab/GraphGen) 合成 SFT 所需训练数据,目前已在多个垂域验证效果。
 
 ## 🤝 贡献指南
 