RapidAI
diff --git a/‎.github/workflows/docs_build_develop.yml‎
Lines changed: 36 additions & 0 deletions b/‎.github/workflows/docs_build_develop.yml‎
Lines changed: 36 additions & 0 deletions
diff --git a/‎docs/README_zh.md‎ ‎README_zh.md‎docs/README_zh.md renamed to README_zh.md b/‎docs/README_zh.md‎ ‎README_zh.md‎docs/README_zh.md renamed to README_zh.md
diff --git a/‎docs/blog/.authors.yml‎
Lines changed: 6 additions & 0 deletions b/‎docs/blog/.authors.yml‎
Lines changed: 6 additions & 0 deletions
diff --git a/‎docs/blog/.meta.yml‎
Lines changed: 3 additions & 0 deletions b/‎docs/blog/.meta.yml‎
Lines changed: 3 additions & 0 deletions
diff --git a/‎docs/blog/index.md‎ b/‎docs/blog/index.md‎
diff --git a/‎docs/blog/posts/custom_llm_api.md‎
Lines changed: 162 additions & 0 deletions b/‎docs/blog/posts/custom_llm_api.md‎
Lines changed: 162 additions & 0 deletions
diff --git a/‎docs/blog/posts/supported_llm.md‎
Lines changed: 19 additions & 0 deletions b/‎docs/blog/posts/supported_llm.md‎
Lines changed: 19 additions & 0 deletions
diff --git a/‎docs/changelog.md‎
Lines changed: 48 additions & 0 deletions b/‎docs/changelog.md‎
Lines changed: 48 additions & 0 deletions
diff --git a/‎docs/index.md‎
Lines changed: 63 additions & 0 deletions b/‎docs/index.md‎
Lines changed: 63 additions & 0 deletions
diff --git a/‎docs/online_demo.md‎
Lines changed: 29 additions & 0 deletions b/‎docs/online_demo.md‎
Lines changed: 29 additions & 0 deletions
@@ -0,0 +1,36 @@
+name: Build/Publish Develop Docs
+on:
+  push:
+    branches:
+      - master
+      - main
+permissions:
+  contents: write
+jobs:
+  deploy:
+    runs-on: ubuntu-latest
+    steps:
+      - uses: actions/checkout@v4
+        with:
+          fetch-depth: 0
+      - name: Configure Git Credentials
+        run: |
+          git config user.name github-actions[bot]
+          git config user.email 41898282+github-actions[bot]@users.noreply.github.com
+      - uses: actions/setup-python@v5
+        with:
+          python-version: 3.x
+      - run: echo "cache_id=$(date --utc '+%V')" >> $GITHUB_ENV
+      - uses: actions/cache@v4
+        with:
+          key: mkdocs-material-${{ env.cache_id }}
+          path: .cache
+          restore-keys: |
+            mkdocs-material-
+      - run: pip install mike mkdocs-material jieba mkdocs-git-revision-date-localized-plugin mkdocs-git-committers-plugin-2 mkdocs-static-i18n
+      - run: |
+          git fetch origin gh-pages --depth=1
+          mkdocs build
+          ls -la site/
+          mike set-default main
+          mike deploy --push --update-aliases main latest
@@ -0,0 +1,6 @@
+authors:
+  SWHL:
+    name: SWHL
+    description: Creator
+    avatar: https://avatars.githubusercontent.com/u/28639377?v=4
+    url: https://github.com/SWHL
@@ -0,0 +1,3 @@
+comments: true
+hide:
+  - feedback
@@ -0,0 +1,162 @@
+---
+title: 自定义LLM API
+date:
+  created: 2023-09-11
+authors: [SWHL]
+categories:
+    - General
+comments: true
+---
+
+### 引言
+
+{{% alert context="info" %}}该项目的LLM部分是独立的，用户可在 **knowledge_qa_llm/llm** 自定义配置所需的LLM接口。{{% /alert %}}
+
+下面以自定义支持InterLM-7b大模型为例，说明如何支持的。前提是本地满足部署LLM的推理条件。
+
+### 步骤如下
+
+#### 1. 部署LLM模型到本地
+
+具体如何下载，参见Hugging Face中[internlm-7b](https://huggingface.co/internlm/internlm-7b)。
+
+#### 2. 编写模型的部署推理代码
+
+这一点可以参考[ChatGLM](https://github.com/THUDM/ChatGLM-6B/blob/main/api.py)API的实现。只需要替换模型加载部分为InternLM的即可。具体如下：
+
+<details>
+
+```python {linenos=table}
+from fastapi import FastAPI, Request
+from transformers import AutoTokenizer, AutoModel
+import uvicorn, json, datetime
+import torch
+
+DEVICE = "cuda"
+DEVICE_ID = "0"
+CUDA_DEVICE = f"{DEVICE}:{DEVICE_ID}" if DEVICE_ID else DEVICE
+
+
+def torch_gc():
+    if torch.cuda.is_available():
+        with torch.cuda.device(CUDA_DEVICE):
+            torch.cuda.empty_cache()
+            torch.cuda.ipc_collect()
+
+
+app = FastAPI()
+
+
+@app.post("/")
+async def create_item(request: Request):
+    global model, tokenizer
+    json_post_raw = await request.json()
+    json_post = json.dumps(json_post_raw)
+    json_post_list = json.loads(json_post)
+    prompt = json_post_list.get('prompt')
+    history = json_post_list.get('history')
+    max_length = json_post_list.get('max_length')
+    top_p = json_post_list.get('top_p')
+    temperature = json_post_list.get('temperature')
+    response, history = model.chat(tokenizer,
+                                prompt,
+                                history=history,
+                                max_new_tokens=max_length if max_length else 2048,
+                                top_p=top_p if top_p else 0.7,
+                                temperature=temperature if temperature else 0.95)
+    now = datetime.datetime.now()
+    time = now.strftime("%Y-%m-%d %H:%M:%S")
+    answer = {
+        "response": response,
+        "history": history,
+        "status": 200,
+        "time": time
+    }
+    log = "[" + time + "] " + '", prompt:"' + prompt + '", response:"' + repr(response) + '"'
+    print(log)
+    torch_gc()
+    return answer
+
+
+if __name__ == '__main__':
+    tokenizer = AutoTokenizer.from_pretrained("internlm/internlm-chat-7b-v1_1", trust_remote_code=True)
+    model = AutoModel.from_pretrained("internlm/internlm-chat-7b-v1_1", trust_remote_code=True).half().cuda()
+    model.eval()
+    uvicorn.run(app, host='0.0.0.0', port=8000, workers=1)
+```
+
+</details>
+
+#### 3. 编写调用接口部分代码
+
+在以下项目`knowledge_qa_llm/llm/`目录下创建`internlm_7b.py`文件，具体代码如下：
+
+<details>
+
+```python {linenos=table}
+import json
+from typing import List, Optional
+
+import requests
+
+
+class InternLM_7B:
+    def __init__(self, api_url: str = None):
+        self.api_url = api_url
+
+    def __call__(self, prompt: str, history: Optional[List] = None, **kwargs):
+        if not history:
+            history = []
+
+        data = {"prompt": prompt, "history": history}
+        if kwargs:
+            temperature = kwargs.get("temperature", 0.1)
+            top_p = kwargs.get("top_p", 0.7)
+            max_length = kwargs.get("max_length", 4096)
+
+            data.update(
+                {"temperature": temperature, "top_p": top_p, "max_length": max_length}
+            )
+        req = requests.post(self.api_url, data=json.dumps(data), timeout=60)
+        try:
+            rdata = req.json()
+            if rdata["status"] == 200:
+                return rdata["response"]
+            return "Network error"
+        except Exception as e:
+            return f"Network error:{e}"
+```
+
+</details>
+
+#### 4. 添加导入声明
+
+在`knowledge_qa_llm/llm/__init__.py`中添加对应的`import`部分代码，示例如下：
+
+```python {linenos=table}
+from .baichuan_7b import BaiChuan7B
+from .chatglm2_6b import ChatGLM2_6B
+from .ernie_bot_turbo import ERNIEBotTurbo
+from .qwen7b_chat import Qwen7B_Chat
+from .internlm_7b import InternLM_7B
+
+__all__ = ["BaiChuan7B", "ChatGLM2_6B", "ERNIEBotTurbo", "Qwen7B_Chat", "InternLM_7B"]
+```
+
+#### 5. 更改配置文件
+
+更改`knowledge_qa_llm/config.yaml`
+
+```yaml {linenos=table}
+LLM_API:
+    InternLM_7B: your_api
+    Qwen7B_Chat: your_api
+    ChatGLM2_6B: your_api
+    BaiChuan7B: your_api
+```
+
+#### 6. 启动
+
+```bash {linenos=table}
+streamlit run web_ui.py
+```
@@ -0,0 +1,19 @@
+---
+title: 支持的LLM
+date:
+  created: 2023-09-11
+authors: [SWHL]
+categories:
+    - General
+comments: true
+---
+
+✔ [ChatGLM2-6B](https://huggingface.co/THUDM/chatglm2-6b)
+
+✔ [BaiChuan-7B](https://huggingface.co/baichuan-inc/Baichuan-7B)
+
+✔ [Qwen-7B](https://huggingface.co/Qwen/Qwen-7B)
+
+✔ [llama2](https://github.com/facebookresearch/llama)
+
+✔ [InternLM-7b](https://huggingface.co/internlm/internlm-7b)
@@ -0,0 +1,48 @@
+---
+comments: true
+hide:
+  - toc
+---
+
+
+#### 2023-10-15 v0.0.10 update
+
+- 当不能从文档中搜索到任何有效信息时，会直接调用模型本身的能力。
+- 完善文档，添加超参数的解释
+- 基于erniebot库，统一文心一言版本和仓库主分支版本
+
+#### 2023-09-07 v0.0.9 update
+
+- 解决多人上传的文档，会被其他人搜到的问题
+- 优化UI界面
+
+#### 2023-08-11 v0.0.7 update
+
+- 优化布局，去掉插件选项，将提取向量模型选项放到主页部分
+- 将提示语英语化，便于交流使用。
+- 添加项目logo: 🧐
+- 更新CLI使用代码
+
+#### 2023-08-05 v0.0.6 update
+
+- 适配更多模型接口，包括在线大模型接口，例如文心一言
+- 添加提取特征向量的状态提示
+
+#### 2023-08-04 v0.0.5 update
+
+- 修复了插入数据库数据重复的问题。
+
+#### 2023-07-29 v0.0.4 update
+
+- 基于`streamlit==1.25.0`优化UI
+- 优化代码
+- 录制UI GIF demo
+
+#### 2023-07-28 v0.0.3 update
+
+- 完成文件解析部分
+
+#### 2023-07-25 v0.0.2 update
+
+- 规范现有目录结构，更加紧凑，提取部分变量到`config.yaml`中
+- 完善说明文档
@@ -0,0 +1,63 @@
+---
+comments: true
+hide:
+  - navigation
+  - toc
+---
+
+<div align="center">
+    <div>&nbsp;</div>
+    <div align="center">
+        <b><font size="6">🧐 Knowledge QA LLM</font></b>
+    </div>
+    <div>&nbsp;</div>
+     <a href=""><img src="https://img.shields.io/badge/Python->=3.8,<3.12-aff.svg"></a>
+     <a href=""><img src="https://img.shields.io/badge/OS-Linux%2C%20Win%2C%20Mac-pink.svg"></a>
+     <a href=""><img src="https://img.shields.io/github/v/release/RapidAI/QA-LocalKnowledge-LLM?logo=github"></a>
+     <a href="https://semver.org/"><img alt="SemVer2.0" src="https://img.shields.io/badge/SemVer-2.0-brightgreen"></a>
+     <a href="https://github.com/psf/black"><img src="https://img.shields.io/badge/code%20style-black-000000.svg"></a>
+     <a href="https://choosealicense.com/licenses/apache-2.0/"><img alt="GitHub" src="https://img.shields.io/github/license/RapidAI/Knowledge-QA-LLM"></a>
+     <a href="https://github.com/RapidAI/Knowledge-QA-LLM"><img src="https://img.shields.io/badge/Github-KnowledgeQALLM-brightgreen"></a>
+
+</div>
+
+### 简介
+
+基于本地知识库+LLM的问答系统。该项目的思路是由[langchain-ChatGLM](https://github.com/imClumsyPanda/langchain-ChatGLM)启发而来。
+
+- 缘由：
+    - 之前使用过这个项目，感觉不是太灵活，部署不太友好。
+    - 借鉴[如何用大语言模型构建一个知识问答系统](https://mp.weixin.qq.com/s/movaNCWjJGBaes6KxhpYpg)中思路，尝试以此作为实践。
+- 优势：
+    - 整个项目为模块化配置，不依赖`lanchain`库，各部分可轻易替换，代码简单易懂。
+    - 除需要单独部署大模型接口外，其他部分用CPU即可。
+    - 支持常见格式文档，包括txt、md、pdf, docx, pptx, excel等等。当然，也可自定义支持其他类型文档。
+
+### 整体流程
+
+#### 解析文档并存储在数据库
+
+```mermaid
+flowchart LR
+
+A([Documents]) --ExtractText--> B([sentences])
+B --Embeddings--> C([Embeddings])
+C --Store--> D[(DataBase)]
+```
+
+#### 检索并回答问题
+
+```mermaid
+flowchart LR
+E([Query]) --Embedding--> F([Embeddings]) --> H[(Database)] --Search--> G([Context])
+E --> I([Prompt])
+G --> I --> J([LLM]) --> K([Answer])
+```
+
+### 使用的工具
+
+- 文档分析: [`extract_office_content`](https://github.com/SWHL/ExtractOfficeContent), [`rapidocr_pdf`](https://github.com/RapidAI/RapidOCRPDF), [`rapidocr_onnxruntime`](https://github.com/RapidAI/RapidOCR)
+- 提取语义向量: [`moka-ai/m3e-small`](https://huggingface.co/moka-ai/m3e-base)
+- 向量存储: `sqlite`
+- 向量检索: [`faiss`](https://github.com/facebookresearch/faiss)
+- UI搭建: [`streamlit>=1.25.0`](https://github.com/streamlit/streamlit)
@@ -0,0 +1,29 @@
+---
+comments: true
+hide:
+  - navigation
+  - toc
+---
+
+
+#### 简介
+
+在线demo是基于百度的AI Studio平台搭建，基于文心一言大模型的接口搭建。
+
+因为该项目核心在于利用大模型的总结和提取能力，主打离线私有部署，但是一直没有一个在线demo供大家查看效果。因此有了基于文心一言版的 **🧐 Knowledge QA LLM**。
+
+#### Demo源码
+
+基于`erniebot`库来搭建的，如需使用，需要鉴权，提供**Access Token**，具体教程，参见：[link](https://github.com/PaddlePaddle/ERNIE-Bot-SDK/blob/develop/docs/authentication.md)
+
+地址： <https://aistudio.baidu.com/projectdetail/6675380?contributionType=1>
+
+#### 在线Demo
+
+{{< alert text="该Demo主要侧重查看效果，至于工程化则差一些。" />}}
+
+基于文心一言API的文档知识问答系统: <https://aistudio.baidu.com/application/detail/8138>
+
+<div align="center">
+    <img src="https://github.com/RapidAI/Knowledge-QA-LLM/releases/download/v0.0.1/UIDemo.gif" width="100%" height="100%">
+</div>
Original file line number	Diff line number	Diff line change
`@@ -0,0 +1,3 @@`
	`1`	`+comments: true`
	`2`	`+hide:`
	`3`	`+ - feedback`