Skip to content

Latest commit

 

History

History
216 lines (211 loc) · 38.3 KB

File metadata and controls

216 lines (211 loc) · 38.3 KB

返回目录问题反馈

中文增速榜 > 软件类 > Python

数据更新: 2024-10-05   /   温馨提示:中文项目泛指「文档母语为中文」OR「含有中文翻译」的项目,通常在项目的「readme/wiki/官网」可以找到

# Repository Description Stars Average daily growth Updated
1 2noise/ChatTTS A generative speech model for daily dialogue. 31182 238 2024-09-21
2 All-Hands-AI/OpenHands 🙌 OpenHands: Code Less, Make More 32695 159 2024-10-04
3 Ucas-HaoranWei/GOT-OCR2.0 Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model 4848 147 2024-10-02
4 KwaiVGI/LivePortrait Bring portraits to life! 12096 129 2024-09-06
5 RVC-Boss/GPT-SoVITS 1 min voice data can also be used to train a good TTS model! (few shot voice cloning) 33481 126 2024-10-02
6 binary-husky/gpt_academic 为GPT/GLM等LLM大语言模型提供实用化交互接口,特别优化论文阅读/润色/写作体验,模块化设计,支持自定义快捷按钮&函数插件,支持Python和C++等项目剖析&自译解功能,PDF/LaTex论文翻译&总结功能,支持并行问询多种LLM模型,支持chatglm3等本地模型。接入通义千问, deepseekcoder, 讯飞星火, 文心一言, llama2, rwkv, claude2, moss ... 64510 114 2024-10-01
7 hpcaitech/Open-Sora Open-Sora: Democratizing Efficient Video Production for All 21762 95 2024-08-09
8 myshell-ai/OpenVoice Instant voice cloning by MIT and MyShell. 28993 93 2024-08-21
9 fudan-generative-vision/hallo Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation 9256 80 2024-09-14
10 harry0703/MoneyPrinterTurbo 利用AI大模型,一键生成高清短视频 Generate short videos with one click using AI LLM. 16335 79 2024-07-26
11 THUDM/ChatGLM-6B ChatGLM-6B: An Open Bilingual Dialogue Language Model 开源双语对话语言模型 40465 71 2024-06-27
12 gpt-omni/mini-omni open-source multimodal large language model that can hear, talk while thinking. Featuring real-time end-to-end speech input and streaming audio output conversational capabilities. 2742 70 2024-09-25
13 InternLM/MindSearch 🔍 An LLM-based Multi-agent Framework of Web Search Engine (like Perplexity.ai Pro and SearchGPT) 4772 69 2024-09-25
14 QwenLM/Qwen2-VL Qwen2-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud. 2454 66 2024-10-04
15 lm-sys/FastChat An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena. 36585 65 2024-09-25
16 hiyouga/LLaMA-Factory Efficiently Fine-Tune 100+ LLMs in WebUI (ACL 2024) 31861 64 2024-10-01
17 infiniflow/ragflow RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding. 18530 62 2024-10-03
18 huggingface/transformers 🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX. 132919 61 2024-10-04
19 ScrapeGraphAI/Scrapegraph-ai Python scraper based on AI 14711 58 2024-10-04
20 FunAudioLLM/CosyVoice Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability. 5234 56 2024-09-29
21 huggingface/speech-to-speech Speech To Speech: an effort for an open-sourced and modular GPT4-o 3177 54 2024-09-27
22 LC044/WeChatMsg 提取微信聊天记录,将其导出成HTML、Word、Excel文档永久保存,对聊天记录进行分析生成年度聊天报告,用聊天数据训练专属于个人的AI聊天助手 33615 53 2024-09-23
23 opendatalab/PDF-Extract-Kit A Comprehensive Toolkit for High-Quality PDF Content Extraction 4973 50 2024-10-04
24 PKU-YuanGroup/Open-Sora-Plan This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project. 11291 50 2024-10-04
25 VikParuchuri/marker Convert PDF to markdown quickly with high accuracy 16799 49 2024-09-07
26 OpenBMB/MiniCPM-V MiniCPM-V 2.6: A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phone 12137 49 2024-09-13
27 jianchang512/ChatTTS-ui 一个简单的本地网页界面,使用ChatTTS将文字合成为语音,同时支持对外提供API接口。A simple native web interface that uses ChatTTS to synthesize text into speech, along with support for external API interfaces. 5994 47 2024-08-29
28 adithya-s-k/omniparse Ingest, parse, and optimize any data format ➡️ from documents to multimedia ➡️ for enhanced compatibility with GenAI frameworks 5122 42 2024-09-23
29 netease-youdao/QAnything Question and Answer based on Anything. 11532 42 2024-09-27
30 RVC-Project/Retrieval-based-Voice-Conversion-WebUI Easily train a good VC model with voice data <= 10 mins! 23509 42 2024-09-05
31 chatanywhere/GPT_API_free Free ChatGPT API Key,免费ChatGPT API,支持GPT4 API(免费),ChatGPT国内可用免费转发API,直连无需代理。可以搭配ChatBox等软件/插件使用,极大降低接口使用成本。国内即可无限制畅快聊天。 21862 41 2024-09-26
32 Kwai-Kolors/Kolors Kolors Team 3663 40 2024-09-04
33 ultralytics/ultralytics Ultralytics YOLO11 🚀 29476 39 2024-10-04
34 Upsonic/gpt-computer-assistant Intelligence development framework in python for your product like Apple Intelligence 5206 39 2024-09-10
35 THUDM/ChatGLM3 ChatGLM3 series: Open Bilingual Chat LLMs 开源双语对话语言模型 13365 39 2024-07-10
36 zhayujie/chatgpt-on-wechat 基于大模型搭建的聊天机器人,同时支持 微信公众号、企业微信应用、飞书、钉钉 等接入,可选择GPT3.5/GPT-4o/GPT-o1/ Claude/文心一言/讯飞星火/通义千问/ Gemini/GLM-4/Claude/Kimi/LinkAI,能处理文本、语音和图片,访问操作系统和互联网,支持基于自有知识库进行定制企业智能客服。 30294 38 2024-09-26
37 VikParuchuri/surya OCR, layout analysis, reading order, line detection in 90+ languages 9961 37 2024-10-03
38 hpcaitech/ColossalAI Making large AI models cheaper, faster and more accessible 38694 36 2024-09-30
39 fishaudio/fish-speech Brand new TTS solution 12837 36 2024-10-03
40 THUDM/GLM-4 GLM-4 series: Open Multilingual Multimodal Chat LMs 开源多语言多模态对话模型 4752 33 2024-09-26
41 THUDM/ChatGLM2-6B ChatGLM2-6B: An Open Bilingual Chat LLM 开源双语对话语言模型 15703 33 2024-06-27
42 QwenLM/Qwen The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud. 13591 32 2024-09-24
43 ymcui/Chinese-LLaMA-Alpaca 中文LLaMA&Alpaca大语言模型+本地CPU/GPU训练部署 (Chinese LLaMA & Alpaca LLMs) 18225 32 2024-04-30
44 NexaAI/nexa-sdk Nexa SDK is a comprehensive toolkit for supporting ONNX and GGML models. It supports text generation, image generation, vision-language models (VLM), auto-speech-recognition (ASR), and text-to-speech ... 1600 32 2024-10-04
45 qhjqhj00/MemoRAG Empowering RAG with a memory-based data interface for all-purpose applications! 966 31 2024-09-29
46 ultralytics/yolov5 YOLOv5 🚀 in PyTorch > ONNX > CoreML > TFLite 49998 31 2024-10-01
47 FunAudioLLM/SenseVoice Multilingual Voice Understanding Model 2820 30 2024-09-25
48 jingyaogong/minimind 「大模型」3小时完全从0训练一个仅有26M的小参数GPT,个人显卡即可推理训练! 2105 30 2024-10-04
49 reflex-dev/reflex 🕸️ Web apps in pure Python 🐍 19574 28 2024-10-04
50 microsoft/UFO A UI-Focused Agent for Windows OS Interaction. 7621 28 2024-09-25
51 hiroi-sora/Umi-OCR OCR software, free and offline. 开源、免费的离线OCR软件。支持截屏/批量导入图片,PDF文档识别,排除水印/页眉页脚,扫描/生成二维码。内置多国语言库。 26074 28 2024-09-29
52 assafelovic/gpt-researcher LLM based autonomous agent that does online comprehensive research on any given topic 14287 28 2024-10-04
53 PaddlePaddle/PaddleOCR Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and de ... 42999 27 2024-10-02
54 Textualize/rich Rich is a Python library for rich text and beautiful formatting in the terminal. 49067 27 2024-10-04
55 BadToBest/EchoMimic Lifelike Audio-Driven Portrait Animations through Editable Landmark Conditioning 2553 27 2024-08-15
56 1Panel-dev/MaxKB 🚀 基于大语言模型和 RAG 的知识库问答系统。开箱即用、模型中立、灵活编排,支持快速嵌入到第三方业务系统。 10507 27 2024-10-04
57 linyqh/NarratoAI 利用AI大模型,一键解说并剪辑视频; Using AI models to automatically provide commentary and edit videos with a single click. 1399 26 2024-10-01
58 GaiZhenbiao/ChuanhuChatGPT GUI for ChatGPT API and many LLMs. Supports agents, file-based QA, GPT finetuning and query with web search. All with a neat UI. 15178 26 2024-09-25
59 eosphoros-ai/DB-GPT AI Native Data App Development framework with AWEL(Agentic Workflow Expression Language) and Agents 13401 25 2024-09-27
60 Sinaptik-AI/pandas-ai Chat with your database (SQL, CSV, pandas, polars, mongodb, noSQL, etc). PandasAI makes data analysis conversational using LLMs (GPT 3.5 / 4, Anthropic, VertexAI) and RAG. 12766 24 2024-09-25
61 xinntao/Real-ESRGAN Real-ESRGAN aims at developing Practical Algorithms for General Image/Video Restoration. 27925 24 2024-08-06
62 TeamWiseFlow/wiseflow Wiseflow is an agile information mining tool that extracts concise messages from various sources such as websites, WeChat official accounts, social platforms, etc. It automatically categorizes and upl ... 4008 24 2024-09-04
63 jianchang512/clone-voice A sound cloning tool with a web interface, using your voice or any sound to record audio / 一个带web界面的声音克隆工具,使用你的音色或任意声音来录制音频 7331 23 2024-08-22
64 OpenBMB/XAgent An Autonomous LLM Agent for Complex Task Solving 8067 23 2024-08-12
65 AiuniAI/Unique3D Official implementation of Unique3D: High-Quality and Efficient 3D Mesh Generation from a Single Image 2943 23 2024-09-18
66 Zeyi-Lin/HivisionIDPhotos ⚡️HivisionIDPhotos: a lightweight and efficient AI ID photos tools. 一个轻量级的AI证件照制作算法。 10570 22 2024-09-28
67 guoyww/AnimateDiff Official implementation of AnimateDiff. 10360 22 2024-07-31
68 Tencent/HunyuanDiT Hunyuan-DiT : A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding 3326 22 2024-08-15
69 OpenMOSS/MOSS An open-source tool-augmented conversational language model from Fudan University 11928 22 2024-07-13
70 netease-youdao/EmotiVoice EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine 7296 22 2024-08-13
71 dataelement/bisheng BISHENG is an open LLM devops platform for next generation Enterprise AI applications. Powerful and comprehensive features include: GenAI workflow, RAG, Agent, Unified model management, Evaluation, SF ... 8639 21 2024-09-30
72 vvbbnn00/WARP-Clash-API 该项目可以让你通过订阅的方式使用Cloudflare WARP+,自动获取流量。This project enables you to use Cloudflare WARP+ through subscription, automatically acquiring traffic. 8445 21 2024-09-04
73 Kanaries/pygwalker PyGWalker: Turn your pandas dataframe into an interactive UI for visual analysis 12823 21 2024-10-02
74 THUDM/LongWriter LongWriter: Unleashing 10,000+ Word Generation from Long Context LLMs 1154 21 2024-09-27
75 modelscope/DiffSynth-Studio Enjoy the magic of Diffusion models! 6397 21 2024-09-30
76 myshell-ai/MeloTTS High-quality multi-lingual text-to-speech library by MyShell.ai. Support English, Spanish, French, Chinese, Japanese and Korean. 4546 20 2024-08-09
77 microsoft/DeepSpeed DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective. 34980 20 2024-10-04
78 modelscope/agentscope Start building LLM-empowered multi-agent applications in an easier way. 4948 19 2024-09-30
79 deepseek-ai/DeepSeek-Coder DeepSeek Coder: Let the Code Write Itself 6619 19 2024-05-21
80 jzhang38/TinyLlama The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens. 7709 19 2024-05-03
81 ageitgey/face_recognition The world's simplest facial recognition api for Python and the command line 53030 19 2024-08-21
82 RUCAIBox/LLMSurvey The official GitHub page for the survey paper "A Survey of Large Language Models". 10123 18 2024-08-20
83 xinsir6/ControlNetPlus ControlNet++: All-in-one ControlNet for image generations and editing! 1673 18 2024-09-30
84 facebookresearch/nougat Implementation of Nougat Neural Optical Understanding for Academic Documents 8833 18 2024-04-16
85 OpenGVLab/InternVL [CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型 5649 18 2024-09-19
86 OpenInterpreter/01 The #1 open-source voice interface for desktop, mobile, and ESP32 chips. 4917 18 2024-10-02
87 3b1b/manim Animation engine for explanatory math videos 62941 18 2024-10-04
88 fishaudio/Bert-VITS2 vits2 backbone with multilingual-bert 7854 18 2024-10-01
89 buaacyw/MeshAnything From anything to mesh like human artists. Official impl. of "MeshAnything: Artist-Created Mesh Generation with Autoregressive Transformers" 1989 18 2024-08-05
90 THUDM/CodeGeeX2 CodeGeeX2: A More Powerful Multilingual Code Generation Model 7619 17 2024-07-10
91 FlagOpen/FlagEmbedding Retrieval and Retrieval-augmented LLMs 7021 16 2024-09-26
92 ymcui/Chinese-LLaMA-Alpaca-2 中文LLaMA-2 & Alpaca-2大模型二期项目 + 64K超长上下文模型 (Chinese LLaMA-2 & Alpaca-2 LLMs with 64K long context models) 7063 16 2024-09-23
93 dyang886/Game-Cheats-Manager Easily download and manage game cheats for your convenience 4447 16 2024-09-04
94 marimo-team/marimo A reactive notebook for Python — run reproducible experiments, execute as a script, deploy as an app, and version with git. 6607 16 2024-10-04
95 yixiu001/serv00-login 同时支持serv00与ct8自动化批量保号,每3天自动登录一次面板,并且发送消息到Telegram 1437 15 2024-07-19
96 VisionRush/DeepFakeDefenders Image forgery recognition algorithm 519 15 2024-09-09
97 voicepaw/so-vits-svc-fork so-vits-svc fork with realtime support, improved interface and more features. 8712 15 2024-09-30
98 gradio-app/gradio Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work! 32506 15 2024-10-04
99 THUDM/CogVLM a state-of-the-art-level open visual language model 多模态预训练模型 5918 15 2024-05-29
100 OptimalScale/LMFlow An Extensible Toolkit for Finetuning and Inference of Large Foundation Models. Large Models for All. 8231 15 2024-09-30
101 honmashironeko/ProxyCat 一款部署于云端或本地的代理池中间件,可将静态代理IP灵活运用成隧道IP,提供固定请求地址,一次部署终身使用 668 15 2024-09-30
102 6drf21e/ChatTTS_colab 🚀 一键部署(含离线整合包)!基于 ChatTTS ,支持流式输出、音色抽卡、长音频生成和分角色朗读。简单易用,无需复杂安装。 1960 15 2024-07-02
103 BlinkDL/ChatRWKV ChatRWKV is like ChatGPT but powered by RWKV (100% RNN) language model, and open source. 9393 15 2024-07-11
104 THUDM/CogVLM2 GPT4V-level open-source multi-modal model based on Llama3-8B 2025 14 2024-09-03
105 InternLM/InternLM Official release of InternLM2.5 base and chat models. 1M context support 6291 14 2024-09-06
106 THUDM/CodeGeeX4 CodeGeeX4-ALL-9B, a versatile model for all AI software development scenarios, including code completion, code interpreter, web search, function calling, repository-level Q&A and much more. 1285 14 2024-08-25
107 HZJQF/help_tool 推理算法助手(降维打击) 500 14 2024-10-01
108 sMythicalBird/ZenlessZoneZero-Auto 绝区零 ZenlessZoneZero 零号空洞 自动战斗 自动化 图片分类 OCR识别 1153 14 2024-10-03
109 xxlong0/Wonder3D Single Image to 3D using Cross-Domain Diffusion for 3D Generation 4711 13 2024-08-29
110 open-mmlab/mmdetection OpenMMLab Detection Toolbox and Benchmark 29229 13 2024-08-21
111 jxxghp/MoviePilot NAS媒体库自动化管理工具 6301 13 2024-10-02
112 WZMIAOMIAO/deep-learning-for-image-processing deep learning for image processing including classification and object-detection etc. 22585 13 2024-07-25
113 xaoyaoo/PyWxDump 获取微信信息;读取数据库,本地查看聊天记录并导出为csv、html等格式用于AI训练,自动回复等。支持多账户信息获取,支持所有微信版本。 5441 13 2024-10-03
114 PeterH0323/Streamer-Sales Streamer-Sales 销冠 —— 卖货主播 LLM 大模型🛒🎁,一个能够根据给定的商品特点从激发用户购买意愿角度出发进行商品解说的卖货主播大模型。🚀⭐内含详细的数据生成流程❗ 📦另外还集成了 LMDeploy 加速推理🚀、RAG检索增强生成 📚、TTS文字转语音🔊、数字人生成 🦸、 Agent 使用网络查询实时信息🌐、ASR 语音转文字🎙️、Vue 生态搭建前端🍍、FastAPI 搭建后端 ... 2410 13 2024-09-29
115 521xueweihan/GitHub520 😘 让你“爱”上 GitHub,解决访问时图裂、加载慢的问题。(无需安装) 21180 13 2024-10-04
116 TMElyralab/MuseTalk MuseTalk: Real-Time High Quality Lip Synchorization with Latent Space Inpainting 2512 13 2024-09-23
117 QwenLM/Qwen-VL The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud. 4885 12 2024-08-07
118 llmware-ai/llmware Unified framework for building enterprise RAG pipelines with small, specialized models 4620 12 2024-10-04
119 YaoFANGUK/video-subtitle-remover 基于AI的图片/视频硬字幕去除、文本水印去除,无损分辨率生成去字幕、去水印后的图片/视频文件。无需申请第三方API,本地实现。AI-based tool for removing hard-coded subtitles and text-like watermarks from videos or Pictures. 4065 12 2024-09-30
120 wenge-research/YAYI2 YAYI 2 是中科闻歌研发的新一代开源大语言模型,采用了超过 2 万亿 Tokens 的高质量、多语言语料进行预训练。(Repo for YaYi 2 Chinese LLMs) 3609 12 2024-04-07
121 fufankeji/MateGen Next-Generation Interactive Intelligent Programming Assistant 1005 12 2024-09-20
122 aigc-apps/sd-webui-EasyPhoto 📷 EasyPhoto Your Smart AI Photo Generator. 4920 12 2024-07-10
123 lipku/metahuman-stream Real time interactive streaming digital human 3554 12 2024-09-21
124 barry-far/V2ray-Configs 🛰️✨ Free V2ray Configs , Updating Every 10 minutes. 4488 12 2024-10-04
125 TMElyralab/MuseV MuseV: Infinite-length and High Fidelity Virtual Human Video Generation with Visual Conditioned Parallel Denoising 2361 12 2024-06-28
126 RayVentura/ShortGPT 🚀🎬 ShortGPT - Experimental AI framework for youtube shorts / tiktok channel automation 5629 12 2024-09-19
127 MustardChef/WSABuilds Run Windows Subsystem For Android on your Windows 10 and Windows 11 PC using prebuilt binaries with Google Play Store (MindTheGapps) and/or Magisk or KernelSU (root solutions) built in. 7822 12 2024-08-16
128 taosdata/TDengine High-performance, scalable time-series database designed for Industrial IoT (IIoT) scenarios 23285 12 2024-10-02
129 moesnow/March7thAssistant 崩坏:星穹铁道全自动 三月七小助手 4963 12 2024-09-28
130 gusye1234/nano-graphrag A simple, easy-to-hack GraphRAG implementation 858 12 2024-10-01
131 baichuan-inc/Baichuan-7B A large-scale 7B pretraining language model developed by BaiChuan-Inc. 5670 12 2024-07-18
132 aixcoder-plugin/aiXcoder-7B official repository of aiXcoder-7B Code Large Language Model 2194 12 2024-08-29
133 luosiallen/latent-consistency-model Latent Consistency Models: Synthesizing High-Resolution Images with Few-Step Inference 4304 12 2024-06-14
134 Langboat/Mengzi3 - 2032 11 2024-06-28
135 linyiLYi/street-fighter-ai This is an AI agent for Street Fighter II Champion Edition. 6316 11 2024-05-14
136 tyxsspa/AnyText Official implementation code of the paper <AnyText: Multilingual Visual Text Generation And Editing> 4245 11 2024-06-21
137 THUDM/CodeGeeX CodeGeeX: An Open Multilingual Code Generation Model (KDD 2023) 8154 11 2024-08-13
138 BlinkDL/RWKV-LM RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best of RNN and transformer - great performance, fast inference, sa ... 12470 11 2024-09-23
139 X-PLUG/MobileAgent Mobile-Agent: The Powerful Mobile Device Operation Assistant Family 2776 11 2024-09-26
140 thuml/Time-Series-Library A Library for Advanced Deep Time Series Models. 6500 11 2024-09-29
141 Alpha-VLLM/Lumina-T2X Lumina-T2X is a unified framework for Text to Any Modality Generation 2038 11 2024-08-06
142 QwenLM/Qwen2-Audio The official repo of Qwen2-Audio chat & pretrained large audio language model proposed by Alibaba Cloud. 1124 11 2024-08-13
143 FujiwaraChoki/MoneyPrinterV2 Automate the process of making money online. 2366 10 2024-04-17
144 yihong0618/xiaogpt Play ChatGPT and other LLM with Xiaomi AI Speaker 6148 10 2024-09-22
145 yangjianxin1/Firefly Firefly: 大模型训练工具,支持训练Qwen2.5、Qwen2、Yi1.5、Phi-3、Llama3、Gemma、MiniCPM、Yi、Deepseek、Orion、Xverse、Mixtral-8x7B、Zephyr、Mistral、Baichuan2、Llma2、Llama、Qwen、Baichuan、ChatGLM2、InternLM、Ziya2、Vicuna、Bloom等大模型 5692 10 2024-09-19
146 ihmily/DouyinLiveRecorder 可循环值守和多人录制的直播录制软件,支持抖音、TikTok、快手、虎牙、斗鱼、B站、小红书、pandatv、afreecatv、flextv、popkontv、twitcasting、winktv、百度、微博、酷狗、花椒、Twitch、Acfun、CHZZK等平台直播录制 4488 10 2024-10-04
147 lanqian528/chat2api A service that can convert ChatGPT on the web to OpenAI API format. 1889 10 2024-09-23
148 yl4579/StyleTTS2 StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models 4797 10 2024-08-10
149 reorx/awesome-chatgpt-api Curated list of apps and tools that not only use the new ChatGPT API, but also allow users to configure their own API keys, enabling free and on-demand usage of their own quota. 5903 10 2024-09-26
150 cubiq/ComfyUI_IPAdapter_plus - 3925 10 2024-09-13
151 ViggoZ/producthunt-daily-hot 自动生成每日Product Hunt热门产品中文榜单,基于GitHub Actions自动提交Markdown文件 591 10 2024-10-04
152 infrost/DeeplxFile 基于Deeplx和Playwright提供的简单易用,快速,免费,不限制文件大小,支持超长文本翻译,跨平台的文件翻译工具 / Easy-to-use, fast, free, unlimited file size and cross platform file translation tool based on Deeplx & Playwright that supports long tex ... 533 10 2024-09-09
153 xorbitsai/inference Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you need. With Xinference, you're empowered to run inference with any ... 4986 10 2024-09-30
154 modelscope/modelscope ModelScope: bring the notion of Model-as-a-Service to life. 6869 9 2024-10-02
155 google-deepmind/penzai A JAX research toolkit for building, editing, and visualizing neural networks. 1651 9 2024-09-11
156 modelscope/ms-swift Use PEFT or Full-parameter to finetune 350+ LLMs or 90+ MLLMs. (LLM: Qwen2.5, Llama3.2, GLM4, Internlm2.5, Yi1.5, Mistral, Baichuan2, DeepSeek, Gemma2, ...; MLLM: Qwen2-VL, Qwen2-Audio, Llama3.2-Visi ... 3682 9 2024-10-04
157 THUDM/CogVideo text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023) 7847 9 2024-10-04
158 InternLM/lmdeploy LMDeploy is a toolkit for compressing, deploying, and serving LLMs. 4342 9 2024-09-28
159 hankcs/HanLP Natural Language Processing for the next decade. Tokenization, Part-of-Speech Tagging, Named Entity Recognition, Syntactic & Semantic Dependency Parsing, Document Classification 33629 9 2024-09-08
160 recommenders-team/recommenders Best Practices on Recommendation Systems 18893 9 2024-09-29
161 mli/autocut 用文本编辑器剪视频 6604 9 2024-04-16
162 PaddlePaddle/PaddleNLP 👑 Easy-to-use and powerful NLP and LLM library with 🤗 Awesome model zoo, supporting wide-range of NLP tasks from research to industrial applications, including 🗂Text Classification, 🔍 Neural Search, ... 12004 9 2024-09-30
163 modelscope/FunASR A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc. 6182 9 2024-09-30
164 xlang-ai/OpenAgents [COLM 2024] OpenAgents: An Open Platform for Language Agents in the Wild 3930 9 2024-07-08
165 QwenLM/Qwen-Agent Agent framework and applications built upon Qwen>=2.0, featuring Function Calling, Code Interpreter, RAG, and Chrome extension. 3233 9 2024-09-25
166 LinYuanovo/pikpak_auto_invite PikPak自动邀请程序,附带图像识别过验证码,支持本地及GitHub Actions云端运行 1058 9 2024-07-04
167 CVHub520/X-AnyLabeling Effortless data labeling with AI support from Segment Anything and other awesome models. 3829 8 2024-10-02
168 kohya-ss/sd-scripts - 5029 8 2024-10-04
169 jianchang512/stt Voice Recognition to Text Tool / 一个离线运行的本地语音识别转文字服务,输出json、srt字幕带时间戳、纯文字格式 2210 8 2024-09-23
170 bilibili/Index-1.9B A SOTA lightweight multilingual LLM 877 8 2024-09-20
171 tgbot-collection/YYeTsBot 🎬 人人影视 机器人和网站,包含人人影视全部资源以及众多网友的网盘分享 14181 8 2024-07-21
172 AutoGPTQ/AutoGPTQ An easy-to-use LLMs quantization package with user-friendly apis, based on GPTQ algorithm. 4376 8 2024-09-28
173 THUDM/VisualGLM-6B Chinese and English multimodal conversational language model 多模态中英双语对话语言模型 4077 8 2024-08-23
174 pkuliyi2015/multidiffusion-upscaler-for-automatic1111 Tiled Diffusion and VAE optimize, licensed under CC BY-NC-SA 4.0 4726 8 2024-08-07
175 open-compass/opencompass OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, etc) over 100+ datasets. 3832 8 2024-10-02
176 Plachtaa/VITS-fast-fine-tuning This repo is a pipeline of VITS finetuning for fast speaker adaptation TTS, and many-to-many voice conversion 4704 8 2024-07-03
177 MzeroMiko/VMamba VMamba: Visual State Space Models,code is based on mamba 2071 8 2024-09-25
178 EstrellaXD/Auto_Bangumi AutoBangumi - 全自动追番工具 6732 8 2024-09-26
179 zhulu111/ComfyUI_Bxb SD变现宝:一键把comfyui工作流转换成小程序。 1065 8 2024-10-03
180 XPixelGroup/DiffBIR Official codes of DiffBIR: Towards Blind Image Restoration with Generative Diffusion Prior 3290 8 2024-07-03
181 hitsz-ids/synthetic-data-generator SDG is a specialized framework designed to generate high-quality structured tabular data. 3263 8 2024-09-13
182 sml2h3/ddddocr 带带弟弟 通用验证码识别OCR pypi版 9740 8 2024-07-25
183 madawei2699/myGPTReader A community-driven way to read and chat with AI bots - powered by chatGPT. 4424 8 2024-04-25
184 QiuChenly/InjectLib 你知道我要说什么 929 8 2024-09-28
185 ok-oldking/ok-wuthering-waves 鸣潮 后台自动战斗 自动刷声骸上锁合成 Automation for Wuthering Waves 1002 8 2024-10-04
186 z1069614715/objectdetection_script 一些关于目标检测的脚本的改进思路代码,详细请看readme.md 5162 8 2024-10-01
187 InternLM/xtuner An efficient, flexible and full-featured toolkit for fine-tuning LLM (InternLM2, Llama3, Phi3, Qwen, Mistral, ...) 3805 8 2024-09-29
188 Evil0ctal/Douyin_TikTok_Download_API 🚀「Douyin_TikTok_Download_API」是一个开箱即用的高性能异步抖音、快手、TikTok、Bilibili数据爬取工具,支持API调用,在线批量解析及下载。 8833 8 2024-09-26
189 fxsjy/jieba 结巴中文分词 33159 8 2024-08-21
190 DennisThink/awesome_twitter_CN 值得关注的中文twitter用户 593 7 2024-09-26
191 continue-revolution/sd-webui-animatediff AnimateDiff for AUTOMATIC1111 Stable Diffusion WebUI 3064 7 2024-09-22
192 aigc-apps/EasyAnimate 📺 An End-to-End Solution for High-Resolution and Long Video Generation Based on Transformer Diffusion 1210 7 2024-08-22
193 xingpingcn/enhanced-FaaS-in-China 提升部署在cloudflare、vercel或netlify的网页在中国的访问速度和稳定性 Improve the access speed and stability in China of web pages hosted on cloudflare, vercel or netlify by merely changing your CNAME record. cf优选域名 cf优选ip ... 1511 7 2024-10-04
194 om-ai-lab/OmAgent A multimodal agent framework for solving complex tasks [EMNLP'2024] 696 7 2024-10-01
195 malinkang/weread2notion-pro - 2018 7 2024-10-04
196 modelscope/FunClip Open-source, accurate and easy-to-use video speech recognition & clipping tool, LLM based AI clipping intergrated. 3369 7 2024-08-22
197 PKU-YuanGroup/MoE-LLaVA Mixture-of-Experts for Large Vision-Language Models 1932 7 2024-05-15
198 InternLM/InternLM-XComposer InternLM-XComposer-2.5: A Versatile Large Vision Language Model Supporting Long-Contextual Input and Output 2468 7 2024-08-30
199 DachunKai/EvTexture [ICML 2024] EvTexture: Event-driven Texture Enhancement for Video Super-Resolution 990 7 2024-09-17
200 sqlmapproject/sqlmap Automatic SQL injection and database takeover tool 32143 7 2024-09-25

↓ -- 感谢读者 -- ↓

榜单持续更新,如有帮助请加星收藏,方便后续浏览,感谢你的支持!