Skip to content

Commit 74e29e3

Browse files
committed
Update
1 parent feab9ba commit 74e29e3

File tree

1 file changed

+7
-5
lines changed

1 file changed

+7
-5
lines changed

README.md

Lines changed: 7 additions & 5 deletions
Original file line numberDiff line numberDiff line change
@@ -2,15 +2,17 @@
22

33
## Introduction
44

5-
CogKit is an open-source project that provides a user-friendly interface for researchers and developers to utilize ZhipuAI's [**CogView**](https://huggingface.co/collections/THUDM/cogview-67ac3f241eefad2af015669b) (image generation) and [**CogVideoX**](https://huggingface.co/collections/THUDM/cogvideo-66c08e62f1685a3ade464cce) (video generation) models. It streamlines multimodal tasks such as **text-to-image (T2I)**, **text-to-video (T2V)**, and **image-to-video (I2V)**. Users must comply with legal and ethical guidelines to ensure responsible implementation.
5+
CogKit is an open-source project that provides a user-friendly interface for researchers and developers to utilize models from ZhipuAI, currently supports [CogView](https://huggingface.co/collections/THUDM/cogview-67ac3f241eefad2af015669b) (image generation) and [CogVideoX](https://huggingface.co/collections/THUDM/cogvideo-66c08e62f1685a3ade464cce) (video generation) series. Users must comply with legal and ethical guidelines to ensure responsible implementation.
66

7-
Visit our [**Docs**](https://thudm.github.io/CogKit) to start.
7+
Visit our [Docs](https://thudm.github.io/CogKit) to start.
88

99
## Features
1010

11-
- **Fine-tuning Methods**: Supports **LoRA** and **full-parameter fine-tuning** across various setups, including **single-machine single-GPU**, **single-machine multi-GPU**, and **multi-machine multi-GPU** configurations.
12-
- **Inference**: Provides an **OpenAI-style API** (T2I Only) and a **command-line interface** for seamless model deployment.
13-
- **Embed Cache**: Optimizes GPU memory usage to enhance efficiency during inference.
11+
- Training Optimization: Includes pre-computation and caching of latents and embeddings, sequence packing, and various memory-efficient strategies to improve training throughput and reduce GPU memory usage.
12+
13+
- Native Resolution Training Support: Seamlessly train models at original image resolutions for improved quality and consistency.
14+
15+
- Easy-to-use Interface: Offers multiple easy-to-use inference options, including a CLI, OpenAI-compatible API server, and interactive Gradio-based UIs for both training and inference.
1416

1517
## Roadmap
1618

0 commit comments

Comments
 (0)