| license | mit |
|---|
StyQA is a unified quasi-agent framework for versatile style transfer tasks (pixel-level, semantic-level and continuous style transfer, etc.). This agent is attempted to be prompted with style transfer pipeline and normal style transfer instructions.
Typically, StyQA will conduct style-analysis
StyQA relies on uv and we provided pyproject.toml in this repo. Note that the embedded base model relies on the newest diffusers which should be installed with:
uv pip install git+https://github.com/huggingface/diffusersComputational Resource: Make sure your device satisfies the computation requirements:
- The model with
bfloat16will comsume about 52G. If LoRA is used, each LoRA module consumes about 1G. So for better inference with at least$1024\times 1024$ pictures, $\geq 80$G is required.
main.py provides demos for continuous style transfer:
@hydra.main(version_base="v1.2", config_path="configs", config_name="agent")
def main(cfgs: OmegaConf):
prompt = "Convert the pixel colors of Picture 1 into the pixel colors of Picture 2 with strength 0.75 and then transfer the semantic style into Picture 3."
cnt_image_path = "demos/content1.jpg"
ref_image_paths = ["demos/pixel.jpg", "demos/semantic1.jpg"]
agent: StyQA = instantiate(cfgs.agent)
user_input = UserInput(
prompt=prompt,
cnt_image_path=cnt_image_path,
ref_image_paths=ref_image_paths,
)
agent.run_pipeline(user_input)Simply run
python main.pyand you will get the outputs like:
The agent configurations (seed, num_inference_steps, etc.) can be modified in configs/agent.yaml, or you can modify them using bash command thanks to hydra.
python main.py agent.seed=1234 agent.num_inference_steps=16Note that the prompt can points a specific value and style transfer task types explicitly (it is what we recommanded) or with an implicit representation such as
"Using the colors and textures from Picture 2 ... moderate strength ... as if they are in same style category ..."We provide more visualization performances.
Similar to arbitrary image style transfer, Pixel-level style transfer aims to utilize the color and textures features from low level pixels.
Artistic styles can be devided into different categories, semantic-level style transfer aims to re-generate content image using the same style category, which we call style-centric semantic features.
Based on different pipeline commands, StyQA will parse them into workflow and conducts style transfer one-by-one, here we provide somes demos.
- Open-source inference code
- Deliver demos
- Uploads prepared LoRA weights
- Open-source training/fine-tuning code
- More experiments



