add Python interface #817

Picus303 · 2025-01-10T16:25:34Z

This PR allows to use Fish Speech as an importable Python library for easy integration in code.
It adds the sub-directory lib in the main directory and mainly relies on the dedicated Pipeline class.

Here is an example of use:

# Optionnal: remove prints in terminal
import os
from loguru import logger

logger.remove()
os.environ["TQDM_DISABLE"] = "1"

# Prepare the models
from fish_speech.lib import Pipeline

model = Pipeline(
    llama_path = "models/fish-speech-1_5",
    vqgan_path = "models/vqgan-1_5.pth",
)

# Create a reference audio
ref = model.make_reference("ref.wav", "reference text")

# Generate audio (no streaming)
output = model.generate("text to generate.", ref)

# Generate audio (streaming)
import numpy as np

generator = model.generate("text to generate.", ref, streaming=True)

parts = []
for part in generator:
    parts.append(part)
    print(part.shape)

output = np.concatenate(parts, axis=0)

# Save the output to a file
sample_rate = model.sample_rate

import soundfile as sf
sf.write("output.wav", output, sample_rate)

This PR is not complete yet as it's missing:
1 - Documentation. Question: Where do you want to put it?
2 - The code still depends on .project-root to manage paths, making it impossible to install in non-editable mode, forcing the user to keep the source code in a separate folder. I'd be glad if you have suggestions for this part.

for more information, see https://pre-commit.ci

github-actions · 2025-02-12T00:22:38Z

This PR is stale because it has been open for 30 days with no activity.

organics2016 · 2025-04-02T05:27:11Z

我已经尝试了这个PR，interface工作的非常好且设计优雅，希望作者不要放弃这个PR。

还有一些小问题，

当我直接通过pip git方式安装时，
pip install fish-speech@git+https://github.com/Picus303/fish-speech.git --no-cache-dir
调用这个PR的interface后，下面这行代码会因为找不到 ".project-root" 而提示报错。

fish-speech/fish_speech/models/vqgan/inference.py

Line 15 in 3eb2f32

pyrootutils.setup_root(__file__, indicator=".project-root", pythonpath=True)

我的解决方式是fork后注释它，然后一切工作正常。我的方式不是一个好的解决方式，可能会影响其他功能，在这里说明一下，作为参考，以便PR合并时的兼容性工作。

Juanma-t · 2025-06-08T06:18:34Z

Looking to pay for someone to help me setup an enviorment for this and walk me through on how to have it run through cli, I can figure out the rest, im not a programmer and I only use windows, so I need a way to do this whole thing in a linux thingy I can open in windows, and i'll need your help running my current pipeline on it (it's a script with a bunch of google dependencies for imagen4 and gemini)

Picus303 added 4 commits January 10, 2025 11:28

implement library pipeline base

f0819df

add docstrings

a04215d

add generation

18295ab

add chuck length to parameters

f763d1d

Picus303 marked this pull request as draft January 10, 2025 16:25

pre-commit-ci bot and others added 2 commits January 10, 2025 16:26

[pre-commit.ci] auto fixes from pre-commit.com hooks

c7f43b2

for more information, see https://pre-commit.ci

typo in tools/download_models.py

0529fc3

Whale-Dolphin marked this pull request as ready for review January 12, 2025 10:44

Picus303 marked this pull request as draft January 12, 2025 10:45

github-actions bot added the stale label Feb 12, 2025

tarun7r mentioned this pull request Mar 18, 2025

Python API #925

Closed

6 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

add Python interface #817

add Python interface #817

Uh oh!

Picus303 commented Jan 10, 2025

Uh oh!

github-actions bot commented Feb 12, 2025

Uh oh!

organics2016 commented Apr 2, 2025 •

edited

Loading

Uh oh!

Juanma-t commented Jun 8, 2025

Uh oh!

Uh oh!

add Python interface #817

Are you sure you want to change the base?

add Python interface #817

Uh oh!

Conversation

Picus303 commented Jan 10, 2025

Uh oh!

github-actions bot commented Feb 12, 2025

Uh oh!

organics2016 commented Apr 2, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Juanma-t commented Jun 8, 2025

Uh oh!

Uh oh!

organics2016 commented Apr 2, 2025 •

edited

Loading