
Request for Iterative Generation in Pipeline (e.g., LLaMA model) #33949

Open
qsunyuan opened this issue Oct 4, 2024 · 1 comment
Labels
Feature request Request for a new feature

Comments

qsunyuan commented Oct 4, 2024

Feature request

I would like to ask whether there is a way to perform iterative generation (generating n completions per prompt) within the pipeline, specifically for LLMs such as LLaMA. If this feature is not available, are there any plans to implement it in the future?

Example:

import torch
import transformers

llama_client = transformers.pipeline(
    "text-generation",
    model="meta-llama/Llama-3.1-8B-Instruct",
    model_kwargs={"torch_dtype": torch.bfloat16},
    device_map="auto",
)

# Generate once
outputs = llama_client(
    messages,
    max_new_tokens=max_tokens,
)

# Generate n times (the requested API)
outputs = llama_client(
    messages,
    max_new_tokens=max_tokens,
    n=n,
)

Similar to the OpenAI GPT API:

response = client.chat.completions.create(
    model=model,
    messages=messages,
    max_tokens=max_tokens,
    temperature=temperature,
    n=n,
)

I am also aware that iterative generation can be done with a for loop (sketched below), but I am wondering whether there is a more efficient or optimized way to generate n completions within the pipeline.

https://community.openai.com/t/how-does-n-parameter-work-in-chat-completions/288725
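
For reference, here is a minimal sketch of the two workarounds I am aware of today: the explicit for loop mentioned above, and the num_return_sequences generate kwarg, which (as far as I understand) the text-generation pipeline forwards to generate() and which may partially cover this use case when sampling is enabled. llama_client, messages, max_tokens, and n are the same placeholders as in the example above.

# Workaround 1: a plain for loop, i.e. n independent pipeline calls.
outputs = [
    llama_client(messages, max_new_tokens=max_tokens)
    for _ in range(n)
]

# Workaround 2 (assumption): num_return_sequences is forwarded to generate(),
# so a single sampled call returns n completions for the same prompt.
outputs = llama_client(
    messages,
    max_new_tokens=max_tokens,
    do_sample=True,
    num_return_sequences=n,
)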

Motivation

Build a connection between the OpenAI-style LLM API and the transformers pipeline.

Your contribution

Request

qsunyuan added the Feature request label on Oct 4, 2024
@ArthurZucker
Collaborator

Sounds interesting, let's see if this is asked by the community! We usually check activity here 🚀
cc @Rocketknight1
