RLHF using Microsoft Agent Lightning #1411

@PrestigeDevop

As the title suggests, the Agent Lightning framework introduces RLHF-style reward optimization with AgentFlow, a modular multi-agent framework that combines planner, executor, verifier, and generator agents with the Flow-GRPO algorithm to tackle long-horizon, sparse-reward tasks.

I'm going to assume it would, but the starting point would be that repo.

Second question: is there any API similar to the local ONNX GenAI connector, SemanticKernel.Connectors.OnnxRuntimeGenAI, so that no remote API is required?

Finally, is there any advice for creating a polyglot MCP server with this library in mind?

Thanks.
