Triton RuntimeStatus.MethodInfos is missing ModelStreamInfer

Triton provides an extension to the standard gRPC inference api for streaming (`inference.GRPCInferenceService/ModelStreamInfer`), this extension is required to use vLLM backend with triton.
However currently the triton runtime adapter does not advertise the existence of this gRPC method and trying to call it results in an error (`inference.GRPCInferenceService/ModelStreamInfer: UNIMPLEMENTED: Method not found or not permitted: inference.GRPCInferenceService/ModelStreamInfer`)

To resolve this issue, I think the ModelStreamInfer method must be added here:
https://github.com/kserve/modelmesh-runtime-adapter/blob/f9781d287d31ec40c7c3eb77d5ac12eb68622aaa/model-mesh-triton-adapter/server/server.go#L267-L269

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Triton RuntimeStatus.MethodInfos is missing ModelStreamInfer #80

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

	mis := make(map[string]*mmesh.RuntimeStatusResponse_MethodInfo)
	mis[tritonServiceName+"/ModelInfer"] = &mmesh.RuntimeStatusResponse_MethodInfo{IdInjectionPath: path1}
	mis[tritonServiceName+"/ModelMetadata"] = &mmesh.RuntimeStatusResponse_MethodInfo{IdInjectionPath: path1}

Triton RuntimeStatus.MethodInfos is missing ModelStreamInfer #80

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions