You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Add a command line flag remote_predict_op_use_tensor_content to reduce request size (commit: 1d1a029)
changing Orbax Model version file uniquely identifiable (commit: a01a31a)
Plumb priority into RunHandlerPool (commit: 4184bff)
Fix a check failure in ServerCore. (commit: 223ac59)
Add max_cache_length to PredictRequest.RequestOptions. (commit: 6db34f8)
Add max_cache_length to PredictRequest.RequestOptions. (commit: 6d2fd08)
Add max_cache_length to PredictRequest.RequestOptions. (commit: 1c88766)
Use std::numeric_limits for max int32 value. (commit: a33d9da)
Removal of tsl-specific integral types. (commit: d1f407f)
Add a new sampling mode to infra logging. (commit: b05d65f)
TF Serving Dockerfiles upgraded for a hermetic build. MKL stays non-hermetic at this time because of strict linker checks in llvm_openmp. (commit: ecb468f)
Fixed Serving server startup in docker environment (commit: 275ec5c)
Optimized disk space usage in Dockerfile.devel-gpu (commit: 824ad7a)
Add option to allow resplitting for priority aware scheduler. (commit: 9baa5a2)
add mutable version which can save a copy (commit: a5f6f0b)
Update version for 2.20.0 release. (#4135) (commit: bc7e9d2)