Provides detailed information about the default version of a model.
Name of model to describe.
required
Version of model to describe.
optional
List registered models in TorchServe.
Use this parameter to specify the maximum number of items to return. When this value is present, TorchServe does not return more than the specified number of items, but it might return fewer. This value is optional. If you include a value, it must be between 1 and 1000, inclusive. If you do not include a value, it defaults to 100.
optional
The token to retrieve the next set of results. TorchServe provides the token when the response from a previous call has more results than the maximum page size.
optional
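The paging behaviour described above can be sketched as a loop that keeps requesting pages until no token comes back. This is a minimal sketch, not TorchServe client code: `fetch_page` is a hypothetical callable standing in for whatever client call issues the list request, and `make_fake_fetcher` is an in-memory stand-in used only to exercise the loop. The loop stops on a missing token rather than on a short page, because a page may hold fewer items than the limit.

```python
from typing import Callable, Iterator, List, Optional, Tuple

# Hypothetical page fetcher: takes (limit, next_page_token) and returns
# (items, next_page_token). A None or empty token means no more pages.
PageFetcher = Callable[[int, Optional[str]], Tuple[List[str], Optional[str]]]


def iter_models(fetch_page: PageFetcher, limit: int = 100) -> Iterator[str]:
    """Yield every item by following page tokens until none is returned."""
    if not 1 <= limit <= 1000:
        raise ValueError("limit must be between 1 and 1000, inclusive")
    token: Optional[str] = None
    while True:
        items, token = fetch_page(limit, token)
        yield from items
        if not token:
            break


def make_fake_fetcher(names: List[str]) -> PageFetcher:
    """In-memory stand-in for the server (assumption, for illustration only)."""
    def fetch(limit: int, token: Optional[str]) -> Tuple[List[str], Optional[str]]:
        start = int(token) if token else 0
        end = start + limit
        next_token = str(end) if end < len(names) else None
        return names[start:end], next_token
    return fetch
```

Because `iter_models` is a generator, the limit check runs when iteration starts, not when the function is called.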
Register a new model in TorchServe.
Inference batch size, default: 1.
optional
Inference handler entry-point. This value will override handler in MANIFEST.json if present.
optional
Number of initial workers, default: 0.
optional
Maximum delay for batch aggregation, in milliseconds, default: 100.
optional
Name of model. This value will override modelName in MANIFEST.json if present.
optional
Maximum time, in seconds, that TorchServe waits for a response from the model inference code, default: 120.
optional
Runtime for the model custom service code. This value will override runtime in MANIFEST.json if present.
optional
Decides whether worker creation is synchronous or not, default: false.
optional
Model archive download URL; supports a local file or the HTTP(S) protocol.
required
Decides whether S3 SSE KMS enabled or not, default: false.
optional
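The defaults listed above can be collected in one place with a small dataclass. This is a sketch under stated assumptions: the class name `RegisterModelParams`, its field names, and the `to_fields` helper are illustrative and are not taken from the generated TorchServe stubs; only the default values and the required/optional split come from the descriptions above.

```python
from dataclasses import asdict, dataclass
from typing import Any, Dict, Optional


@dataclass
class RegisterModelParams:
    # Field names are illustrative assumptions; defaults follow the text above.
    url: str                            # required: model archive location
    model_name: Optional[str] = None    # overrides modelName in MANIFEST.json
    handler: Optional[str] = None       # overrides handler in MANIFEST.json
    runtime: Optional[str] = None       # overrides runtime in MANIFEST.json
    batch_size: int = 1
    max_batch_delay: int = 100
    initial_workers: int = 0
    response_timeout: int = 120         # seconds
    synchronous: bool = False
    s3_sse_kms: bool = False

    def to_fields(self) -> Dict[str, Any]:
        """Return the request fields, dropping overrides that were left unset."""
        if not self.url:
            raise ValueError("url is required")
        return {k: v for k, v in asdict(self).items() if v is not None}
```

Leaving the three override fields as `None` keeps them out of the request, so the values in MANIFEST.json stay in effect.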
Configure the number of workers for the default version of a model. This is an asynchronous call by default. The caller needs to call describeModel to check whether the model's workers have changed.
Name of model to scale workers.
required
Model version.
optional
Maximum number of worker processes.
optional
Minimum number of worker processes.
optional
Number of GPU worker processes to create.
optional
Decides whether the call is synchronous or not, default: false.
optional
Time to wait, if necessary, for a worker to complete all pending requests. Use 0 to terminate the backend worker process immediately. Use -1 to wait indefinitely.
optional
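Since scaling is asynchronous by default, a caller typically polls the model description until the worker count settles. The following is a minimal sketch of that polling loop, assuming a hypothetical `count_workers` callable that wraps the describe call and extracts the current number of workers; the function name, the retry parameters, and the injectable `sleep` are all illustrative choices, not part of the API.

```python
import time
from typing import Callable


def wait_for_workers(
    count_workers: Callable[[], int],   # hypothetical wrapper around describeModel
    min_worker: int,
    max_worker: int,
    attempts: int = 10,
    delay_s: float = 0.5,
    sleep: Callable[[float], None] = time.sleep,
) -> bool:
    """Poll until the worker count lies in [min_worker, max_worker].

    Returns True once the count is in range, False if attempts run out.
    """
    if min_worker > max_worker:
        raise ValueError("min_worker must not exceed max_worker")
    for attempt in range(attempts):
        if min_worker <= count_workers() <= max_worker:
            return True
        if attempt < attempts - 1:
            sleep(delay_s)  # back off before the next describe call
    return False
```

Passing `sleep` as a parameter lets the loop be exercised without real delays; setting the synchronous flag on the request, where that is acceptable, avoids the need for polling altogether.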
Set the default version of a model.
Name of model whose default version needs to be updated.
required
Version of model to be set as the default version for the model.
required
Unregister the default version of a model from TorchServe if it is the only version available. This is an asynchronous call by default. The caller can call listModels to confirm that the model is unregistered.
Name of model to unregister.
required
Version of model to unregister.
optional
Used as response type in: ManagementAPIsService.DescribeModel, ManagementAPIsService.ListModels, ManagementAPIsService.RegisterModel, ManagementAPIsService.ScaleWorker, ManagementAPIsService.SetDefault, ManagementAPIsService.UnregisterModel
Response string of different management API calls.