Get desktop application:
View/edit binary Protocol Buffers messages
Inference Server GRPC endpoints.
The ServerLive API indicates if the inference server is able to receive and respond to metadata and inference requests.
(message has no fields)
True if the inference server is live, false if not live.
The ServerReady API indicates if the server is ready for inferencing.
(message has no fields)
True if the inference server is ready, false if not ready.
The ModelReady API indicates if a specific model is ready for inferencing.
The name of the model to check for readiness.
The version of the model to check for readiness. If not given the server will choose a version based on the model and internal policy.
True if the model is ready, false if not ready.
The ServerMetadata API provides information about the server. Errors are indicated by the google.rpc.Status returned for the request. The OK code indicates success and other codes indicate failure.
(message has no fields)
The server name.
The server version.
The extensions supported by the server.
The per-model metadata API provides information about a model. Errors are indicated by the google.rpc.Status returned for the request. The OK code indicates success and other codes indicate failure.
The name of the model.
The version of the model to check for readiness. If not given the server will choose a version based on the model and internal policy.
The model name.
The versions of the model available on the server.
The model's platform. See Platforms.
The model's inputs.
The model's outputs.
The ModelInfer API performs inference using the specified model. Errors are indicated by the google.rpc.Status returned for the request. The OK code indicates success and other codes indicate failure.
The name of the model to use for inferencing.
The version of the model to use for inference. If not given the server will choose a version based on the model and internal policy.
Optional identifier for the request. If specified will be returned in the response.
Optional inference parameters.
The input tensors for the inference.
The requested output tensors for the inference. Optional, if not specified all outputs produced by the model will be returned.
The data contained in an input tensor can be represented in "raw" bytes form or in the repeated type that matches the tensor's data type. To use the raw representation 'raw_input_contents' must be initialized with data for each tensor in the same order as 'inputs'. For each tensor, the size of this content must match what is expected by the tensor's shape and data type. The raw data must be the flattened, one-dimensional, row-major order of the tensor elements without any stride or padding between the elements. Note that the FP16 data type must be represented as raw content as there is no specific data type for a 16-bit float type. If this field is specified then InferInputTensor::contents must not be specified for any input tensor.
The name of the model used for inference.
The version of the model used for inference.
The id of the inference request if one was specified.
Optional inference response parameters.
The output tensors holding inference results.
The data contained in an output tensor can be represented in "raw" bytes form or in the repeated type that matches the tensor's data type. To use the raw representation 'raw_output_contents' must be initialized with data for each tensor in the same order as 'outputs'. For each tensor, the size of this content must match what is expected by the tensor's shape and data type. The raw data must be the flattened, one-dimensional, row-major order of the tensor elements without any stride or padding between the elements. Note that the FP16 data type must be represented as raw content as there is no specific data type for a 16-bit float type. If this field is specified then InferOutputTensor::contents must not be specified for any output tensor.
An inference parameter value. The Parameters message describes a “name”/”value” pair, where the “name” is the name of the parameter and the “value” is a boolean, integer, or string corresponding to the parameter.
Used in:
, , , ,The parameter value can be a string, an int64, a boolean or a message specific to a predefined parameter.
A boolean parameter value.
An int64 parameter value.
A string parameter value.
The data contained in a tensor represented by the repeated type that matches the tensor's data type. Protobuf oneof is not used because oneofs cannot contain repeated fields.
Used in:
,Representation for BOOL data type. The size must match what is expected by the tensor's shape. The contents must be the flattened, one-dimensional, row-major order of the tensor elements.
Representation for INT8, INT16, and INT32 data types. The size must match what is expected by the tensor's shape. The contents must be the flattened, one-dimensional, row-major order of the tensor elements.
Representation for INT64 data types. The size must match what is expected by the tensor's shape. The contents must be the flattened, one-dimensional, row-major order of the tensor elements.
Representation for UINT8, UINT16, and UINT32 data types. The size must match what is expected by the tensor's shape. The contents must be the flattened, one-dimensional, row-major order of the tensor elements.
Representation for UINT64 data types. The size must match what is expected by the tensor's shape. The contents must be the flattened, one-dimensional, row-major order of the tensor elements.
Representation for FP32 data type. The size must match what is expected by the tensor's shape. The contents must be the flattened, one-dimensional, row-major order of the tensor elements.
Representation for FP64 data type. The size must match what is expected by the tensor's shape. The contents must be the flattened, one-dimensional, row-major order of the tensor elements.
Representation for BYTES data type. The size must match what is expected by the tensor's shape. The contents must be the flattened, one-dimensional, row-major order of the tensor elements.
An input tensor for an inference request.
Used in:
The tensor name.
The tensor data type.
The tensor shape.
Optional inference input tensor parameters.
The tensor contents using a data-type format. This field must not be specified if "raw" tensor contents are being used for the inference request.
An output tensor requested for an inference request.
Used in:
The tensor name.
Optional requested output tensor parameters.
An output tensor returned for an inference request.
Used in:
The tensor name.
The tensor data type.
The tensor shape.
Optional output tensor parameters.
The tensor contents using a data-type format. This field must not be specified if "raw" tensor contents are being used for the inference response.
Metadata for a tensor.
Used in:
The tensor name.
The tensor data type.
The tensor shape. A variable-size dimension is represented by a -1 value.