Get desktop application:
View/edit binary Protocol Buffers messages
Service that implements Google Cloud Video Intelligence API.
Performs asynchronous video annotation. Progress and results can be retrieved through the `google.longrunning.Operations` interface. `Operation.metadata` contains `AnnotateVideoProgress` (progress). `Operation.response` contains `AnnotateVideoResponse` (results).
Video annotation request.
Input video location. Currently, only [Google Cloud Storage](https://cloud.google.com/storage/) URIs are supported, which must be specified in the following format: `gs://bucket-id/object-id` (other URI formats return [google.rpc.Code.INVALID_ARGUMENT][google.rpc.Code.INVALID_ARGUMENT]). For more information, see [Request URIs](/storage/docs/reference-uris). A video URI may include wildcards in `object-id`, and thus identify multiple videos. Supported wildcards: '*' to match 0 or more characters; '?' to match 1 character. If unset, the input video should be embedded in the request as `input_content`. If set, `input_content` should be unset.
The video data bytes. Encoding: base64. If unset, the input video(s) should be specified via `input_uri`. If set, `input_uri` should be unset.
Requested video annotation features.
Additional video context and/or feature-specific parameters.
Optional location where the output (in JSON format) should be stored. Currently, only [Google Cloud Storage](https://cloud.google.com/storage/) URIs are supported, which must be specified in the following format: `gs://bucket-id/object-id` (other URI formats return [google.rpc.Code.INVALID_ARGUMENT][google.rpc.Code.INVALID_ARGUMENT]). For more information, see [Request URIs](/storage/docs/reference-uris).
Optional cloud region where annotation should take place. Supported cloud regions: `us-east1`, `us-west1`, `europe-west1`, `asia-east1`. If no region is specified, a region will be determined based on video file location.
Video annotation progress. Included in the `metadata` field of the `Operation` returned by the `GetOperation` call of the `google::longrunning::Operations` service.
Progress metadata for all videos specified in `AnnotateVideoRequest`.
Video annotation response. Included in the `response` field of the `Operation` returned by the `GetOperation` call of the `google::longrunning::Operations` service.
Annotation results for all videos specified in `AnnotateVideoRequest`.
Bounding box.
Used in:
Left X coordinate.
Right X coordinate.
Bottom Y coordinate.
Top Y coordinate.
Face annotation.
Used in:
Thumbnail of a representative face view (in JPEG format). Encoding: base64.
All locations where a face was detected. Faces are detected and tracked on a per-video basis (as opposed to across multiple videos).
Face locations at one frame per second.
Face location.
Used in:
Bounding box in a frame.
Video time offset in microseconds.
Video annotation feature.
Used in:
Unspecified.
Label detection. Detect objects, such as dog or flower.
Human face detection and tracking.
Shot change detection.
Safe search detection.
Label annotation.
Used in:
Textual description, e.g. `Fixed-gear bicycle`.
Language code for `description` in BCP-47 format.
Where the label was detected and with what confidence.
Label detection mode.
Used in:
Unspecified.
Detect shot-level labels.
Detect frame-level labels.
Detect both shot-level and frame-level labels.
Label level (scope).
Used in:
Unspecified.
Video-level. Corresponds to the whole video.
Segment-level. Corresponds to one of `AnnotateSpec.segments`.
Shot-level. Corresponds to a single shot (i.e. a series of frames without a major camera position or background change).
Frame-level. Corresponds to a single video frame.
Label location.
Used in:
Video segment. Set to [-1, -1] for video-level labels. Set to [timestamp, timestamp] for frame-level labels. Otherwise, corresponds to one of `AnnotateSpec.segments` (if specified) or to shot boundaries (if requested).
Confidence that the label is accurate. Range: [0, 1].
Label level.
Bucketized representation of likelihood.
Used in:
Unknown likelihood.
Very unlikely.
Unlikely.
Possible.
Likely.
Very likely.
Safe search annotation (based on per-frame visual signals only). If no unsafe content has been detected in a frame, no annotations are present for that frame. If only some types of unsafe content have been detected in a frame, the likelihood is set to `UNKNOWN` for all other types of unsafe content.
Used in:
Likelihood of adult content.
Likelihood that an obvious modification was made to the original version to make it appear funny or offensive.
Likelihood of medical content.
Likelihood of violent content.
Likelihood of racy content.
Video time offset in microseconds.
Annotation progress for a single video.
Used in:
Video file location in [Google Cloud Storage](https://cloud.google.com/storage/).
Approximate percentage processed thus far. Guaranteed to be 100 when fully processed.
Time when the request was received.
Time of the most recent update.
Annotation results for a single video.
Used in:
Video file location in [Google Cloud Storage](https://cloud.google.com/storage/).
Label annotations. There is exactly one element for each unique label.
Face annotations. There is exactly one element for each unique face.
Shot annotations. Each shot is represented as a video segment.
Safe search annotations.
If set, indicates an error. Note that for a single `AnnotateVideoRequest` some videos may succeed and some may fail.
Video context and/or feature-specific parameters.
Used in:
Video segments to annotate. The segments may overlap and are not required to be contiguous or span the whole video. If unspecified, each video is treated as a single segment.
If label detection has been requested, what labels should be detected in addition to video-level labels or segment-level labels. If unspecified, defaults to `SHOT_MODE`.
Whether the video has been shot from a stationary (i.e. non-moving) camera. When set to true, might improve detection accuracy for moving objects.
Model to use for label detection. Supported values: "latest" and "stable" (the default).
Model to use for face detection. Supported values: "latest" and "stable" (the default).
Model to use for shot change detection. Supported values: "latest" and "stable" (the default).
Model to use for safe search detection. Supported values: "latest" and "stable" (the default).
Video segment.
Used in: , , ,
Start offset in microseconds (inclusive). Unset means 0.
End offset in microseconds (inclusive). Unset means 0.