package google.cloud.documentai.v1beta1

service DocumentUnderstandingService

document_understanding.proto:35

Service to parse structured information from unstructured or semi-structured documents using state-of-the-art Google AI such as natural language, computer vision, and translation.

message BatchProcessDocumentsResponse

document_understanding.proto:105

Response to a batch document processing request. This is returned in the LRO Operation after the operation is complete.

message BoundingPoly

geometry.proto:49

A bounding polygon for the detected image annotation.

Used in: Document.Page.Layout, TableBoundHint

message Document

document.proto:35

Document represents the canonical document resource in Document Understanding AI. It is an interchange format that provides insights into documents and allows for collaboration between users and Document Understanding AI to iterate and optimize for quality.

message Document.Entity

document.proto:345

A phrase in the text that is a known entity type, such as a person, an organization, or location.

Used in: Document

message Document.EntityRelation

document.proto:364

Relationship between [Entities][google.cloud.documentai.v1beta1.Document.Entity].

Used in: Document

message Document.Page

document.proto:92

A page in a [Document][google.cloud.documentai.v1beta1.Document].

Used in: Document

message Document.Page.Block

document.proto:150

A block has a set of lines (collected into paragraphs) that have a common line-spacing and orientation.

Used in: Page

message Document.Page.DetectedLanguage

document.proto:289

Detected language for a structural component.

Used in: Page, Block, FormField, Line, Paragraph, Table, Table.TableCell, Token, VisualElement

message Document.Page.Dimension

document.proto:94

Dimension for the page.

Used in: Page

message Document.Page.FormField

document.proto:270

A form field detected on the page.

Used in: Page

message Document.Page.Layout

document.proto:106

Visual element describing a layout unit on a page.

Used in: Page, Block, FormField, Line, Paragraph, Table, Table.TableCell, Token, VisualElement

enum Document.Page.Layout.Orientation

document.proto:108

Detected human reading orientation.

Used in: Layout

message Document.Page.Line

document.proto:171

A collection of tokens that a human would perceive as a line. A line does not cross column boundaries and can be horizontal, vertical, etc.

Used in: Page

message Document.Page.Paragraph

document.proto:160

A collection of lines that a human would perceive as a paragraph.

Used in: Page

message Document.Page.Table

document.proto:232

A table representation similar to HTML table structure.

Used in: Page

message Document.Page.Table.TableCell

document.proto:240

A cell representation inside the table.

Used in: TableRow

message Document.Page.Table.TableRow

document.proto:234

A row of table cells.

Used in: Table

message Document.Page.Token

document.proto:181

A detected token.

Used in: Page

message Document.Page.Token.DetectedBreak

document.proto:184

Detected break at the end of a [Token][google.cloud.documentai.v1beta1.Document.Page.Token].

Used in: Token

enum Document.Page.Token.DetectedBreak.Type

document.proto:186

Enum to denote the type of break found.

Used in: DetectedBreak

message Document.Page.VisualElement

document.proto:218

Detected non-text visual elements on the page, e.g. checkbox, signature, etc.

Used in: Page

message Document.ShardInfo

document.proto:39

For a large document, sharding may be performed to produce several document shards. Each document shard contains this field to detail which shard it is.

Used in: Document

message Document.Style

document.proto:54

Annotation for common text style attributes. This adheres to CSS conventions as much as possible.

Used in: Document

message Document.Style.FontSize

document.proto:56

Font size with unit.

Used in: Style

message Document.TextAnchor

document.proto:377

Text reference indexing into the [Document.text][google.cloud.documentai.v1beta1.Document.text].

Used in: Entity, Page.Layout, Style

message Document.TextAnchor.TextSegment

document.proto:383

A text segment in the [Document.text][google.cloud.documentai.v1beta1.Document.text]. The indices may be out of bounds, which indicates that the text extends into another document shard for large sharded documents. See [ShardInfo.text_offset][google.cloud.documentai.v1beta1.Document.ShardInfo.text_offset].

Used in: TextAnchor
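
The following is a minimal sketch of how a text anchor could be resolved against the text of a single shard, clamping out-of-bounds indices as described above. The field names `start_index` and `end_index`, the half-open interval convention, and the helper `resolve_anchor` are assumptions made for illustration; they are not stated in this listing, so check them against document.proto before relying on them.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class TextSegment:
    # Assumed field names; this listing does not show the message fields.
    start_index: int  # assumed: inclusive start offset into the shard text
    end_index: int    # assumed: exclusive end offset into the shard text


def resolve_anchor(shard_text: str, segments: List[TextSegment]) -> str:
    """Concatenate the parts of each segment that fall inside this shard.

    Indices beyond the shard text are clamped, since they may point into
    another shard of a large sharded document.
    """
    parts = []
    for seg in segments:
        lo = max(seg.start_index, 0)
        hi = min(seg.end_index, len(shard_text))
        if lo < hi:  # skip segments that lie entirely outside this shard
            parts.append(shard_text[lo:hi])
    return "".join(parts)
```

A segment that runs past the end of the shard yields only the portion available locally; recovering the remainder would require the neighbouring shard.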

message EntityExtractionParams

document_understanding.proto:202

Parameters to control entity extraction behavior.

Used in: ProcessDocumentRequest

message FormExtractionParams

document_understanding.proto:166

Parameters to control form extraction behavior.

Used in: ProcessDocumentRequest

message GcsDestination

document_understanding.proto:259

The Google Cloud Storage location where the output file will be written to.

Used in: OutputConfig

message GcsSource

document_understanding.proto:254

The Google Cloud Storage location where the input file will be read from.

Used in: InputConfig

message InputConfig

document_understanding.proto:212

The desired input location and metadata.

Used in: ProcessDocumentRequest, ProcessDocumentResponse

message KeyValuePairHint

document_understanding.proto:190

User-provided hint for a key-value pair.

Used in: FormExtractionParams

message NormalizedVertex

geometry.proto:40

A vertex represents a 2D point in the image. NOTE: the normalized vertex coordinates are relative to the original image and range from 0 to 1.

Used in: BoundingPoly
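
Because normalized coordinates range from 0 to 1 relative to the original image, converting them to pixel positions only needs the image width and height. The sketch below illustrates this; the helper name `to_pixel_vertices` and the representation of each vertex as an `(x, y)` tuple are assumptions for illustration and are not part of the API.

```python
from typing import List, Tuple


def to_pixel_vertices(
    normalized: List[Tuple[float, float]], width: int, height: int
) -> List[Tuple[int, int]]:
    """Scale normalized (x, y) pairs in [0, 1] to pixel coordinates."""
    pixels = []
    for x, y in normalized:
        # Clamp defensively in case a value falls slightly outside [0, 1].
        x = min(max(x, 0.0), 1.0)
        y = min(max(y, 0.0), 1.0)
        pixels.append((round(x * width), round(y * height)))
    return pixels
```

For example, on a 1000 by 800 image the normalized point (0.5, 0.25) maps to the pixel (500, 200).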

message OcrParams

document_understanding.proto:122

Parameters to control Optical Character Recognition (OCR) behavior.

Used in: ProcessDocumentRequest

message OperationMetadata

document_understanding.proto:264

Contains metadata for the BatchProcessDocuments operation.

enum OperationMetadata.State

document_understanding.proto:265

Used in: OperationMetadata

message OutputConfig

document_understanding.proto:226

The desired output location and metadata.

Used in: ProcessDocumentRequest, ProcessDocumentResponse

message ProcessDocumentRequest

document_understanding.proto:74

Request to process one document.

Used in: BatchProcessDocumentsRequest

message ProcessDocumentResponse

document_understanding.proto:111

Response to a single document processing request.

Used in: BatchProcessDocumentsResponse

message TableBoundHint

document_understanding.proto:154

A hint for a table bounding box on the page for table parsing.

Used in: TableExtractionParams

message TableExtractionParams

document_understanding.proto:135

Parameters to control table extraction behavior.

Used in: ProcessDocumentRequest

message Vertex

geometry.proto:29

A vertex represents a 2D point in the image. NOTE: the vertex coordinates are in the same scale as the original image.

Used in: BoundingPoly