package tensorflow.tpu.op_profile


message Metrics

Measurements of an operation (or aggregated set of operations). Metrics are always "total" rather than "self".

Used in: Node

double time = 1
Core-time taken by this operation, as a fraction of all operations.
double flops = 2
Floating point computations performed by this operation, as a fraction of peak core FLOPS * program time. This representation has useful properties:
- it is proportional to the number of floating point operations performed
- utilization is flops/time
- wasted potential flops is proportional to (time - flops)
- it does not reveal the peak core FLOPS of the hardware
double raw_time = 11
Elapsed core-time in picoseconds.
double raw_flops = 12
Total floating-point operations performed.
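
The relationships listed for time and flops can be sketched in Python. This is a minimal illustration, assuming a plain dataclass as a stand-in for the generated protobuf class; the helper names utilization, wasted and achieved_flops_per_second are hypothetical and not part of the schema.

```python
from dataclasses import dataclass


@dataclass
class Metrics:
    """Plain-Python stand-in for the Metrics message (not the generated class)."""
    time: float = 0.0       # fraction of core-time across all operations
    flops: float = 0.0      # fraction of (peak core FLOPS * program time)
    raw_time: float = 0.0   # elapsed core-time in picoseconds
    raw_flops: float = 0.0  # total floating-point operations


def utilization(m: Metrics) -> float:
    """FLOPS utilization while the operation runs: flops / time."""
    return m.flops / m.time if m.time > 0 else 0.0


def wasted(m: Metrics) -> float:
    """Quantity proportional to the unused FLOPS potential: time - flops."""
    return m.time - m.flops


def achieved_flops_per_second(m: Metrics) -> float:
    """Absolute rate from the raw fields; raw_time is converted from picoseconds."""
    return m.raw_flops / (m.raw_time * 1e-12) if m.raw_time > 0 else 0.0
```

For example, an operation with time = 0.5 and flops = 0.25 ran at half of peak while it was executing, and a quarter of the program's potential FLOPS went unused during it.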

message Node

An entry in the profile tree (an instruction, or a set of instructions).

Used in: Profile

string name = 1
Semantics depend on contents.
optional Metrics metrics = 2
May be omitted, e.g. for fused instructions.
repeated Node children = 3
oneof contents
Details about what this node represents.
- Node.InstructionCategory category = 4
- Node.XLAInstruction xla = 5
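
Because metrics are "total", a per-node "self" value has to be derived. The sketch below walks a Node tree and subtracts the children's totals from the parent's total. It assumes plain dataclasses in place of the generated protobuf classes, models the contents oneof as a string, and assumes that children's times do not overlap; walk and self_time are hypothetical helper names. With generated Python classes, the populated oneof member would normally be found with WhichOneof("contents").

```python
from dataclasses import dataclass, field
from typing import Iterator, List, Optional, Tuple


@dataclass
class Metrics:
    time: float = 0.0  # only the field needed for this sketch


@dataclass
class Node:
    """Stand-in for the Node message; `contents` mimics the oneof."""
    name: str
    metrics: Optional[Metrics] = None        # may be omitted
    children: List["Node"] = field(default_factory=list)
    contents: Optional[str] = None           # "category", "xla", or None


def walk(node: Node, depth: int = 0) -> Iterator[Tuple[int, Node]]:
    """Yield (depth, node) pairs in pre-order."""
    yield depth, node
    for child in node.children:
        yield from walk(child, depth + 1)


def self_time(node: Node) -> Optional[float]:
    """Total time minus the children's totals; None if metrics are omitted."""
    if node.metrics is None:
        return None
    child_total = sum(c.metrics.time for c in node.children if c.metrics is not None)
    return node.metrics.time - child_total


# Illustrative tree; the names and numbers are made up for the example.
root = Node("by_category", Metrics(1.0), contents=None, children=[
    Node("convolution", Metrics(0.5), contents="category", children=[
        Node("%conv.1", Metrics(0.25), contents="xla"),
        Node("%conv.2", Metrics(0.25), contents="xla"),
    ]),
    Node("data formatting", Metrics(0.25), contents="category", children=[
        Node("%copy.3", contents="xla"),  # metrics omitted
    ]),
])

xla_names = [n.name for _, n in walk(root) if n.contents == "xla"]
```

Children without metrics contribute nothing to the subtraction, so their cost stays attributed to the parent.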

message Node.InstructionCategory

A category of XLA instructions. name is a descriptive string, like "data formatting".

Used in: Node

(message has no fields)

message Node.XLAInstruction

A single XLA instruction. name is the unique instruction id, like "%multiply.5".

Used in: Node

string op = 1
Opcode, e.g. %multiply.
string expression = 2
The instruction expression, e.g. %multiply = [shape]multiply(operand1, operand2)
string provenance = 3
Typically the TensorFlow operation name.
string category = 4
optional XLAInstruction.LayoutAnalysis layout = 5
Describes the physical memory layout of the instruction's primary input. For example, for a convolution, this analyzes the image and ignores the kernel.

message Node.XLAInstruction.LayoutAnalysis

Used in: XLAInstruction

repeated LayoutAnalysis.Dimension dimensions = 1
The physical data layout, from most-minor to most-major dimensions.

message Node.XLAInstruction.LayoutAnalysis.Dimension

Used in: LayoutAnalysis

int32 size = 1
Size of the data in this dimension.
int32 alignment = 2
Data must be padded to a multiple of alignment.
string semantics = 3
What the dimension represents, e.g. "spatial".
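
The size and alignment fields together determine how much padding a layout carries. A minimal sketch, assuming a plain dataclass in place of the generated class: padded_size and layout_efficiency are hypothetical helpers, and treating the ratio of useful to padded elements as an "efficiency" is an interpretation, not something the schema defines.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Dimension:
    """Stand-in for LayoutAnalysis.Dimension."""
    size: int            # size of the data in this dimension
    alignment: int       # data is padded to a multiple of this value
    semantics: str = ""  # e.g. "spatial"


def padded_size(d: Dimension) -> int:
    """Round size up to the next multiple of alignment."""
    if d.alignment <= 1:
        return d.size
    return -(-d.size // d.alignment) * d.alignment  # ceiling division


def layout_efficiency(dims: List[Dimension]) -> float:
    """Fraction of the padded buffer that holds real data."""
    useful = 1
    padded = 1
    for d in dims:
        useful *= d.size
        padded *= padded_size(d)
    return useful / padded if padded else 0.0
```

For instance, a dimension of size 3 with alignment 8 is padded to 8 elements, so only 3/8 of that dimension's storage holds data.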

message Profile

Profile is the top-level data that summarizes a program.

Used in: ProfileResponse

optional Node by_category = 1
Root of a profile broken down by instruction category.
optional Node by_program_structure = 2
Root of a profile broken down by program structure.
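
Both roots are optional, so a consumer should check which views are present before rendering one. The sketch below uses plain dataclasses as stand-ins and flattens metrics.time into a single optional field for brevity; available_views and render are hypothetical helpers. With generated Python classes, presence of an optional message field would normally be checked with HasField.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional


@dataclass
class Node:
    name: str
    time: Optional[float] = None  # stand-in for metrics.time; None if metrics are omitted
    children: List["Node"] = field(default_factory=list)


@dataclass
class Profile:
    by_category: Optional[Node] = None
    by_program_structure: Optional[Node] = None


def available_views(p: Profile) -> Dict[str, Node]:
    """Return only the roots that are actually set."""
    views = {
        "by_category": p.by_category,
        "by_program_structure": p.by_program_structure,
    }
    return {name: root for name, root in views.items() if root is not None}


def render(node: Node, depth: int = 0) -> List[str]:
    """Format a tree as indented lines, one per node, with its time share."""
    share = "   n/a" if node.time is None else f"{node.time:6.1%}"
    lines = [f"{share} {'  ' * depth}{node.name}"]
    for child in node.children:
        lines.extend(render(child, depth + 1))
    return lines
```

Calling "\n".join(render(root)) on one of the available roots gives a simple text view of that breakdown.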
