Proto commits in google/tsl

These commits are when the Protocol Buffers files have changed: (only the last 100 relevant commits are shown)

2026-06-23

Commit:	716954e
Author:	Antonio Sanchez	2026-06-18 14:27:20 -0700
Committer:	Copybara-Service	2026-06-23 11:37:26 -0700

Add missing license headers. PiperOrigin-RevId: 934549510

2026-06-18

Commit:	9a4cd70
Author:	A. Unique TensorFlower	2026-05-22 13:25:37 -0700
Committer:	Copybara-Service	2026-06-18 16:09:26 -0700

Support multipass profiling in multihost_hlo_runner. PiperOrigin-RevId: 919831591

2026-06-12

Commit:	79cf295
Author:	A. Unique TensorFlower	2026-06-12 10:21:01 -0700
Committer:	Copybara-Service	2026-06-12 10:21:24 -0700

Default behaviour correction for GetSnapshotRequest PiperOrigin-RevId: 931213870

The documentation is generated from this commit.

Commit:	abc490c
Author:	A. Unique TensorFlower	2026-06-11 22:09:26 -0700
Committer:	Copybara-Service	2026-06-11 22:11:03 -0700

Add snapshot_session_id to GetSnapshotRequest proto. PiperOrigin-RevId: 930926325

2026-06-11

Commit:	6ad7fde
Author:	A. Unique TensorFlower	2026-06-11 10:26:17 -0700
Committer:	Copybara-Service	2026-06-11 10:27:56 -0700

Implement snapshot_session_id validation and override in GetSnapshot. PiperOrigin-RevId: 930605458

2026-05-14

Commit:	2dea0f9
Author:	Subham Soni	2026-04-13 01:20:43 -0700
Committer:	Copybara-Service	2026-05-13 23:30:35 -0700

Standardize XProf hostname resolution logic PiperOrigin-RevId: 898840873

2026-04-23

Commit:	f26e18f
Author:	A. Unique TensorFlower	2026-04-22 15:13:43 -0700
Committer:	Copybara-Service	2026-04-23 14:16:32 -0700

Add perf counter sampling options. PiperOrigin-RevId: 904070885

Commit:	5860113
Author:	Sannidhya Chauhan	2026-04-23 06:59:40 -0700
Committer:	Copybara-Service	2026-04-23 07:03:53 -0700

Add perf_counters option to ProfilerOptions PiperOrigin-RevId: 904434657

2026-04-13

Commit:	584a493
Author:	Ilya Tikhonovskiy	2026-03-18 02:49:39 -0700
Committer:	Copybara-Service	2026-04-13 06:48:06 -0700

Add aggregation mode to cupti collector. The goal of the pr is to generate a small, lightweight aggregated profile that could be collected frequently and would not slow down the execution. PiperOrigin-RevId: 885476831

2026-02-11

Commit:	b7d698f
Author:	A. Unique TensorFlower	2025-09-29 11:15:33 -0700
Committer:	Copybara-Service	2026-02-11 14:37:29 -0800

Creating UI options for trace mode when capturing profiles using sampling mode PiperOrigin-RevId: 812867750

2026-01-28

Commit:	bf9f65b
Author:	Sannidhya Chauhan	2026-01-28 02:23:23 -0800
Committer:	Copybara-Service	2026-01-28 02:23:44 -0800

Add StopContinuousProfiling RPC to ProfilerService. PiperOrigin-RevId: 862130665

2026-01-23

Commit:	b10432d
Author:	A. Unique TensorFlower	2025-12-04 01:57:25 -0800
Committer:	Copybara-Service	2026-01-23 10:46:59 -0800

Add JAX version metadata to xspace PiperOrigin-RevId: 840138208

2026-01-22

Commit:	e109aca
Author:	Sannidhya Chauhan	2026-01-22 04:18:49 -0800
Committer:	Copybara-Service	2026-01-22 06:23:43 -0800

Add stop_continuous_profiling to pywrap profiler plugin. PiperOrigin-RevId: 859542515

Commit:	1c307f5
Author:	Sannidhya Chauhan	2026-01-22 02:57:29 -0800
Committer:	Copybara-Service	2026-01-22 06:22:41 -0800

Add RPC to stop continuous profiling. PiperOrigin-RevId: 859516646

2026-01-08

Commit:	7415bdc
Author:	Subham Soni	2026-01-07 23:17:11 -0800
Committer:	Copybara-Service	2026-01-07 23:18:03 -0800

Add override_hostname to profiler_options This will allow a follow-up PR that allows utilizing this proto. PiperOrigin-RevId: 853576203

Commit:	3cb4f30
Author:	Subham Soni	2026-01-07 22:29:04 -0800
Committer:	Copybara-Service	2026-01-07 22:38:24 -0800

Use override_hostname from ProfileOptions PiperOrigin-RevId: 853561309

2026-01-02

Commit:	1450399
Author:	Aditya Sharma	2026-01-01 21:44:41 -0800
Committer:	Copybara-Service	2026-01-01 21:45:17 -0800

Add ContinuousProfiling and GetSnapshot RPCs to the profiler service. PiperOrigin-RevId: 851180780

2025-12-24

Commit:	a60ffab
Author:	Sannidhya Chauhan	2025-12-23 04:14:22 -0800
Committer:	Copybara-Service	2025-12-24 06:09:30 -0800

Enable circular buffer tracing for TPU PiperOrigin-RevId: 848119258

2025-12-23

Commit:	9182be0
Author:	Sannidhya Chauhan	2025-12-11 03:53:14 -0800
Committer:	Copybara-Service	2025-12-23 00:46:36 -0800

Add Python bindings for continuous profiling and snapshot retrieval. PiperOrigin-RevId: 843151720

Commit:	11676f9
Author:	Sannidhya Chauhan	2025-12-11 03:24:47 -0800
Committer:	Copybara-Service	2025-12-23 00:44:22 -0800

Add ContinuousProfiling and GetSnapshot RPCs to the profiler service. PiperOrigin-RevId: 843143959

2025-12-01

Commit:	a765cdd
Author:	A. Unique TensorFlower	2025-12-01 13:44:50 -0800
Committer:	Copybara-Service	2025-12-01 13:45:24 -0800

The comment for XEvents within an XLine is updated to specify that partial overlap is not allowed, while nesting is still permitted. PiperOrigin-RevId: 838916383

2025-11-11

Commit:	a8fece0
Author:	A. Unique TensorFlower	2025-11-11 00:01:15 -0800
Committer:	Copybara-Service	2025-11-11 00:01:50 -0800

Add int32 type to AdvancedConfigValue in ProfilerOptions. This allows `AdvancedConfigValue` to store `int32` values, expanding the types of configurations that can be represented. PiperOrigin-RevId: 830774830

Commit:	b2a69ed
Author:	Matt Hurd	2025-11-10 23:31:53 -0800
Committer:	Copybara-Service	2025-11-10 23:32:24 -0800

2025-11-07

Commit:	a1cb380
Author:	Matt Hurd	2025-10-07 13:03:01 -0700
Committer:	Copybara-Service	2025-11-07 10:39:44 -0800

[xprof] Add support for tracemark_lower and tracemark_upper profiler options PiperOrigin-RevId: 816343394

2025-10-27

Commit:	8a644da
Author:	Matt Hurd	2025-10-27 15:46:18 -0700
Committer:	Copybara-Service	2025-10-27 15:46:53 -0700

Add session_id to profiler_options This will allow a follow-up PR that allows utilizing this proto. PiperOrigin-RevId: 824709715

2025-09-10

Commit:	e71dfd9
Author:	Bryan Massoth	2025-09-10 11:52:12 -0700
Committer:	Copybara-Service	2025-09-10 11:52:42 -0700

Add a profiler for subprocesses. PiperOrigin-RevId: 805455652

2025-05-22

Commit:	7c4f81f
Author:	Sannidhya Chauhan	2025-05-21 22:48:16 -0700
Committer:	Copybara-Service	2025-05-21 22:49:13 -0700

Set remaining profile options and perform validation on the advanced_configuration keys. PiperOrigin-RevId: 761811794

2025-03-26

Commit:	d71df2f
Author:	A. Unique TensorFlower	2025-03-26 09:51:13 -0700
Committer:	Copybara-Service	2025-03-26 09:51:39 -0700

[AutoPGLE] Prevent an AutoPGLE to run if user launched an external profiler. PiperOrigin-RevId: 740804528

2025-03-22

Commit:	961956b
Author:	A. Unique TensorFlower	2025-03-22 01:54:12 -0700
Committer:	Copybara-Service	2025-03-22 01:54:56 -0700

[AutoPGLE] Prevent an AutoPGLE to run if user launched an external profiler. PiperOrigin-RevId: 739431800

2025-03-21

Commit:	5f4fcc6
Author:	A. Unique TensorFlower	2025-03-21 02:41:42 -0700
Committer:	Copybara-Service	2025-03-21 02:42:15 -0700

[AutoPGLE] Prevent an AutoPGLE to run if user launched an external profiler. PiperOrigin-RevId: 739109278

2025-03-20

Commit:	8bc89f8
Author:	Sannidhya Chauhan	2025-03-20 00:57:13 -0700
Committer:	Copybara-Service	2025-03-20 00:57:51 -0700

Introduce an advanced configuration option for the profiler. PiperOrigin-RevId: 738702030

2025-03-13

Commit:	e3ead3e
Author:	Julia Guo	2025-03-13 10:41:43 -0700
Committer:	Copybara-Service	2025-03-13 10:44:23 -0700

Move memory_profile.proto to OSS to prepare for OSS benchmarking PiperOrigin-RevId: 736560797

2024-11-27

Commit:	add1173
Author:	A. Unique TensorFlower	2024-11-27 11:03:44 -0800
Committer:	Copybara-Service	2024-11-27 11:05:13 -0800

This CL updates HostTracer to use Traceme Filter mask from profiling request. Also add utility functions to prepare filter mask. PiperOrigin-RevId: 700753727

2024-10-04

Commit:	5f3ef51
Author:	David Dunleavy	2024-10-04 16:15:36 -0700
Committer:	Copybara-Service	2024-10-04 16:17:09 -0700

Move `tsl/protobuf/error_codes.proto` to `xla/tsl/protobuf` PiperOrigin-RevId: 682485437

2024-09-25

Commit:	7804cf3
Author:	David Dunleavy	2024-09-25 14:27:56 -0700
Committer:	Copybara-Service	2024-09-25 14:30:55 -0700

Move `tsl/protobuf/*` besides `error_codes.proto` to `xla/tsl/protobuf` `error_codes.proto` will be moved in a separate change PiperOrigin-RevId: 678848832

2024-08-27

Commit:	4f6d2cd
Author:	A. Unique TensorFlower	2024-08-27 12:26:25 -0700
Committer:	Copybara-Service	2024-08-27 12:28:22 -0700

Changes to the test log proto for TSL changes PiperOrigin-RevId: 668092486

2024-08-19

Commit:	71c704c
Author:	A. Unique TensorFlower	2024-08-19 16:53:04 -0700
Committer:	Copybara-Service	2024-08-19 16:54:21 -0700

Proto bfc_memory_map change to compiler/xla/tsl PiperOrigin-RevId: 665044485

Commit:	e213475
Author:	A. Unique TensorFlower	2024-08-07 14:44:59 -0700
Committer:	Copybara-Service	2024-08-19 12:30:23 -0700

Add `ReportInfoToService` to coordination service for runtime info/error reporting and aggregation. This allows the coordinator to become a central place to gather runtime information and report it. PiperOrigin-RevId: 660545522

2024-07-26

Commit:	a78f990
Author:	A. Unique TensorFlower	2024-07-25 21:09:26 -0700
Committer:	Copybara-Service	2024-07-25 21:13:52 -0700

Add long polling as a new way to propagate error in coordination service. PiperOrigin-RevId: 656228596

2024-06-20

Commit:	4abf8c8
Author:	David Dunleavy	2024-06-18 10:45:05 -0700
Committer:	Copybara-Service	2024-06-20 16:11:37 -0700

Move `tensorflow/tsl/protobuf` to `xla/tsl/protobuf` PiperOrigin-RevId: 644440378

2024-05-31

Commit:	ccf06a2
Author:	A. Unique TensorFlower	2024-05-31 13:15:15 -0700
Committer:	Copybara-Service	2024-05-31 13:17:36 -0700

Allow optional KV store overwrites. Cleanups: use std::string_view, migrate away from TF_ASSERT/EXPECT. PiperOrigin-RevId: 639131697

2024-04-17

Commit:	ff3217b
Author:	Shanbin Ke	2024-04-17 01:15:11 -0700
Committer:	Copybara-Service	2024-04-17 01:17:13 -0700

PR #11449: [XLA:GPU] replace is_causal_mask with enum mask_type to support more cudnn mask generation Imported from GitHub PR https://github.com/openxla/xla/pull/11449 * replace is_causal_mask in backend_config with enum mask_type. * cudnn also supports padding mask and alibi mask. Copybara import of the project: -- 5780ffd6622978c0a173d90cc562d32f354bfbfd by cjkkkk <ske@nvidia.com>: replace is_causal_mask with mask_type to support more mask -- 73555dba8f8cc1319cf75ed9cf8fe49ad88948bd by cjkkkk <ske@nvidia.com>: add newline -- 2fed993035a2e5c17c990b60b7268ea9dbaeb7e5 by cjkkkk <ske@nvidia.com>: add is_causal_mask back in backend_config Merging this change closes #11449 PiperOrigin-RevId: 625595798

2024-04-11

Commit:	00b7447
Author:	David Dunleavy	2024-04-11 16:36:40 -0700
Committer:	Copybara-Service	2024-04-11 16:38:25 -0700

Move tsl/distributed_runtime to xla/tsl/distributed_runtime PiperOrigin-RevId: 623975602

2024-02-16

Commit:	8bd84de
Author:	Philipp Hack	2024-02-15 18:28:23 -0800
Committer:	Copybara-Service	2024-02-15 18:29:51 -0800

PR #8225: Layer Norm Gradient Fusion Imported from GitHub PR https://github.com/openxla/xla/pull/8225 Rewrites the backward graph of layer norm patterns (X, S, E, N, DY) -> (DX, DS, DB), where X, S, E, N and DY are the input to the forward layer norm, scale, expectation, norm factor and gradient w.r.t. the result of the forward layer norm, and DX, DS and DB are the gradients w.r.t. the input, scale and bias tensors, into a Custom Call to the cuDNN library. Copybara import of the project: -- 657202adda82f6fc60a88624ce2fb1c152bb5ed6 by Philipp Hack <phack@nvidia.com>: Support for layer norm gradient fusion. -- 91434d8054bbe15c2a94f864bdda3ebc10437805 by Philipp Hack <phack@nvidia.com>: Support for layer norm gradient fusion. -- 816d4c770e797d1558897a30ed033707ac6b0123 by Philipp Hack <phack@nvidia.com>: Support for layer norm gradient fusion. -- 946b0c8d9b3f018318aff846b888a453e1948612 by Philipp Hack <phack@nvidia.com>: Support for layer norm gradient fusion. Merging this change closes #8225 PiperOrigin-RevId: 607521087

2023-10-26

Commit:	17ce560
Author:	A. Unique TensorFlower	2023-10-25 05:12:42 -0700
Committer:	Copybara-Service	2023-10-25 17:48:03 -0700

Adds compressed_string type to XStat PiperOrigin-RevId: 576496122

2023-09-14

Commit:	bbee6bd
Author:	A. Unique TensorFlower	2023-09-14 03:31:01 -0700
Committer:	Copybara-Service	2023-09-14 03:32:45 -0700

Added placeholder for internal RPC option. PiperOrigin-RevId: 565319212

2023-08-28

Commit:	9ca4830
Author:	A. Unique TensorFlower	2023-08-28 16:27:32 -0700
Committer:	Copybara-Service	2023-08-28 16:28:54 -0700

Add escape hatch for users who do not want to enable coordination service by default. PiperOrigin-RevId: 560849974

2023-07-28

Commit:	8606673
Author:	Philipp Hack	2023-07-28 16:05:52 -0700
Committer:	Copybara-Service	2023-07-28 16:07:01 -0700

PR #60807: FP8 Convolutions in XLA Imported from GitHub PR https://github.com/tensorflow/tensorflow/pull/60807 Enables scaled convolutions of the form (X, W, x_scale, w_scale, y_scale) -> Y, where the input X, the filter W and the output Y are based on the `F8E4M3FN` and `F8E5M2` data types and x_scale, w_scale and y_scale are their scaling factors. Copybara import of the project: -- 8a30aa731c21612fe098a6b620a54922578611c2 by Philipp Hack <phack@nvidia.com>: Support for FP8 convolutions in XLA. -- caade6453519ad2531ebcf8f206e40187a1687ca by Philipp Hack <phack@nvidia.com>: Support for FP8 convolutions in XLA. -- ecd080bd6c64682f6bee62f4455ea2c37c279f26 by Philipp Hack <phack@nvidia.com>: Support for FP8 convolutions in XLA. -- da22a881a3d24fd4f357207034ba6c596aa414d0 by Philipp Hack <phack@nvidia.com>: Support for FP8 convolutions in XLA. Merging this change closes #60807 PiperOrigin-RevId: 551973730

2023-07-07

Commit:	2862c3d
Author:	TJ Xu	2023-07-07 13:34:08 -0700
Committer:	Copybara-Service	2023-07-07 13:36:17 -0700

PR #3886: [NVIDIA XLA:GPU] Introducing training support for cudnn fused mha (stream executor) Imported from GitHub PR https://github.com/openxla/xla/pull/3886 This is the commit to include only stream executor changes. Rewriter changes are in this [pr](https://github.com/openxla/xla/pull/3726) lhlo lowering logic is in this [pr](https://github.com/openxla/xla/pull/3886) Copybara import of the project: -- 0541a4c40a3cec2bc9b348043a02713602a28793 by TJ <tjx@nvidia.com>: Introducing training support for cudnn fused mha This is the commit to include only stream executor changes -- abec4e1134e4d934fed9231f6e29833a687e2d58 by TJ <tjx@nvidia.com>: Replace tsl::error with absl:: calls -- 4af3ef74ca208348cb5a90a3d8f000e4e2d1c1d8 by TJ <tjx@nvidia.com>: Place some of the vlog statements before std::move -- 0606d44ed2ea85a7a4c80dfb68237ffcc51a43cc by TJ <tjx@nvidia.com>: Address clang-tidy issues Merging this change closes #3886 PiperOrigin-RevId: 546371252

2023-06-30

Commit:	3cbd099
Author:	Yin Zhang	2023-06-30 10:56:51 -0700
Committer:	Copybara-Service	2023-06-30 11:00:25 -0700

1 line change: Fix comment typo in xplane.proto PiperOrigin-RevId: 544699109

2023-06-29

Commit:	c3ef667
Author:	George Karpenkov	2023-06-29 07:09:57 -0700
Committer:	Copybara-Service	2023-06-29 07:11:20 -0700

Move autotuning protos to XLA The protobufs are mostly used and modified inside of XLA, which also provide most of the GPU performance. PiperOrigin-RevId: 544346444

2023-06-28

Commit:	7d8807d
Author:	Jake Hall	2023-06-28 16:57:06 -0700
Committer:	Copybara-Service	2023-06-28 16:58:18 -0700

PR #3200: Add support for float8_e4m3fnuz and float8_e5m2fnuz. Imported from GitHub PR https://github.com/openxla/xla/pull/3200 This adds support for the two FP8 types `float8_e4m3fnuz` and `float8_e5m2fnuz` to XLA similar to `float8_e4m3fn`, `float8_e4m3b11`, and `float8_e5m2`. Copybara import of the project: -- 3b96f8fe219c1ac1bec5c4b99ff9c51148706981 by Jake Hall <jakeh@graphcore.ai>: Add support for float8_e4m3fnuz and float8_e5m2fnuz. Merging this change closes #3200 PiperOrigin-RevId: 544198797

2023-06-23

Commit:	df7e113
Author:	Tao Wang	2023-06-22 21:53:05 -0700
Committer:	Copybara-Service	2023-06-22 21:54:23 -0700

Move ProfiledInstructionsProto from `tensorflow/compiler/xla/...` to `tensorflow/tsl/profiler/protobuf/...` PiperOrigin-RevId: 542757964

2023-05-05

Commit:	672204a
Author:	David Majnemer	2023-05-02 18:24:24 -0700
Committer:	Copybara-Service	2023-05-04 18:49:07 -0700

[XLA] Add support for E4M3B11 for CPU & GPU PiperOrigin-RevId: 528942681

2023-04-08

Commit:	90a23e5
Author:	Jie Sun	2023-04-08 09:25:30 -0700
Committer:	Copybara-Service	2023-04-08 09:26:50 -0700

change Trace/TraceEvent etc to tsl::profiler package. PiperOrigin-RevId: 522821950

2023-03-30

Commit:	22ad2ad
Author:	A. Unique TensorFlower	2023-03-29 17:36:09 -0700
Committer:	Copybara-Service	2023-03-29 17:38:04 -0700

[Coord Service] Guard silent reconnect upon restart feature with a config field. Disable this by default so that unexpected reconnects upon restart will return an error. Context: this feature partly caused restart loops under certain conditions. PiperOrigin-RevId: 520488973

2023-03-29

Commit:	6bbc6b8
Author:	Ayan Moitra	2023-03-29 15:29:02 -0700
Committer:	Copybara-Service	2023-03-29 15:30:57 -0700

PR #60145: [NVIDIA XLA:GPU] Fused MHA Support in XLA GPU: SE Imported from GitHub PR https://github.com/tensorflow/tensorflow/pull/60145 cuDNN currently supports the following patterns for Multi-headed attention: BMM1 - BMM2 BMM1 - Scale - Bias - Mask - Softmax - BMM2 BMM1 - Scale - Bias - Mask - Softmax - Dropout - BMM2 BMM1 - Scale - Mask - Softmax - BMM2 BMM1 - Scale - Mask - Softmax - Dropout - BMM2 BMM1 - Softmax - Dropout - BMM2 This PR adds support for the stream executor for these patterns. Copybara import of the project: -- d65ed4bb5de2607f09eeb5d065caa3612d750a85 by Ayan Moitra <amoitra@nvidia.com>: Revert "Revert: PR #60047: Fused MHA Support in XLA GPU: Stream Executor changes" This reverts commit 7a4d2e8b1a503a45ff86d40b0387fc832302379f. -- 532ed5cf024a72716ed1040d973d74eb5be471d3 by Ayan Moitra <amoitra@nvidia.com>: Fused MHA Support in XLA GPU: SE Merging this change closes #60145 PiperOrigin-RevId: 520458719

2023-03-27

Commit:	e31f005
Author:	Peter Hawkins	2023-03-27 10:55:57 -0700
Committer:	Copybara-Service	2023-03-27 10:57:39 -0700

Revert: PR #60047: Fused MHA Support in XLA GPU: Stream Executor changes Imported from GitHub PR https://github.com/tensorflow/tensorflow/pull/60047 cuDNN currently supports the following patterns for Multi-headed attention: BMM1 - BMM2 BMM1 - Scale - Bias - Mask - Softmax - BMM2 BMM1 - Scale - Bias - Mask - Softmax - Dropout - BMM2 BMM1 - Scale - Mask - Softmax - BMM2 BMM1 - Scale - Mask - Softmax - Dropout - BMM2 BMM1 - Softmax - Dropout - BMM2 This PR adds support for the stream exec... PiperOrigin-RevId: 519770880

Commit:	09c41e9
Author:	Ayan Moitra	2023-03-27 03:42:08 -0700
Committer:	Copybara-Service	2023-03-27 03:43:43 -0700

PR #60047: Fused MHA Support in XLA GPU: Stream Executor changes Imported from GitHub PR https://github.com/tensorflow/tensorflow/pull/60047 cuDNN currently supports the following patterns for Multi-headed attention: BMM1 - BMM2 BMM1 - Scale - Bias - Mask - Softmax - BMM2 BMM1 - Scale - Bias - Mask - Softmax - Dropout - BMM2 BMM1 - Scale - Mask - Softmax - BMM2 BMM1 - Scale - Mask - Softmax - Dropout - BMM2 BMM1 - Softmax - Dropout - BMM2 This PR adds support for the stream executor for these patterns. Copybara import of the project: -- c957cb879b9757b5c524106aab066a09dab32dfd by Ayan Moitra <amoitra@nvidia.com>: Fused MHA Support in XLA GPU: Stream Executor changes -- 68785164a1ba14457148f1a7ed616183dec7c481 by Ayan Moitra <amoitra@nvidia.com>: Test build failure fix -- 99292e9540c02708b9acec872a60140000c31a4e by Ayan Moitra <amoitra@nvidia.com>: Possible fix for build failure -- 7b47ff4c5b7c429285ca437a308dd81ea66c1fd1 by Ayan Moitra <amoitra@nvidia.com>: Still fixing build test build fail-cannot repro locally -- d86e51b1e4a4a57ed50efc019f2f73d03d325cfa by Ayan Moitra <amoitra@nvidia.com>: continue fixing build test build fail-cannot repro locally -- 0bf1352234fddb20ddfec6fdc01b2806fb8c820b by Ayan Moitra <amoitra@nvidia.com>: try fix build issue Merging this change closes #60047 PiperOrigin-RevId: 519675175

2023-03-15

Commit:	1a0807a
Author:	Yang Chen	2023-03-14 18:09:56 -0700
Committer:	Copybara-Service	2023-03-14 18:11:15 -0700

#tf-data-service Improve error handling for SnapshotManager. If the snapshot manager receives an error from a worker: 1. It writes a StatusProto to an ERROR file. The error status can be recovered if the dispatcher restarts. 2. It returns empty task lists to other workers. The workers will then cancel the ongoing work here: https://github.com/tensorflow/tensorflow/blob/46dd63bd2c36c5b5fbdef9e2df652a5a31b55dc3/tensorflow/core/data/service/worker_impl.cc#L651-L659. PiperOrigin-RevId: 516684775

2023-03-13

Commit:	7d13006
Author:	A. Unique TensorFlower	2023-03-13 14:22:12 -0700
Committer:	Copybara-Service	2023-03-13 14:39:57 -0700

follow-up to cl/515714382. removes legacy trace events from the TSL Trace proto. PiperOrigin-RevId: 516321656

2023-03-01

Commit:	0ec37dc
Author:	Anlun Xu	2023-02-28 17:40:29 -0800
Committer:	Copybara-Service	2023-02-28 17:45:37 -0800

[xla:gpu] AOT autotuning on cache miss triggers runtime autotuning PiperOrigin-RevId: 513087574

2023-02-28

Commit:	a290ed1
Author:	Anlun Xu	2023-02-28 12:00:57 -0800
Committer:	Copybara-Service	2023-02-28 13:13:42 -0800

[xla:gpu] Add support for runtime convolution autotuning PiperOrigin-RevId: 513000771

2023-02-14

Commit:	f616431
Author:	Tomás Longeri	2023-02-14 12:25:15 -0800
Committer:	Copybara-Service	2023-02-14 13:28:00 -0800

Replace `tensorflow::Status::SetStackTrace` with `SetStackTrace(status, trace)`, to be compatible with the `absl::Status` API. PiperOrigin-RevId: 509604863

2023-02-10

Commit:	1493718
Author:	Sergey Kozub	2023-02-10 00:28:41 -0800
Committer:	Copybara-Service	2023-02-10 00:30:07 -0800

Add `reordered_int8_nchw_vect` flag to convolution backend proto. This is necessary to disambiguate layouts that could not be otherwise detected by XlaConvShapesToStreamExecutorLayouts, in this case int8x32 reordered filter and bias. PiperOrigin-RevId: 508586274

2023-02-06

Commit:	8f8eb4d
Author:	Ilia Sergachev	2023-02-06 09:13:02 -0800
Committer:	Copybara-Service	2023-02-06 09:14:24 -0800

[XLA:GPU] Add triton-based matmul emitter. PiperOrigin-RevId: 507498160

2023-02-03

Commit:	d8508ac
Author:	Sergey Kozub	2023-02-03 02:14:04 -0800
Committer:	Copybara-Service	2023-02-03 06:48:06 -0800

Add support for emitting the int8x32 cuDNN convolution reordering custom calls PiperOrigin-RevId: 506846205

2023-02-02

Commit:	88acc6a
Author:	Sergey Kozub	2023-02-02 06:00:14 -0800
Committer:	Copybara-Service	2023-02-02 06:46:28 -0800

Add custom calls for convolution inputs reordering (cuDNN specific int8x32 layout) PiperOrigin-RevId: 506597581

Commit:	999e79b
Author:	Ilia Sergachev	2023-02-02 03:44:43 -0800
Committer:	Copybara-Service	2023-02-02 03:58:29 -0800

[XLA:GPU] Add API to query full CUDA shared memory size including the dynamic one; use it for Triton-based GEMM. PiperOrigin-RevId: 506576577

2023-01-30

Commit:	4c2ec84
Author:	Ilia Sergachev	2023-01-26 09:09:03 -0800
Committer:	Copybara-Service	2023-01-30 09:09:03 -0800

[XLA:GPU] Add auto-tuning of triton-based matmul. PiperOrigin-RevId: 504858571

Commit:	2b83de6
Author:	Ilia Sergachev	2023-01-26 05:38:58 -0800
Committer:	Copybara-Service	2023-01-30 06:50:57 -0800

[XLA:GPU] Disable padding for triton-based matmuls. PiperOrigin-RevId: 504817238

2023-01-20

Commit:	65c0450
Author:	Parker Schuh	2023-01-19 18:44:37 -0800
Committer:	Copybara-Service	2023-01-19 18:45:57 -0800

Move xplane_to_trace_events and trace_events_to_json to tsl. PiperOrigin-RevId: 503320621

2022-12-16

Commit:	aecc416
Author:	Philipp Hack	2022-12-16 00:58:41 -0800
Committer:	Copybara-Service	2022-12-16 01:00:30 -0800

PR #58720: FP8 GEMMs in XLA Imported from GitHub PR https://github.com/tensorflow/tensorflow/pull/58720 Enables scaled GEMMs based on `F8E4M3FN` and `F8E5M2` [FP8 data types](https://arxiv.org/abs/2209.05433). The pattern described by steps 1 through 6 in [RFC #22](https://github.com/openxla/xla/discussions/22) is rewritten into a Custom Call of the form (A, B, a_scale, b_scale, d_scale) -> (D, d_amax), where A, B and D are FP8 matrices and a_scale, b_scale and d_scale are their respective scaling factors. The scalar d_amax gives the maximum of the absolute values in D before rescaling and casting to FP8 and can be used in the calculation of new scaling factors. Copybara import of the project: -- f2eb35a9efcaaffdbb7314f99521357840bd49d8 by Philipp Hack <phack@nvidia.com>: Support for FP8 GEMMs in XLA. -- 0afd695b3840417fdb1c00987c8c5e980be0de33 by Philipp Hack <phack@nvidia.com>: Support for FP8 GEMMs in XLA. -- 5aba0882bc624215613c77d73dd23ec3b1d8b0d9 by Philipp Hack <phack@nvidia.com>: Support for FP8 GEMMs in XLA. -- 8d18d22d61b1b440421fc3dd402acdaaf27519b3 by Philipp Hack <phack@nvidia.com>: Support for FP8 GEMMs in XLA. -- 7759e0a5d041c26c632d4e433d5f544e0194ea40 by Philipp Hack <phack@nvidia.com>: Support for FP8 GEMMs in XLA. Merging this change closes #58720 PiperOrigin-RevId: 495806551

2022-11-08

Commit:	a70d3ac
Author:	Clive Verghese	2022-11-08 13:05:20 -0800
Committer:	Copybara-Service	2022-11-08 13:06:29 -0800

Move profiler_options_proto to TSL PiperOrigin-RevId: 487031480

2022-11-05

Commit:	9876752
Author:	Tomás Longeri	2022-11-04 17:50:36 -0700
Committer:	Copybara-Service	2022-11-04 17:51:41 -0700

Roll forward of cl/478660619: Migrate core/protobuf/autotuning.proto and compiler/xla/stream_executor/dnn.proto to TSL PiperOrigin-RevId: 486269845

2022-11-04

Commit:	5b3dfb6
Author:	Clive Verghese	2022-11-04 14:22:25 -0700
Committer:	Copybara-Service	2022-11-04 14:23:58 -0700

Move profile.proto to TSL PiperOrigin-RevId: 486230034

2022-11-02

Commit:	69d51d9
Author:	A. Unique TensorFlower	2022-11-02 12:55:43 -0700
Committer:	Copybara-Service	2022-11-02 12:56:55 -0700

1) Fork a minimal subset of grpc_state for grpc_coordination_client. 2) Move grpc_client_cq_tag to TSL. 3) Move proto parse/unparse methods in grpc_util from TF to TSL. 4) Add a ctor arg for RPCState to include a user-defined proto_parse_fn. 5) Use optimized tensor parse fn in worker service. PiperOrigin-RevId: 485671378

Commit:	9df66e3
Author:	Michael Hudgins	2022-11-01 17:11:00 -0700
Committer:	Copybara-Service	2022-11-02 06:59:26 -0700

Testing failure of CI PiperOrigin-RevId: 485457160

2022-10-31

Commit:	6a79f3a
Author:	A. Unique TensorFlower	2022-10-31 13:19:06 -0700
Committer:	Copybara-Service	2022-10-31 13:20:15 -0700

Move coordination service to TSL. PiperOrigin-RevId: 485132220

Commit:	09c9d6a
Author:	Tomás Longeri	2022-10-30 23:39:51 -0700
Committer:	Copybara-Service	2022-10-30 23:41:27 -0700

Roll forward of cl/482043512: Migrate test_log.proto to TSL Also rename test_log_proto_impl to test_log_proto and clean up loads in tsl/util/BUILD. PiperOrigin-RevId: 484965660

2022-10-28

Commit:	d174948
Author:	Clive Verghese	2022-10-28 13:39:30 -0700
Committer:	Copybara-Service	2022-10-28 13:40:56 -0700

Roll forward Move XPlane Proto to TSL. PiperOrigin-RevId: 484610094

2022-10-27

Commit:	d31927d
Author:	A. Unique TensorFlower	2022-10-27 09:56:12 -0700
Committer:	Copybara-Service	2022-10-27 09:57:33 -0700

Internal change PiperOrigin-RevId: 484280058

2022-10-26

Commit:	b0088b5
Author:	A. Unique TensorFlower	2022-10-26 16:38:55 -0700
Committer:	Copybara-Service	2022-10-26 16:40:08 -0700

Internal change PiperOrigin-RevId: 484103999

Commit:	f94efb6
Author:	David Dunleavy	2022-10-26 12:58:56 -0700
Committer:	Copybara-Service	2022-10-26 16:01:02 -0700

Duplicate python/lib/core:bfloat16_lib in XLA PiperOrigin-RevId: 484047401

Commit:	bc58139
Author:	A. Unique TensorFlower	2022-10-26 13:59:23 -0700
Committer:	Copybara-Service	2022-10-26 14:01:42 -0700

Move coordination service and config protos to TSL. PiperOrigin-RevId: 484063774

Commit:	440c2aa
Author:	A. Unique TensorFlower	2022-10-26 12:44:38 -0700
Committer:	Copybara-Service	2022-10-26 12:45:51 -0700

Split RPCOptions out of config.proto and move it to TSL. PiperOrigin-RevId: 484044213

2022-10-24

Commit:	bd2a507
Author:	A. Unique TensorFlower	2022-10-24 16:36:41 -0700
Committer:	Copybara-Service	2022-10-24 16:39:04 -0700

1.Split common methods in distributed_runtime/rpc/grpc_util to a corresponding TSL library (e.g. ToGrpcStatus). 2. Move distributed_runtime_payloads.proto to TSL. PiperOrigin-RevId: 483517874

2022-10-19

Commit:	5e4705d
Author:	A. Unique TensorFlower	2022-10-19 00:21:45 -0700
Committer:	Copybara-Service	2022-10-19 00:22:31 -0700

Migrate test_log.proto to TSL Also rename test_log_proto_impl to test_log_proto and clean up loads in tsl/util/BUILD. PiperOrigin-RevId: 482124575

2022-10-18

Commit:	d0e138b
Author:	Tomás Longeri	2022-10-18 15:51:54 -0700
Committer:	Copybara-Service	2022-10-18 15:52:28 -0700

Migrate test_log.proto to TSL Also rename test_log_proto_impl to test_log_proto and clean up loads in tsl/util/BUILD. PiperOrigin-RevId: 482043512

Commit:	1df87a3
Author:	Tomás Longeri	2022-10-17 23:46:15 -0700
Committer:	Copybara-Service	2022-10-17 23:47:07 -0700

Migrate bfc_memory_map.proto to TSL PiperOrigin-RevId: 481833082

2022-10-14

Commit:	9202127
Author:	Michael Hudgins	2022-10-14 19:23:44 +0000

Moving original commit for tsl to preserve history

2022-10-13

Commit:	fa9a6f4
Author:	A. Unique TensorFlower	2022-10-13 11:38:42 -0700
Committer:	TensorFlower Gardener	2022-10-13 11:42:26 -0700

Roll forward Move XPlane Proto to TSL. PiperOrigin-RevId: 480944931

Commit:	847f55e
Author:	Clive Verghese	2022-10-13 10:13:51 -0700
Committer:	TensorFlower Gardener	2022-10-13 10:21:55 -0700

Roll forward Move XPlane Proto to TSL. PiperOrigin-RevId: 480922000

2022-10-11

Commit:	6fc777b
Author:	Clive Verghese	2022-10-11 11:59:02 -0700
Committer:	TensorFlower Gardener	2022-10-11 12:02:56 -0700

Move XPlane Proto to TSL PiperOrigin-RevId: 480418211

Commit:	ba3e7e9
Author:	Clive Verghese	2022-10-11 10:37:33 -0700
Committer:	TensorFlower Gardener	2022-10-11 10:45:48 -0700

Move XPlane Proto to TSL PiperOrigin-RevId: 480396023

2022-10-10

Commit:	4ab6453
Author:	A. Unique TensorFlower	2022-10-10 15:56:37 -0700
Committer:	TensorFlower Gardener	2022-10-10 16:00:37 -0700

Roll forward of cl/479867985: Shard core/framework/summary.proto's HistogramProto into tsl/protobuf/histogram.proto PiperOrigin-RevId: 480197757

Commit:	54265ed
Author:	Skye Wanderman-Milne	2022-10-10 14:25:35 -0700
Committer:	TensorFlower Gardener	2022-10-10 14:29:08 -0700

Migrate core/protobuf/autotuning.proto and compiler/xla/stream_executor/dnn.proto to TSL PiperOrigin-RevId: 480177205

2022-10-09

Commit:	841abaf
Author:	A. Unique TensorFlower	2022-10-08 23:55:11 -0700
Committer:	TensorFlower Gardener	2022-10-08 23:58:36 -0700

Shard core/framework/summary.proto's HistogramProto into tsl/protobuf/histogram.proto PiperOrigin-RevId: 479873997

Commit:	1e515b0
Author:	A. Unique TensorFlower	2022-10-08 22:42:14 -0700
Committer:	TensorFlower Gardener	2022-10-08 22:45:32 -0700

Shard core/framework/summary.proto's HistogramProto into tsl/protobuf/histogram.proto PiperOrigin-RevId: 479867985

2022-10-04

Commit:	14bb2a1
Author:	A. Unique TensorFlower	2022-10-03 18:51:11 -0700
Committer:	TensorFlower Gardener	2022-10-03 19:02:23 -0700

Migrate core/protobuf/autotuning.proto and compiler/xla/stream_executor/dnn.proto to TSL PiperOrigin-RevId: 478660619