Proto commits in triton-inference-server/common

These 74 commits are when the Protocol Buffers files have changed:

2026-04-02

Commit:	b0b5387
Author:	Sai Kiran Polisetty	2026-04-02 20:06:30 +0530
Committer:	GitHub	2026-04-02 20:06:30 +0530

doc: Enforce `max_inflight_requests` as a shared limit across ensemble requests (#152)

The documentation is generated from this commit.

2025-11-03

Commit:	6a318ca
Author:	Sai Kiran Polisetty	2025-11-03 10:57:08 +0530
Committer:	GitHub	2025-11-03 10:57:08 +0530

feat: Add support for `max_inflight_requests` parameter to prevent unbounded memory growth in ensemble models (#141) Co-authored-by: Yingge He <157551214+yinggeh@users.noreply.github.com>

2025-03-26

Commit:	bfb7c4d
Author:	Indrajit Bhosale	2025-03-26 00:02:07 -0700

Pre-Commit fix

Commit:	86e510a
Author:	Indrajit Bhosale	2025-03-26 00:00:45 -0700

Draft for ModelInfer

Commit:	54618ab
Author:	Indrajit Bhosale	2025-03-25 23:58:02 -0700

Draft for ModelInfer

2025-02-26

Commit:	95197f1
Author:	Indrajit Bhosale	2025-02-26 09:23:47 -0800

Create New service for callback

Commit:	7478ed9
Author:	Indrajit Bhosale	2025-02-26 05:33:58 -0800

Create New service for callback

Commit:	4843947
Author:	Indrajit Bhosale	2025-02-26 04:36:43 -0800

Create New service for callback

2024-11-06

Commit:	3948525
Author:	Yingge He	2024-11-05 17:19:37 -0800
Committer:	GitHub	2024-11-05 17:19:37 -0800

feat: Per-model metric customization (#126)

2024-09-23

Commit:	15f7227
Author:	fpetrini15	2024-09-23 15:52:25 -0700

Test

2024-07-27

Commit:	2e9cb9a
Author:	Sai Kiran Polisetty	2024-07-27 19:40:09 +0530
Committer:	GitHub	2024-07-27 19:40:09 +0530

Fix shape and reformat free tensor handling in the input byte size check (#125) * Update model_config.proto

2024-02-23

Commit:	00b3a71
Author:	Jacky	2024-02-23 13:29:48 -0800
Committer:	GitHub	2024-02-23 13:29:48 -0800

Add cancellation into response statistics (#113)

2024-02-17

Commit:	bf4b163
Author:	Jacky	2024-02-16 17:37:50 -0800
Committer:	GitHub	2024-02-16 17:37:50 -0800

Add response statistics (#112) * Add response stats to protobuf * Remove mentioning decoupled on comments

2024-02-01

Commit:	a506fbe
Author:	Francesco Petrini	2024-02-01 10:07:56 -0800
Committer:	GitHub	2024-02-01 10:07:56 -0800

Support Double-Type Infer/Response Parameters * Support Double-Type Infer/Response Parameters

2024-01-11

Commit:	00a4288
Author:	Jacky	2024-01-11 09:11:16 -0800
Committer:	GitHub	2024-01-11 09:11:16 -0800

Add runtime to model configuration (#103) * Add runtime to model config * Update copyright

2023-11-19

Commit:	a8a7341
Author:	Iman Tabrizian	2023-11-19 16:09:59 -0500
Committer:	GitHub	2023-11-19 16:09:59 -0500

Generative ->Iterative (#107) (#108) * name change * updated language * updated with default value * updated language Co-authored-by: Neelay Shah <neelays@nvidia.com>

Commit:	3ecedb0
Author:	Neelay Shah	2023-11-19 12:56:37 -0800
Committer:	GitHub	2023-11-19 15:56:37 -0500

Generative ->Iterative (#107) * name change * updated language * updated with default value * updated language

2023-11-15

Commit:	805dbcf
Author:	Iman Tabrizian	2023-11-15 11:35:00 -0500
Committer:	Misha Chornyi	2023-11-15 09:00:43 -0800

Add options for growable memory and single state buffers (#104) * Add same input/output bstate buffer option * Add an option for using GrowableMemory * Review comments * Format * Review comments * Review comment * Fix description

Commit:	9f8c873
Author:	Iman Tabrizian	2023-11-15 11:35:00 -0500
Committer:	GitHub	2023-11-15 11:35:00 -0500

2023-11-01

Commit:	adef772
Author:	GuanLuo	2023-10-31 22:43:14 -0700
Committer:	GitHub	2023-10-31 22:43:14 -0700

Add new sequence batcher parameter for generative sequence (#102)

2023-09-06

Commit:	468eb21
Author:	dyastremsky	2023-09-06 15:29:26 -0700
Committer:	GitHub	2023-09-06 15:29:26 -0700

Add GitHub action to format and lint code (#96) * Add pre-commit hook * Run commit hooks, remove ignored word list * Add GitHub action * Add Java to Clang * Fix pre-commit to include all Python files * Remove old formatter * Remove unused skipped files * Remove codeql because no more Python * Add more pre-commit filetype checkers * Trim whitespace hook * Remove unnecessary dependency * Add mixed-line-ending and case-conflicts checks * Add copyright * Update max-line-length * Remove unnecessary line * End of file * Fix comment * Add and apply isort * Remove duplicate copyrights, add hooks link * Pin workflow Ubuntu version * Flake8 Black style, move Flake8 conf to toml * Alphabetize configs by tool * Move flake8 back into pre-commit-config * Restore clang-format file * Eof newline * Fix yaml spacing * Normalize spacing * Normalize config indentation * Update line limit in clang-format to 80 chars * Update workflows to run on every PR * Run pre-commit

2023-08-15

Commit:	072ad13
Author:	Tanmay Verma	2023-08-15 12:46:47 -0700
Committer:	GitHub	2023-08-15 12:46:47 -0700

Add preserve_ordering field to oldest strategy in sequence scheduler config (#97) (#98) Co-authored-by: Ryan McCormick <rmccormick@nvidia.com>

Commit:	a2de06f
Author:	Ryan McCormick	2023-08-15 12:06:26 -0700
Committer:	GitHub	2023-08-15 12:06:26 -0700

Add preserve_ordering field to oldest strategy in sequence scheduler config (#97)

2023-06-22

Commit:	1df32b9
Author:	dyastremsky	2023-06-22 11:33:55 -0700
Committer:	GitHub	2023-06-22 11:33:55 -0700

Auto-format (#95)

2023-06-08

Commit:	7ff0105
Author:	Neelay Shah	2023-06-08 12:26:10 -0700
Committer:	GitHub	2023-06-08 12:26:10 -0700

Updating Service and Model Config Protobuf for uint64 Request Priority (#93) Co-authored-by: qmas <q.massoz@evs.com>

Commit:	869bf83
Author:	Neelay Shah	2023-06-08 11:34:23 -0700
Committer:	GitHub	2023-06-08 11:34:23 -0700

Revert "Updating Service and Model Config Protobuf for uint64 Request Priority" (#92) This reverts commit e3048594e2ed6d7532099c80b8fb26ec42dd7fe9.

Commit:	d1ac878
Author:	nnshah1	2023-06-08 11:28:18 -0700

Revert "Updating Service and Model Config Protobuf for uint64 Request Priority" This reverts commit e3048594e2ed6d7532099c80b8fb26ec42dd7fe9.

Commit:	e304859
Author:	Neelay Shah	2023-06-08 09:02:35 -0700
Committer:	GitHub	2023-06-08 09:02:35 -0700

Updating Service and Model Config Protobuf for uint64 Request Priority * change priority from uint32 to uint64 in model_config * add uint64 and double types to inference parameters Co-authored-by: qmas <q.massoz@evs.com>

2023-06-07

Commit:	34a0f79
Author:	nnshah1	2023-06-07 13:15:27 -0700

updated with documentation on support for double and uint64

2023-05-31

Commit:	b0d13a2
Author:	Neelay Shah	2023-05-31 16:27:59 -0700

adding uint64 and double param to infer parameter.

Commit:	31004d0
Author:	Neelay Shah	2023-05-31 16:14:04 -0700
Committer:	GitHub	2023-05-31 16:14:04 -0700

change priority from uint32 to uint64 in model_config Co-authored-by: qmas <q.massoz@evs.com>

2023-05-25

Commit:	501aa75
Author:	GuanLuo	2023-05-25 15:08:01 -0700
Committer:	GitHub	2023-05-25 15:08:01 -0700

Add memory usage report in GRPC statistic service (#88) * Update GRPC service proto * Fix type * Fix type

2023-05-08

Commit:	f9904d9
Author:	GuanLuo	2023-05-08 16:44:26 -0700
Committer:	GitHub	2023-05-08 16:44:26 -0700

Update documentation for "platform" (#89)

2023-02-21

Commit:	974998c
Author:	GuanLuo	2023-02-21 14:01:07 -0800
Committer:	GitHub	2023-02-21 14:01:07 -0800

Add reserved namespace field in ensemble step (#81)

2023-01-23

Commit:	7b37a24
Author:	dyastremsky	2023-01-23 09:50:08 -0800
Committer:	GitHub	2023-01-23 09:50:08 -0800

Add protobuf for GRPC health check (#80) * Draft health service * Formatting * Clean up * Change build order * Add health proto to targets * Change ordering * Reordering build * Add comments * Copyrights, formatting * Keep implemented methods * Remove Python health executables * Rename health library * Naming

2022-10-28

Commit:	c06c43b
Author:	Iman Tabrizian	2022-10-28 15:27:01 -0400
Committer:	GitHub	2022-10-28 15:27:01 -0400

Improve the documentation for input_data_file. (#76)

2022-10-11

Commit:	cb62c76
Author:	kthui	2022-10-10 12:33:45 -0700
Committer:	Misha Chornyi	2022-10-10 19:06:05 -0700

Revert per response metrics

2022-10-10

Commit:	050e5ba
Author:	kthui	2022-10-10 16:55:14 -0700
Committer:	GitHub	2022-10-10 16:55:14 -0700

Revert per response metrics (#74)

2022-09-20

Commit:	b018b65
Author:	Iman Tabrizian	2022-09-20 17:04:26 -0400
Committer:	GitHub	2022-09-20 14:04:26 -0700

Add response statistics to GRPC frontend (#71) * Add response statistics to GRPC frontend * Improve docs * Improve comments * Add no response count * Improve documentation clarity Co-authored-by: kthui <18255193+kthui@users.noreply.github.com>

2022-08-15

Commit:	58a25d1
Author:	Iman Tabrizian	2022-08-15 11:54:23 -0400

Update documentation for execution accelerators

2022-08-10

Commit:	d401744
Author:	Francesco Petrini	2022-08-10 12:40:37 -0700
Committer:	GitHub	2022-08-10 12:40:37 -0700

Incorporating Dynamic Logging (#70) * Migrating Changes * New line * Add comments

2022-06-15

Commit:	051c706
Author:	GuanLuo	2022-06-14 17:05:43 -0700
Committer:	GitHub	2022-06-14 17:05:43 -0700

Add 'count' field for warmup (#61) * Add 'repeat_count' field for warmup * Address comment * Change "repeat_count" to "count"

2022-05-09

Commit:	976afde
Author:	GuanLuo	2022-05-09 14:01:13 -0700
Committer:	GitHub	2022-05-09 14:01:13 -0700

Extend GRPC ModelRepositoryParameter to allow bytes (#51)

2022-04-28

Commit:	2e51208
Author:	Ryan McCormick	2022-04-28 14:07:07 -0700
Committer:	GitHub	2022-04-28 14:07:07 -0700

Add TYPE_BF16 scaffolding (#49) * TYPE_BF16 scaffolding * Add note on BF16 datatype requiring use raw contents

2022-03-10

Commit:	fc2f0a6
Author:	GuanLuo	2022-03-10 10:31:03 -0800
Committer:	GitHub	2022-03-10 10:31:03 -0800

Add batch input item shape specification (#43) * Add batch input item shape specification * Fix copyright * Address comment

Commit:	b9099c4
Author:	Ryan McCormick	2022-03-10 09:21:06 -0800
Committer:	GitHub	2022-03-10 09:21:06 -0800

Add cache_miss to grpc stub (#42) * Add cache_miss to grpc stub * Update 2022 copyright header * Review comments

2022-02-23

Commit:	59c891c
Author:	GuanLuo	2022-02-23 14:14:05 -0800
Committer:	GitHub	2022-02-23 14:14:05 -0800

Extend load API in GRPC service (#41) * Extend load API * Fix copyright

2022-02-09

Commit:	b1ef9c1
Author:	Ryan McCormick	2022-02-09 10:30:54 -0800
Committer:	GitHub	2022-02-09 10:30:54 -0800

Update GRPC stub to include cache stats (#37) * Add cache_hit stat to common GRPC protobuf * Update GRPC proto to match server/docs/protocol/extension_statistics.md * Add more details on cache hits per review feedback * Add more details to 'cache_hit' field and refer to it in the 'compute_*' fields

2022-02-08

Commit:	b7e11ba
Author:	GuanLuo	2022-02-08 14:47:15 -0800
Committer:	GitHub	2022-02-08 14:47:15 -0800

Add GRPC trace service (#40) * Add GRPC trace service * Fix up * Address comment * Expose JSON null check * Address comment * Address comment

2022-01-28

Commit:	65dec4c
Author:	CoderHam	2022-01-28 11:20:29 -0800

map cannot be in oneof - create new message for map

Commit:	09b6735
Author:	CoderHam	2022-01-28 11:20:00 -0800

fix TensorStructure def

Commit:	481d507
Author:	CoderHam	2022-01-27 17:08:34 -0800

review edits

2022-01-27

Commit:	f2e67ed
Author:	CoderHam	2022-01-27 14:42:14 -0800

cleanup

Commit:	8f6ee44
Author:	CoderHam	2022-01-27 14:40:43 -0800

test

Commit:	3a8e7a3
Author:	CoderHam	2022-01-26 17:12:24 -0800

Add TensorStructure field for I/O

2021-12-14

Commit:	c009eeb
Author:	Iman Tabrizian	2021-12-14 15:17:37 -0500
Committer:	GitHub	2021-12-14 15:17:37 -0500

Add state initialization setting to model config protobuf (#36) * Add state initialization setting to model config protobuf * Review edit * Remove nested metadata

2021-12-10

Commit:	f939abe
Author:	GuanLuo	2021-12-10 09:41:10 -0800
Committer:	GitHub	2021-12-10 09:41:10 -0800

Add optional field in ModelInput message (#35) * Add optional field in ModelInput message * Fix comment

2021-12-07

Commit:	dc3cbd2
Author:	Iman Tabrizian	2021-12-07 17:06:48 -0500
Committer:	Iman Tabrizian	2021-12-07 18:25:04 -0500

Review edit

Commit:	175e2d5
Author:	Iman Tabrizian	2021-12-07 12:52:29 -0500
Committer:	Iman Tabrizian	2021-12-07 15:16:32 -0500

Add state initialization setting to model config protobuf

2021-11-10

Commit:	e8c269d
Author:	deadeyegoodwin	2021-11-10 13:02:47 -0800
Committer:	GitHub	2021-11-10 13:02:47 -0800

Fix GRPC protocol error. KServer protocol specifies 'bytes_contents' (#34)

2021-10-29

Commit:	cc58c85
Author:	Iman Tabrizian	2021-10-29 08:48:21 -0400
Committer:	GitHub	2021-10-29 08:48:21 -0400

Add state description to model config (#28) * Add state description to the protobuf * Review edits

2021-10-08

Commit:	893d3c1
Author:	Tanmay Verma	2021-10-08 10:02:08 -0700
Committer:	GitHub	2021-10-08 10:02:08 -0700

Add response cache enable setting in model config (#30) * Add response cache enable setting in model config * Format fix * Use composite message for response cache settings

2021-10-05

Commit:	fe7e548
Author:	Tanmay Verma	2021-10-05 12:02:22 -0700
Committer:	GitHub	2021-10-05 12:02:22 -0700

Add clarification for rate limiter config priority (#29)

Commit:	e726d90
Author:	David Goodwin	2021-10-05 11:18:26 -0700

Remove some legacy 'custom backend' references

2021-09-24

Commit:	86f1931
Author:	Tanmay Verma	2021-09-24 13:55:14 -0700
Committer:	GitHub	2021-09-24 13:55:14 -0700

Document memory impact of the output_copy_stream (#27)

2021-09-20

Commit:	6b6e981
Author:	Ashwini Khade	2021-09-20 20:10:46 +0000

bug fix

Commit:	5ab636c
Author:	Ashwini Khade	2021-09-20 19:04:05 +0000

add more configuration params for ORT

2021-08-12

Commit:	ce91438
Author:	Kris Hung	2021-08-12 13:41:57 -0700
Committer:	GitHub	2021-08-12 13:41:57 -0700

Extend START, END, READY controls to allow BOOL type (#22) * Add bool type * Update identifier * Update identifier Co-authored-by: Kris Hung <krish@krish-dt.nvidia.com>

2021-05-27

Commit:	2492327
Author:	GuanLuo	2021-05-27 14:53:03 -0700
Committer:	GitHub	2021-05-27 14:53:03 -0700

Add host policy field (#17) * Add numa id field * Enforce the NUMA id to be the same as GPU id for GPU instance * Modify to "host_policy" as a more general approach * Address comment * Fix rebase artifact

2021-05-26

Commit:	a0e3d6d
Author:	Hemant Jain	2021-05-26 15:00:41 -0700
Committer:	GitHub	2021-05-26 15:00:41 -0700

Add support for DLA/secondary device specification (#18) * Add support for DLA/secondary device specification * Address review comments * Improve description and other cleanup

2021-05-14

Commit:	996299e
Author:	GuanLuo	2021-05-13 20:15:21 -0700
Committer:	GitHub	2021-05-13 20:15:21 -0700

Add 'passive' field in ModelInstanceGroup (#16)

2021-05-11

Commit:	47f791e
Author:	deadeyegoodwin	2021-05-11 08:50:11 -0700
Committer:	GitHub	2021-05-11 08:50:11 -0700

Integrate minor doc changes (#15)

2021-04-16

Commit:	011b7ac
Author:	David Goodwin	2021-04-01 17:03:47 -0700
Committer:	deadeyegoodwin	2021-04-15 17:20:05 -0700

Move protobuf to common

Commit:	feaebe7
Author:	David Goodwin	2021-04-14 14:50:32 -0700
Committer:	deadeyegoodwin	2021-04-15 17:20:05 -0700

Integrate change from triton-inference-server/server repo > e2208d2dd5effd0 src/core/grpc_service.proto > commit 09271f9c4d4d935bd9667dd2be2208d2dd5effd0 > Author: GuanLuo <41310872+GuanLuo@users.noreply.github.com> > Date: Mon Apr 5 09:48:32 2021 -0700 > > Add end point to unload model and its dependents (#2684)