Project huggingface/text-generation-inference
On this site you can view the documentation of the Protocol Buffers (Protobuf) files / API in the Text-generation-inference project.
Last checked for updates at:
(we're working on daily update checks)
This website is new and is still being built. You're welcome to talk with us about possible improvements we can make.
Source
Generated from commit: a1aac78
This documentation applies to the following branches and tags, which have the same .proto
files as commit a1aac78:
- Branches: add_chunked_atn, add_chunked_attn, add_deepseekv3, add_L4, add_vlm_chunking, adjust-mllama-test-output, aiter_kernels, auto_length, baichuan2-13b, bump-kernel-versions, chunked_attn_l4, ci-update_xpu_image, debugging-timeouts, enable-transformers-vlm, feat-backend-llamacpp, fix_fp8_llama3.2, fix-tp, flashinfer-0.2.5, gaudi/add-ci, gha_sccache_use_secrets, git_3.1.1, git_3.2.0, git_3.2.1, git_3.3.0, git_v2.4.1, git_v3.0.0, git_v3.0.1, git_v3.0.2, git_v3.1.0, git_v3.2.2, git_v3.2.3, improve-tool-call-and-response-ids, ipex-moe, kvrouter, kvrouter-endpoints, main, message-more-info, more_logs, neuron_backend_ci_test, new_minor_version, nix/pytorch-2.5.1, no_root_user, no_root_user2, origin/slind_window_fix, pr-2711-ci-branch, pr-2784-ci-branch, pr-2840-ci-branch, pr-2954-ci-branch, pr-3002-ci-branch, pr-3004-ci-branch, pr-3018-ci-branch, proxy_sse_engine_state, release-3.2.4, response-header-metrics, s3-cache, tmp_invariants, triton_fix, trtllm/executor_stats, update-jsonschema, upgrade-outlines, use_updated_kernels, vllm/setup, zstd
- Tags: v2.4.1, v3.0.0, v3.0.1, v3.0.2, v3.1.0, v3.1.1, v3.2.0, v3.2.1, v3.2.2, v3.2.3, v3.3.0
Files
This documentation is generated from the following files: