Proto commits in modelscope/dash-infer

These 6 commits are when the Protocol Buffers files have changed:

2025-03-08

Commit:	a9cbbdc
Author:	yjc9696	2025-03-08 20:24:18 +0800
Committer:	GitHub	2025-03-08 20:24:18 +0800

support MOE EP (#73) Co-authored-by: yangjiacheng.yjc <yangjiacheng.yjc@alibaba-inc.com>

The documentation is generated from this commit.

2024-12-20

Commit:	163850f
Author:	zhenglaiwen.zlw	2024-12-20 14:45:21 +0800
Committer:	zhenglaiwen.zlw	2024-12-20 15:03:01 +0800

some bugfix - uuid crash issue - update lora implement - set page size by param - delete deprecated files

2024-12-16

Commit:	a8b9f8e
Author:	Jiejing Zhang	2024-12-09 11:31:20 +0800
Committer:	zhenglaiwen.zlw	2024-12-16 10:44:59 +0800

Update For Version 2.0: add support for CUDA and VLM (#43) * release dashinfer 2.0 version thirdparty: add cutlass. python: spanattention build from source. benchmark: add stop model in the end.

2024-12-12

Commit:	a216786
Author:	Jiejing Zhang	2024-12-09 11:31:20 +0800
Committer:	Jiejing Zhang	2024-12-12 16:25:48 +0800

Update For Version 2.0: add support for CUDA and VLM (#43) * release dashinfer 2.0 version thirdparty: add cutlass. python: spanattention build from source. benchmark: add stop model in the end.

2024-05-13

Commit:	9ef6e35
Author:	zhenglaiwen.zlw	2024-05-13 14:36:32 +0800
Committer:	zhenglaiwen.zlw	2024-05-13 16:12:26 +0800

fix memory leak bug, add default config to helper, update convert_model api - bugfix - helper: check if get empty generated_elem - fix python input memory leak - avoid async copy python inputs - fix bug caused by inconsistent definition of RequestHandle - engine - worker, model: EnqueueRequest -> StartRequestImpl - generation: output token_logprobs - helper - add defualt config - add ConfigManager to merge and check user config - use torch related api only within the helper class - release torch model after conversion - examples - cpp: erase screen before get inputs - py: shutdown executor after finishing tasks - py: use jinja template to format prompt - py: update ipynb basic example and corresponding doc - doc - add model_type to root readme - update modelscope notebook pic and doc - update future plan in root readme

2024-04-04

Commit:	877529e
Author:	Laiwen Zheng	2024-04-04 21:47:20 +0800
Committer:	Laiwen Zheng	2024-04-04 21:50:11 +0800

add source code