Releases: vllm-project/semantic-router
v0.1.0 - Iris
What's Changed
- feat: support auto-enable reasoning mode based on intention by @Xunzhuo in #1
- fix: remove no needed todo and verify CI by @Xunzhuo in #2
- project: add bench and site owners by @Xunzhuo in #4
- project: add code of conduct by @Xunzhuo in #5
- chore: unify docker images by @Xunzhuo in #6
- fix: use the correct go test file name. by @yafengio in #7
- ci: disable notify action for now by @Xunzhuo in #10
- docs: semantic cache stale types and implementation by @gluonfield in #9
- chore: rm readthedocs as it's deprecated by @Xunzhuo in #12
- Removed redundant / from code img by @tao12345666333 in #13
- chore: Update CONTRIBUTING.md by @cryo-zd in #17
- chore: add DCO requirement in CONTRIBUTING.md by @cryo-zd in #18
- fix(cache): cleanup expired cache entries during update operations by @QIN2DIM in #16
- chore(logging): unify the logging method by @ZeroZ-lab in #19
- fix: make reasoning effort configurable by @OneZero-Y in #21
- docs: add vsr star history diagram by @Xunzhuo in #26
- docs: add repo link in CONTRIBUTING.md by @cryo-zd in #27
- project: add acknowledgements to huggingface-candle by @Xunzhuo in #28
- chore: replace fmt.Printf with log.Printf for logging by @cryo-zd in #29
- doc: update workflow to create config.yaml by @rootfs in #30
- feat: implement batch classification API by @OneZero-Y in #24
- chore: 1) install rust if not present 2) expose bench params in env var by @rootfs in #54
- feat: Add comprehensive monitoring metrics for batch classification API by @OneZero-Y in #58
- docs: add pre-commit requirement code quality checks to contributing by @OneZero-Y in #60
- feat: reasoning model controller by @tao12345666333 in #56
- test: add unit tests for getModelFamilyAndTemplateParam by @tao12345666333 in #63
- docs: add reasoning model metrics by @tao12345666333 in #64
- feat: add test framework for classifier with dependency injection by @aeft in #57
- project: add vllm semantic router v0.1 roadmap by @Xunzhuo in #22
- test: add unit test around ttft pkg by @yuluo-yx in #68
- feat: code polish on classifier by @yuluo-yx in #67
- feat: robust model name filter for DeepSeek by @tao12345666333 in #69
- fix: correct candle-binding replace path in go.mod files by @aeft in #65
- project: add blog section by @Xunzhuo in #70
- chore: only run the workflow notify-owners on vllm-project/semantic-router by @liangyuanpeng in #72
- feat(observability): structured JSON logs and event fields by @tao12345666333 in #66
- chore: Normalize comment punctuation to use English period by @cryo-zd in #79
- chore: Use (*OpenAIRouter)(nil) for interface compliance check by @cryo-zd in #77
- pricing: add currency label and change the metric name to llm_model_cost_total by @tao12345666333 in #80
- test: add go vet to CI by @cryo-zd in #81
- feat(logging): adopt zap as unified logging library by @tao12345666333 in #83
- docs: add python install setups in install-local by @yuluo-yx in #78
- feat(config): watch config file and hot-reload router without restart by @tao12345666333 in #84
- chore: remove GPU and model params in config. Backend and model aware optimization will be handled in the control plane by @rootfs in #93
- chore: add go mod tidy check by @Xunzhuo in #99
- fix: startup config for docker-compose by @liangyuanpeng in #73
- fix: don't set reasoning effort for non-reasoning models by @rootfs in #97
- chore: add github action badge in README by @yuluo-yx in #102
- refactor: use slices.Contains for readability and consistency by @cryo-zd in #104
- test: add more test cases and refactor SelectBestModelForCategory/SelectBestModelFromList/InitializeJailbreakClassifier for testability by @aeft in #101
- docs: add github action badge for docs index by @yuluo-yx in #103
- feat: add milvus persistent storage support by @rootfs in #105
- Slight readme changes by @LysandreJik in #25
- refactor: move classifier model init to classifier.go and unify the classifier model init logic by @aeft in #113
- docs: add eslint check for docs website by @yuluo-yx in #114
- Refactor: use worker pool for batch classification concurrency by @cryo-zd in #115
- feat: add comprehensive unit tests for entropy-based routing. Tests c… by @rootfs in #112
- docs: reasoning quickstart by @tao12345666333 in #110
- o11y: Add TTFT and TPOT histograms for SLOs by @tao12345666333 in #126
- docs: add markdown lint check and fix md lint style by @yuluo-yx in #117
- Feature Enhancement: Batch Inference Support in candle-binding by @OneZero-Y in #71
- infra: add yaml lint check and fix yaml style by @yuluo-yx in #131
- perf: enable concurrent classification via Arc+clone by @cryo-zd in #127
- feat: implement dataset-agnostic router reasoning benchmark by @rootfs in #125
- o11y: Add request error counters by @tao12345666333 in #132
- logging: unify stdlib log usage to pkg/observability (zap) by @tao12345666333 in #134
- fix: add comments for readability by @JaredforReal in #135
- docs(installation): update Go version requirement and add test tip for model downloads by @samzong in #146
- docs: reorder the quickstart pages by @Xunzhuo in #143
- project: add ack for kubernetes by @Xunzhuo in #141
- docs: sync blog from official vLLM by @Xunzhuo in #142
- infra: refactor makefile by @yuluo-yx in #149
- infra: update Dockerfile.extproc by @yuluo-yx in #158
- fix: use request id to locate the correct cache entry to update by @aeft in #154
- feat: add codespell check and tidy linter check config files by @yuluo-yx in #159
- fix: miss copy tools dir in dockerfile by @lengrongfu in #161
- metrics: Add request-level token histograms by @tao12345666333 in #157
- docs: add repo URL in docker/README.md by @cryo-zd in #163
- [Docs] remove discarded fields fro...