LangBot

mirror of https://github.com/langbot-app/LangBot.git synced 2026-06-13 09:16:04 +00:00

Go to file

huanghuoguoguo 9ecb587ac0 refactor(provider): use LiteLLM as unified LLM requester backend (#2150 )

* refactor(provider): use LiteLLM as unified LLM requester backend

  - Replace 23+ individual requester implementations with unified litellmchat.py
  - Add litellm_provider field to 27 YAML manifests for provider routing
  - Delete redundant requester subclasses
  - Add unit tests for LiteLLMRequester (29 tests)
  - Fix num_retries parameter name (was max_retries)
  - Fix exception handling order for subclass exceptions

  LiteLLM provides unified API for 100+ providers, eliminating need for
  provider-specific requesters.

* fix: ruff format provider.py

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>

* refactor(provider): simplify LiteLLM requester usage handling

  - Remove unused Anthropic-specific tool schema generation
  - Share completion argument construction between normal and streaming calls
  - Use LiteLLM/OpenAI native usage fields for monitoring
  - Collect stream token usage from LiteLLM stream_options
  - Update LiteLLM requester tests for unified usage fields

* restore: restore deleted provider requester files

Restore individual provider requester implementations that were
removed in de61b5d3. These files coexist with the unified
litellmchat.py backend.

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>

* feat: update requesters and improve provider selection UI

- Added `litellm_provider` field to various requesters' YAML configurations.
- Removed obsolete Python requester files for OpenRouter, PPIO, QHAIGC, ShengSuanYun, SiliconFlow, Space, TokenPony, VolcArk, and Xai.
- Introduced new requesters for Tencent and Together AI with corresponding YAML configurations and SVG icons.
- Enhanced the ProviderForm component to include a searchable dropdown for selecting providers, improving user experience.
- Updated localization files to include search provider text for both English and Chinese.

* fix(provider): align litellm rebase with master

* fix(provider): capture streaming token usage; add token observability

The LiteLLM streaming requester only captured usage when a chunk had an
empty `choices` list. Many OpenAI-compatible gateways (e.g. new-api) and
providers send the final usage payload in a chunk that still carries an
empty-delta choice, so streamed calls always recorded 0 tokens in the
monitoring logs/dashboard (non-streaming worked).

- Capture stream usage whenever a chunk carries it, regardless of choices
- Add robust _normalize_usage (dict/obj shapes, derive missing total_tokens)
- Register litellm in bootutils/deps.py (was in pyproject only)
- Add MonitoringService.get_token_statistics + /monitoring/token-statistics
  endpoint: summary, per-model breakdown, token timeseries, and a
  zero-token-success data-quality signal
- Add TokenMonitoring dashboard tab (summary tiles, stacked token chart,
  per-model table) + i18n (en/zh)
- Regression tests for stream usage capture and usage normalization

Verified end-to-end against a real OpenAI-compatible endpoint with
gpt-5.5 and claude-opus-4-8: tokens now recorded non-zero for both
streaming and non-streaming paths.

* refactor(provider): simplify litellm capabilities

* style: simplify wrapped expressions

* feat(models): persist context metadata

* fix(provider): handle dict embeddings and openai-compatible rerank in LiteLLMRequester

- invoke_embedding: support both object- and dict-shaped response.data
  entries (OpenAI-compatible gateways like new-api return dicts)
- invoke_rerank: litellm.arerank rejects the 'openai' provider, so for
  openai-compatible (or unspecified) providers call the standard
  Jina/Cohere-style POST /v1/rerank endpoint directly over HTTP
- accept both 'relevance_score' and 'score' fields in rerank results
- add unit tests for the openai-compatible HTTP rerank path

* feat(provider): enforce requester support_type when adding models

- frontend: AddModelPopover only shows model-type tabs (llm/embedding/
  rerank) that the provider's requester declares in its manifest
  support_type; ModelsDialog fetches requester manifests and maps
  requester -> support_type, passed down through ProviderCard
- backend: add _validate_provider_supports guard in create_llm_model /
  create_embedding_model / create_rerank_model so a model cannot be
  attached to a provider whose requester does not support that type,
  even if the frontend restriction is bypassed (manifests without
  support_type are allowed for backward compatibility)
- manifests: correct support_type for providers that do not offer all
  three model types:
  - llm only: anthropic, deepseek, groq, moonshot, openrouter, xai
  - llm + text-embedding: openai, gemini, mistral
  - add rerank to new-api (verified working via /v1/rerank)
  - set llm + text-embedding + rerank for aggregator/unknown gateways

* feat(provider): add searchable alias to requester manifests

- add a free-text 'alias' field to every requester manifest spec,
  containing the vendor's English/Chinese names, pinyin, common
  nicknames and flagship model-series names (e.g. moonshot -> kimi,
  月之暗面; zhipu -> glm, 智谱清言)
- frontend: ProviderForm requester search now also matches against
  alias (substring/contains), so searching 'kimi' surfaces Moonshot,
  '硅基' surfaces SiliconFlow, etc.
- also fix support_type: openrouter (relay) supports embedding+rerank;
  LangBot Space gains rerank (coming soon)

* fix(provider): make support_type guard defensive against incomplete model_mgr

- _validate_provider_supports now uses getattr to gracefully skip when
  model_mgr / provider_dict / manifest lookup is unavailable, instead of
  raising AttributeError (fixes unit tests that mock ap.model_mgr as a
  bare SimpleNamespace)
- add TestValidateProviderSupports covering: allow supported type,
  reject unsupported type, allow when support_type missing, allow when
  provider unknown, degrade safely when model_mgr is incomplete

* fix(persistence): guard 0004 migration against missing llm_models table

The 0004_add_llm_model_context_length migration called
inspector.get_columns('llm_models') unconditionally, raising
NoSuchTableError when the table does not exist (e.g. migrating a
fresh/empty DB, as exercised by the integration tests where
create_all() registers no tables because the ORM models are not
imported). Every other migration guards with a table-existence check
first; add the same guard here for both upgrade and downgrade.

Also restore the test head assertion to 0004 (it had been lowered to
0003 to mask this failure).

* Merge branch 'master' into feat/litellm

Resolve conflicts:
- uv.lock: regenerated via 'uv lock' to reconcile litellm/fastuuid
  (ours) with openai bump (master).
- Alembic migrations: master added 0004_add_mcp_readme while this
  branch added 0004_add_llm_model_context_length, both as children of
  0003 (would create multiple heads). Re-chain the litellm migration as
  0005_add_llm_model_context_length with down_revision=0004_add_mcp_readme
  for a single linear head. Update test head assertion accordingly.

* fix(persistence): shorten migration revision id to fit varchar(32)

PostgreSQL stores alembic_version.version_num as varchar(32).
'0005_add_llm_model_context_length' (33 chars) overflowed it, raising
StringDataRightTruncationError in the PG migration tests. Rename the
revision (and file) to '0005_add_llm_context_length' (27 chars) and
update the head assertions in both SQLite and PostgreSQL migration
tests.

---------

Co-authored-by: Claude Opus 4.7 <noreply@anthropic.com>
Co-authored-by: fdc310 <2213070223@qq.com>
Co-authored-by: RockChinQ <rockchinq@gmail.com>

2026-06-13 16:59:48 +08:00

.github

fix(ci): trigger CLA check on PR reopen

2026-06-10 12:10:52 +08:00

docker

docs(docker): move k8s deployment docs to wiki, drop README_K8S.md

2026-06-07 11:36:39 -04:00

docs

Feat/sandbox (#2072 )

2026-06-03 11:12:39 +08:00

res

chore: Update logo in README files to new resource location

2026-02-23 17:01:37 +08:00

scripts

Feat/test build (#2174 )

2026-05-16 12:05:54 +08:00

src/langbot

refactor(provider): use LiteLLM as unified LLM requester backend (#2150 )

2026-06-13 16:59:48 +08:00

tests

refactor(provider): use LiteLLM as unified LLM requester backend (#2150 )

2026-06-13 16:59:48 +08:00

web

refactor(provider): use LiteLLM as unified LLM requester backend (#2150 )

2026-06-13 16:59:48 +08:00

.dockerignore

feat(lark): 支持商店应用机器人 (#1855 )

2025-12-11 16:54:28 +08:00

.gitignore

feat: add plugin extension pages (iframe rendering, Page SDK, security hardening, i18n)

2026-04-25 19:14:14 +08:00

.mcp.json

Revert "chore(tooling): wire CodeGraph MCP server + agent guidance"

2026-06-03 23:17:34 +08:00

.pre-commit-config.yaml

Feat/shadcn sidebar and page views (#2084 )

2026-03-27 01:51:13 +08:00

AGENTS.md

docs(agents): add --standalone-box flag and box config keys

2026-06-07 08:57:43 -04:00

CLA.md

feat: add Contributor License Agreement (CLA) and signing workflow

2026-06-10 11:49:30 +08:00

CLAUDE.md

chore: add guidance for code agents

2025-11-06 21:34:02 +08:00

codecov.yml

chore: add codecov.yml

2025-10-07 00:15:56 +08:00

CONTRIBUTING.md

feat: add Contributor License Agreement (CLA) and signing workflow

2026-06-10 11:49:30 +08:00

Dockerfile

Feat/saas sandbox adaptation (#2234 )

2026-06-09 19:30:17 +08:00

LICENSE

chore: switch LICENSE to Apache 2.0

2025-09-13 09:44:18 +08:00

main.py

chore: Add PyPI package support for uvx/pip installation (#1764 )

2025-11-16 19:53:01 +08:00

Makefile

Feat/test build (#2174 )

2026-05-16 12:05:54 +08:00

pyproject.toml

refactor(provider): use LiteLLM as unified LLM requester backend (#2150 )

2026-06-13 16:59:48 +08:00

pytest.ini

Feat/test build (#2174 )

2026-05-16 12:05:54 +08:00

README_CN.md

docs(docker): move k8s deployment docs to wiki, drop README_K8S.md

2026-06-07 11:36:39 -04:00

README_ES.md

docs(docker): move k8s deployment docs to wiki, drop README_K8S.md

2026-06-07 11:36:39 -04:00

README_FR.md

docs(docker): move k8s deployment docs to wiki, drop README_K8S.md

2026-06-07 11:36:39 -04:00

README_JP.md

docs(docker): move k8s deployment docs to wiki, drop README_K8S.md

2026-06-07 11:36:39 -04:00

README_KO.md

docs(docker): move k8s deployment docs to wiki, drop README_K8S.md

2026-06-07 11:36:39 -04:00

README_RU.md

docs(docker): move k8s deployment docs to wiki, drop README_K8S.md

2026-06-07 11:36:39 -04:00

README_TW.md

docs(docker): move k8s deployment docs to wiki, drop README_K8S.md

2026-06-07 11:36:39 -04:00

README_VI.md

docs(docker): move k8s deployment docs to wiki, drop README_K8S.md

2026-06-07 11:36:39 -04:00

README.md

docs(docker): move k8s deployment docs to wiki, drop README_K8S.md

2026-06-07 11:36:39 -04:00

run_tests.sh

perf: Filter plugins by component types in pipeline extensions (#1821 )

2025-11-30 20:55:48 +08:00

test-embed.html

Feat/sandbox (#2072 )

2026-06-03 11:12:39 +08:00

uv.lock

refactor(provider): use LiteLLM as unified LLM requester backend (#2150 )

2026-06-13 16:59:48 +08:00

README.md

Production-grade platform for building agentic IM bots.

Quickly build, debug, and ship AI bots to Slack, Discord, Telegram, WeChat, and more.

English / 简体中文 / 繁體中文 / 日本語 / Español / Français / 한국어 / Русский / Tiếng Việt

Website ｜ Features ｜ Docs ｜ API ｜ Cloud ｜ Plugin Market ｜ Roadmap

What is LangBot?

LangBot is an open-source, production-grade platform for building AI-powered instant messaging bots. It connects Large Language Models (LLMs) to any chat platform, enabling you to create intelligent agents that can converse, execute tasks, and integrate with your existing workflows.

Key Capabilities

AI Conversations & Agents — Multi-turn dialogues, tool calling, multi-modal support, streaming output. Built-in RAG (knowledge base) with deep integration to Dify, Coze, n8n, Langflow, Deerflow, Weknora.
Universal IM Platform Support — One codebase for Discord, Telegram, Slack, LINE, QQ, WeChat, WeCom, Lark, DingTalk, KOOK.
Production-Ready — Access control, rate limiting, sensitive word filtering, comprehensive monitoring, and exception handling. Trusted by enterprises.
Plugin Ecosystem — Hundreds of plugins, event-driven architecture, component extensions, and MCP protocol support.
Web Management Panel — Configure, manage, and monitor your bots through an intuitive browser interface. No YAML editing required.
Multi-Pipeline Architecture — Different bots for different scenarios, with comprehensive monitoring and exception handling.

→ Learn more about all features

📍 Practical guides: deploy a multi-platform AI bot in 5 minutes, connect DeepSeek to WeChat, Discord, and Telegram, run a Dify Agent in Discord, Telegram, and Slack, and build an n8n-powered chatbot.

Quick Start

☁️ LangBot Cloud (Recommended)

LangBot Cloud — Zero deployment, ready to use.

One-Line Launch

uvx langbot

Requires uv. Visit http://localhost:5300 — done.

Docker Compose

git clone https://github.com/langbot-app/LangBot
cd LangBot/docker
docker compose up -d

One-Click Cloud Deploy

More options: Docker · Manual · BTPanel · Kubernetes

Supported Platforms

Platform	Status	Notes
Discord	✅	Official
Telegram	✅	Official
Slack	✅	Official
LINE	✅	Official
QQ	✅	Personal & Official API (Channel, DM, Group)
WeCom	✅	Enterprise WeChat, External CS, AI Bot
WeChat	✅	Personal & Official Account
Lark	✅	Official
DingTalk	✅	Official
KOOK	✅	Official
Satori	✅
Email	✅	Matrix, Satori
Matrix	✅	Supports multiple bridged platforms such as Signal, WhatsApp, Messenger, iMessage, Mattermost, Google Chat, IRC, XMPP, Zulip, and more

Supported LLMs & Integrations

Provider	Type	Status
OpenAI	LLM	✅
Anthropic	LLM	✅
DeepSeek	LLM	✅
Google Gemini	LLM	✅
xAI	LLM	✅
Moonshot	LLM	✅
Zhipu AI	LLM	✅
Ollama	Local LLM	✅
LM Studio	Local LLM	✅
Dify	LLMOps	✅
MCP	Protocol	✅
SiliconFlow	Gateway	✅
Aliyun Bailian	Gateway	✅
Volc Engine Ark	Gateway	✅
ModelScope	Gateway	✅
GiteeAI	Gateway	✅
CompShare	GPU Platform	✅
PPIO	GPU Platform	✅
ShengSuanYun	GPU Platform	✅
接口 AI	Gateway	✅
302.AI	Gateway	✅
Qiniu	Gateway	✅

→ View all integrations

Why LangBot?

Use Case	How LangBot Helps
Customer Support	Deploy AI agents to Slack/Discord/Telegram that answer questions using your knowledge base
Internal Tools	Connect n8n/Dify workflows to WeCom/DingTalk for automated business processes
Community Management	Moderate QQ/Discord groups with AI-powered content filtering and interaction
Multi-Platform Presence	One bot, all platforms. Manage from a single dashboard

Live Demo

Try it now: https://demo.langbot.dev/

Email: demo@langbot.app
Password: langbot123456

Note: Public demo environment. Do not enter sensitive information.

Community

Discord Community

Star History

Contributors

Thanks to all contributors who have helped make LangBot better:

Languages

Python 62.7%

TypeScript 35.8%

JavaScript 0.9%

CSS 0.4%

Shell 0.1%

README.md Unescape Escape

Production-grade platform for building agentic IM bots.

Quickly build, debug, and ship AI bots to Slack, Discord, Telegram, WeChat, and more.

What is LangBot?

Key Capabilities

Quick Start

☁️ LangBot Cloud (Recommended)

One-Line Launch

Docker Compose

One-Click Cloud Deploy

Supported Platforms

Supported LLMs & Integrations

Why LangBot?

Live Demo

Community

Star History

Contributors

README.md