LangBot

mirror of https://github.com/langbot-app/LangBot.git synced 2026-06-13 17:26:04 +00:00

Author	SHA1	Message	Date
huanghuoguoguo	c9ef788072	Fix agent runner steering and lifecycle hardening	2026-06-13 17:46:50 +08:00
huanghuoguoguo	9cf99815ba	feat(agent-runner): audit steering injection	2026-06-13 17:46:50 +08:00
huanghuoguoguo	c10ce6cc2e	chore: commit workspace changes	2026-06-13 17:46:50 +08:00
huanghuoguoguo	86ec12a391	feat(agent-runner): enforce typed host permissions	2026-06-13 17:46:50 +08:00
huanghuoguoguo	4e016ad23e	fix(agent-runner): harden state and event APIs	2026-06-13 17:46:31 +08:00
huanghuoguoguo	5831198f38	refactor(agent-runner): remove protocol_version from various components and update related documentation	2026-06-13 17:46:31 +08:00
huanghuoguoguo	7675f565ff	test(agent): harden runner persistence coverage	2026-06-13 17:46:31 +08:00
huanghuoguoguo	54bba1a1f5	feat(agent-runner): expose skill resources through host context	2026-06-13 17:45:53 +08:00
huanghuoguoguo	a6a90f7d1b	test: cover host skill tool scoping	2026-06-13 17:45:53 +08:00
huanghuoguoguo	4a8c1a76d7	refactor(agent-runner): use protocol version field	2026-06-13 17:45:53 +08:00
huanghuoguoguo	2de6d15d07	refactor(provider): formalize tool lookup contract	2026-06-13 17:45:53 +08:00
huanghuoguoguo	f1a44ea8a8	refactor agent runner orchestration boundaries	2026-06-13 17:45:53 +08:00
huanghuoguoguo	3773e3dfaf	fix(agent-runner): align plugin runner runtime boundaries	2026-06-13 17:45:53 +08:00
huanghuoguoguo	23d3b7c279	feat(agent-runner): add bounded native tool artifacts	2026-06-13 17:45:53 +08:00
huanghuoguoguo	058721cca3	feat(agent-runner): expose effective prompt and transcript history	2026-06-13 17:45:53 +08:00
huanghuoguoguo	e13a3b845c	refactor(agent-runner): make agent binding and auth snapshot explicit	2026-06-13 17:45:53 +08:00
huanghuoguoguo	dc4cf5711e	refactor(agent-runner): simplify event-first entry path	2026-06-13 17:45:53 +08:00
huanghuoguoguo	1384d328d6	refactor(agent-runner): align config with agent semantics	2026-06-13 17:45:14 +08:00
huanghuoguoguo	16faeca508	refactor(agent-runner): remove host context windowing	2026-06-13 17:45:14 +08:00
huanghuoguoguo	4852b21f9b	feat(agent-runner): normalize binding config boundaries	2026-06-13 17:45:14 +08:00
huanghuoguoguo	0b9778abd9	fix: enforce agent run API permissions	2026-06-13 17:45:14 +08:00
huanghuoguoguo	c296c187f4	fix(agent-runner): authorize external runner tools	2026-06-13 17:45:14 +08:00
huanghuoguoguo	94c0adc8a1	docs(agent-runner): align runner protocol boundaries	2026-06-13 17:45:14 +08:00
huanghuoguoguo	fc2dc34ecf	fix(agent-runner): stabilize event context and streams	2026-06-13 17:45:14 +08:00
huanghuoguoguo	819a2843e7	refactor(agent-runner): tighten protocol v1 runtime boundaries	2026-06-13 17:44:44 +08:00
huanghuoguoguo	96fa9e1eeb	feat(agent-runner): align protocol adapter terminology	2026-06-13 17:44:44 +08:00
huanghuoguoguo	b4ae049c54	feat(agent-runner): route pipeline runs through event-first flow - run_from_query() now delegates to run(event, binding) instead of maintaining a separate legacy execution path - Pipeline Query is converted to AgentEventEnvelope via PipelineCompatAdapter - Pipeline config is converted to AgentBinding with StatePolicy - bound_plugins authorization preserved from Pipeline - Legacy compatibility fields preserved: - query_id → context.runtime.query_id → session registry - prompt → context.compatibility.extra.prompt (not top-level) - params → context.compatibility.extra.params (with proper filtering) - max-round → bootstrap.messages and compatibility.legacy_messages - Pipeline path gains event-first host capabilities: - EventLog and Transcript writing - ArtifactStore registration - PersistentStateStore for state.updated - Removed legacy handlers: - _handle_artifact_created_query() (replaced by _handle_artifact_created) - _handle_state_updated() (replaced by _handle_state_updated_event) This change unifies the execution path while preserving backward compatibility for Pipeline-based runners. EventGateway is not implemented in this branch; only the event-first entry point is reserved.	2026-06-13 17:44:44 +08:00
huanghuoguoguo	d1e49a5b44	feat(agent-runner): add persistent state APIs	2026-06-13 17:44:44 +08:00
huanghuoguoguo	2e0343cb21	feat(agent-runner): scope event-first state by binding	2026-06-13 17:44:44 +08:00
huanghuoguoguo	53c9199df8	feat(agent-runner): persist created artifacts	2026-06-13 17:44:44 +08:00
huanghuoguoguo	bec11e5a18	feat(agent-runner): add artifact store pull APIs	2026-06-13 17:44:44 +08:00
huanghuoguoguo	a31f910f10	feat(agent-runner): add event-first context facts and pull APIs Add EventLog and Transcript persistence entities for storing auditable event facts and conversation history projection. Implement event-first AgentRunContext builder that produces Protocol v1 compliant context payloads with required fields: event, delivery, context (ContextAccess). Key changes: - EventLog ORM: auditable event records with indexes - Transcript ORM: conversation history projection with composite indexes - AgentRunContextBuilder: Protocol v1 payload with delivery, context, bootstrap - EventLogStore/TranscriptStore: async stores for fact sources - Host action handlers: HISTORY_PAGE, HISTORY_SEARCH, EVENT_GET, EVENT_PAGE - Context validation: build_context output validates via SDK AgentRunContext - Alembic migration for event_log and transcript tables - Alembic env.py imports all ORM models for autogenerate discovery Legacy compatibility: max-round messages go into bootstrap.messages and compatibility.legacy_messages, not top-level messages field.	2026-06-13 17:44:44 +08:00
huanghuoguoguo	c1dc5e3970	fix(agent-runner): package context for plugin execution	2026-06-13 17:44:44 +08:00
huanghuoguoguo	d8d98b0838	feat: make agent runner config schema driven	2026-06-13 17:44:44 +08:00
huanghuoguoguo	c97ea27d42	chore(agent): remove v1 wording from runner internals	2026-06-13 17:42:59 +08:00
huanghuoguoguo	bbbbc05201	feat(agent): reserve stable runner event names	2026-06-13 17:42:59 +08:00
huanghuoguoguo	752ac6e9d2	feat(agent-runner): enrich plugin runner host context	2026-06-13 17:42:59 +08:00
huanghuoguoguo	9dfddd4927	fix: log agent runner best-effort failures	2026-06-13 17:42:59 +08:00
huanghuoguoguo	9f8dd6cbe4	test: address agent runner review comments	2026-06-13 17:42:59 +08:00
huanghuoguoguo	6d0e6dcc63	feat: support dynamic agent runner defaults	2026-06-13 17:42:21 +08:00
huanghuoguoguo	2123ef5816	feat(plugin): implement INVOKE_RERANK handler with run-scoped authorization - Add invoke_rerank action handler in plugin handler - Validate rerank model access via run session - Cap documents at 64 for API limit - Return sorted results by relevance score	2026-06-13 17:41:37 +08:00
huanghuoguoguo	6ef40fbd68	perf(agent-runner): improve session registry and orchestrator efficiency - Add pre-computed _authorized_ids (frozenset) at session registration for O(1) lookup - Refactor is_resource_allowed() from linear search to set membership check - Add thread-safe locking to get_session_registry() singleton - Cache _session_registry and _state_store references in orchestrator __init__ - Add asyncio.gather() for parallel resource building in AgentResourceBuilder - Create shared test fixtures in tests/unit_tests/agent/conftest.py - Update test files to import from shared conftest.py Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-06-13 17:41:37 +08:00
huanghuoguoguo	45f150da2d	feat(agent-runner): integrate AgentRunner Protocol v1 with plugin system Phase 0 integration complete - verified minimal loop with local-agent stub runner. Changes: - Add AgentRunOrchestrator for plugin-based agent execution - Add AgentResultNormalizer for Protocol v1 result conversion - Add AgentRunnerDescriptor for runner ID parsing (plugin:author/name/runner) - Update chat handler to use new orchestrator instead of direct runner lookup - Add plugin handler methods for list_agent_runners and run_agent - Add connector methods for AgentRunner protocol forwarding - Update pipeline API to include runner options in metadata - Add integration docs and implementation plan Integration verified: - Runner: plugin:langbot/local-agent/default - Input: "你好" - Output: [stub] Echo: 你好 - Date: 2026-05-10 10:09 Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-06-13 17:41:37 +08:00
RockChinQ	672abfe95d	refactor(core): remove pre-3.x legacy config migration system The pkg/core/migrations system (m001-m043 DBMigration-style config migrations, MigrationStage, and the core.migration base class) only ever ran when upgrading from LangBot 3.x. The last 3.x release is over a year old and is no longer supported, so this dead code is removed entirely: - delete pkg/core/migrations/ (43 mXXX_*.py + __init__) - delete pkg/core/migration.py (base class + registry) - delete pkg/core/stages/migrate.py (MigrationStage) - drop 'MigrationStage' from boot.py stage_order - delete tests/unit_tests/core/test_migration.py (tested the removed base class)	2026-06-13 05:26:01 -04:00
huanghuoguoguo	9ecb587ac0	refactor(provider): use LiteLLM as unified LLM requester backend (#2150 ) * refactor(provider): use LiteLLM as unified LLM requester backend - Replace 23+ individual requester implementations with unified litellmchat.py - Add litellm_provider field to 27 YAML manifests for provider routing - Delete redundant requester subclasses - Add unit tests for LiteLLMRequester (29 tests) - Fix num_retries parameter name (was max_retries) - Fix exception handling order for subclass exceptions LiteLLM provides unified API for 100+ providers, eliminating need for provider-specific requesters. * fix: ruff format provider.py Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com> * refactor(provider): simplify LiteLLM requester usage handling - Remove unused Anthropic-specific tool schema generation - Share completion argument construction between normal and streaming calls - Use LiteLLM/OpenAI native usage fields for monitoring - Collect stream token usage from LiteLLM stream_options - Update LiteLLM requester tests for unified usage fields * restore: restore deleted provider requester files Restore individual provider requester implementations that were removed in `de61b5d3`. These files coexist with the unified litellmchat.py backend. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com> * feat: update requesters and improve provider selection UI - Added `litellm_provider` field to various requesters' YAML configurations. - Removed obsolete Python requester files for OpenRouter, PPIO, QHAIGC, ShengSuanYun, SiliconFlow, Space, TokenPony, VolcArk, and Xai. - Introduced new requesters for Tencent and Together AI with corresponding YAML configurations and SVG icons. - Enhanced the ProviderForm component to include a searchable dropdown for selecting providers, improving user experience. - Updated localization files to include search provider text for both English and Chinese. * fix(provider): align litellm rebase with master * fix(provider): capture streaming token usage; add token observability The LiteLLM streaming requester only captured usage when a chunk had an empty `choices` list. Many OpenAI-compatible gateways (e.g. new-api) and providers send the final usage payload in a chunk that still carries an empty-delta choice, so streamed calls always recorded 0 tokens in the monitoring logs/dashboard (non-streaming worked). - Capture stream usage whenever a chunk carries it, regardless of choices - Add robust _normalize_usage (dict/obj shapes, derive missing total_tokens) - Register litellm in bootutils/deps.py (was in pyproject only) - Add MonitoringService.get_token_statistics + /monitoring/token-statistics endpoint: summary, per-model breakdown, token timeseries, and a zero-token-success data-quality signal - Add TokenMonitoring dashboard tab (summary tiles, stacked token chart, per-model table) + i18n (en/zh) - Regression tests for stream usage capture and usage normalization Verified end-to-end against a real OpenAI-compatible endpoint with gpt-5.5 and claude-opus-4-8: tokens now recorded non-zero for both streaming and non-streaming paths. * refactor(provider): simplify litellm capabilities * style: simplify wrapped expressions * feat(models): persist context metadata * fix(provider): handle dict embeddings and openai-compatible rerank in LiteLLMRequester - invoke_embedding: support both object- and dict-shaped response.data entries (OpenAI-compatible gateways like new-api return dicts) - invoke_rerank: litellm.arerank rejects the 'openai' provider, so for openai-compatible (or unspecified) providers call the standard Jina/Cohere-style POST /v1/rerank endpoint directly over HTTP - accept both 'relevance_score' and 'score' fields in rerank results - add unit tests for the openai-compatible HTTP rerank path * feat(provider): enforce requester support_type when adding models - frontend: AddModelPopover only shows model-type tabs (llm/embedding/ rerank) that the provider's requester declares in its manifest support_type; ModelsDialog fetches requester manifests and maps requester -> support_type, passed down through ProviderCard - backend: add _validate_provider_supports guard in create_llm_model / create_embedding_model / create_rerank_model so a model cannot be attached to a provider whose requester does not support that type, even if the frontend restriction is bypassed (manifests without support_type are allowed for backward compatibility) - manifests: correct support_type for providers that do not offer all three model types: - llm only: anthropic, deepseek, groq, moonshot, openrouter, xai - llm + text-embedding: openai, gemini, mistral - add rerank to new-api (verified working via /v1/rerank) - set llm + text-embedding + rerank for aggregator/unknown gateways * feat(provider): add searchable alias to requester manifests - add a free-text 'alias' field to every requester manifest spec, containing the vendor's English/Chinese names, pinyin, common nicknames and flagship model-series names (e.g. moonshot -> kimi, 月之暗面; zhipu -> glm, 智谱清言) - frontend: ProviderForm requester search now also matches against alias (substring/contains), so searching 'kimi' surfaces Moonshot, '硅基' surfaces SiliconFlow, etc. - also fix support_type: openrouter (relay) supports embedding+rerank; LangBot Space gains rerank (coming soon) * fix(provider): make support_type guard defensive against incomplete model_mgr - _validate_provider_supports now uses getattr to gracefully skip when model_mgr / provider_dict / manifest lookup is unavailable, instead of raising AttributeError (fixes unit tests that mock ap.model_mgr as a bare SimpleNamespace) - add TestValidateProviderSupports covering: allow supported type, reject unsupported type, allow when support_type missing, allow when provider unknown, degrade safely when model_mgr is incomplete * fix(persistence): guard 0004 migration against missing llm_models table The 0004_add_llm_model_context_length migration called inspector.get_columns('llm_models') unconditionally, raising NoSuchTableError when the table does not exist (e.g. migrating a fresh/empty DB, as exercised by the integration tests where create_all() registers no tables because the ORM models are not imported). Every other migration guards with a table-existence check first; add the same guard here for both upgrade and downgrade. Also restore the test head assertion to 0004 (it had been lowered to 0003 to mask this failure). * Merge branch 'master' into feat/litellm Resolve conflicts: - uv.lock: regenerated via 'uv lock' to reconcile litellm/fastuuid (ours) with openai bump (master). - Alembic migrations: master added 0004_add_mcp_readme while this branch added 0004_add_llm_model_context_length, both as children of 0003 (would create multiple heads). Re-chain the litellm migration as 0005_add_llm_model_context_length with down_revision=0004_add_mcp_readme for a single linear head. Update test head assertion accordingly. * fix(persistence): shorten migration revision id to fit varchar(32) PostgreSQL stores alembic_version.version_num as varchar(32). '0005_add_llm_model_context_length' (33 chars) overflowed it, raising StringDataRightTruncationError in the PG migration tests. Rename the revision (and file) to '0005_add_llm_context_length' (27 chars) and update the head assertions in both SQLite and PostgreSQL migration tests. --------- Co-authored-by: Claude Opus 4.7 <noreply@anthropic.com> Co-authored-by: fdc310 <2213070223@qq.com> Co-authored-by: RockChinQ <rockchinq@gmail.com>	2026-06-13 16:59:48 +08:00
RockChinQ	2b6dcfe9c7	feat(survey): add bot_response_success_100 milestone trigger event Counts successful non-WebSocket bot responses (persisted in the metadata table as survey_bot_response_count, survives restarts) and fires the bot_response_success_100 survey event once the instance reaches 100 responses. Counting stops after the milestone has been triggered. Existing first_bot_response_success behavior unchanged. 6 new unit tests.	2026-06-12 09:40:07 -04:00
RockChinQ	dd96da895c	feat(telemetry): payload v2 with feature usage counters and instance heartbeat Per-query events now carry event_type='query' and a features JSON object: - tool_calls by source (native/plugin/mcp/skill) via ToolManager - tool_call_rounds, kb usage (count/engine plugins/retrieved entries) via local-agent - sandbox execs/errors via BoxService - activated_skills and bound mcp_servers snapshots New instance_heartbeat event (startup + daily) reports anonymous instance profile: deploy platform, database/vdb kind, box backend/availability, adapter type names, and resource counts. Respects space.disable_telemetry. All collection helpers are defensive and never break the pipeline. Verified: ruff, 37 telemetry unit tests (13 new), 504 box/provider/pipeline tests.	2026-06-12 08:11:43 -04:00
RockChinQ	47ade18596	fix(log): roll daily log file at midnight for long-running processes The log filename was computed once at init_logging() startup and the RotatingFileHandler only rotated by size, so a process running across midnight kept appending every subsequent day's logs to the start-day file (langbot-<start date>.log). No file ever appeared for the current day until the process was restarted, confusing users into thinking logging had stopped. Replace RotatingFileHandler with DailyGroupedRotatingFileHandler, which switches to langbot-<current date>.log when the local date changes while still doing size-based numbered rotation within a day. On-disk naming stays compatible with the maintenance log-retention cleanup (LOG_FILE_PATTERN). Adds regression tests.	2026-06-10 04:58:11 -04:00
Junyan Chin	8e558ad3a1	Feat/saas sandbox adaptation (#2234 ) * fix(box): trust Box-reported skill paths when filesystem is not shared In separated deployments (Docker Compose, k8s sidecar, --standalone-box, remote runtime.endpoint) the Box runtime owns its own filesystem, so the skill package_root it reports via list_skills is not resolvable on the LangBot side. LangBot's reload_skills and build_skill_extra_mounts validated those paths with os.path.isdir() against its own filesystem, which silently dropped every skill in such deployments — breaking the sandbox skill feature for the nsjail/SaaS backend. Add BoxService.shares_filesystem_with_box, derived from the connector transport (stdio = shared, WebSocket = separated), with an explicit override seam for tests/embedders. Gate both isdir() guards on it: keep local validation in shared-fs stdio mode, trust Box-reported paths otherwise. The Box runtime only reports skills found on its own filesystem, so those paths are valid there by construction. Adds topology-derivation tests (real connector, no mocks) and skill-retention tests for both shared and separated filesystems. * build(docker): ship a self-contained nsjail sandbox backend in the image Compile nsjail 3.6 from source in a dedicated multi-stage build and carry only the binary plus its runtime libs (libprotobuf32, libnl-route-3-200) into the final image. This lets the Box runtime isolate sandboxed code via nsjail user/mount/pid/net namespaces without a host Docker socket — the prerequisite for running Box on LangBot Cloud (k8s), where mounting docker.sock would grant node root and is not acceptable for multi-tenant. The build toolchain (build-essential/bison/flex/protobuf-dev/libnl-dev) stays in the nsjail-build stage and is not present in the shipped image. Verified: image builds (583MB), nsjail --help exits 0, libraries resolve, and the real NsjailBackend executes an isolated command end-to-end on a v6.1/cgroup2 host matching LangBot Cloud prod (rlimit fallback path, since container /sys/fs/cgroup is read-only; PID-namespace isolation confirmed). * feat(box): SaaS guard to force a single global sandbox scope Add system.limitation.force_box_session_id_template: when non-empty it overrides every pipeline's box-session-id-template at resolve time, pinning all queries to one shared sandbox (e.g. {global}). This is the authoritative, unbypassable guard — it runs on every exec call, so editing the pipeline config via API cannot escape it. The web UI locks the Sandbox Scope selector via a combined box_scope_editable flag (box available AND not forced). * build(deps): pin langbot-plugin==0.4.2b1 (nsjail cgroup container-safety beta) * fix(web): show forced sandbox scope + make disabled tooltip tap-friendly When a SaaS deployment pins every pipeline to a fixed sandbox scope via system.limitation.force_box_session_id_template, the Sandbox Scope selector was correctly locked but still displayed the pipeline's stored value (e.g. the per-chat default), misrepresenting the scope that the runtime actually enforces on every exec. Coerce the displayed/saved value to the forced template so the locked selector truthfully shows the active scope (e.g. Global). Also fix the disabled_tooltip being invisible on touch devices: hover-only Radix tooltips never open without a pointer, so the explanation of why the field is locked could not be read on mobile. Wrap the info icon so a tap toggles the tooltip while desktop hover still works. * feat(web): hide sidebar new-version prompt for edition=cloud Cloud instances are upgraded centrally by the operator, so surfacing a GitHub 'new version available' badge to tenants is misleading and actionable only by the operator. Skip the release check entirely when edition=cloud. * style(web): prettier formatting for DisabledTooltipIcon ternary * chore(deps): bump langbot-plugin to 0.4.2b2 Picks up the SDK fix that creates a read-write host_path before the nsjail bind-mount, fixing the SaaS MCP shared-workspace sandbox failure (exec exit 255 with empty output when host_path didn't exist). * chore(deps): bump langbot-plugin to 0.4.2b3 Picks up the nsjail /dev-node fix so stdio MCP servers (uvx-launched) can start under force_global_sandbox instead of failing with 'Connection closed / please check URL'. * fix(web): show real MCP runtime status on installed extensions list The installed-extensions list badge keyed solely off the enable flag, so a server that was still CONNECTING (or in ERROR) was shown as 'Connected'. Reflect the actual runtime_info.status (connecting/connected/error/disabled) with matching colors, and poll quietly every 3s while any MCP server is connecting so the badge transitions without a manual refresh. * chore(deps): bump langbot-plugin to 0.4.2b4 Picks up the 30s start_managed_process timeout so cold uvx MCP bootstraps don't get torn down mid-install. * style(web): satisfy prettier — parenthesize nullish-coalescing in ternary * fix(mcp): isolate transient test sessions from the shared Box session A config-page 'test' (server_name='_', no persisted UUID) ran in the same shared 'mcp-shared' Box session as live MCP servers. A failing test (e.g. empty args) churned that shared session and tore down healthy, already- connected servers — leaving them stuck after exhausting their retries. Mark UUID-less sessions as transient, give them their own isolated Box session ('mcp-test-<uuid>'), and fully delete that session on cleanup so tests can never disturb live servers and don't leak sessions. * fix(mcp): tear down transient test session after test completes A successful config-page test left its isolated 'mcp-test-<uuid>' Box session running (the lifecycle task blocks until shutdown). Wrap the transient test coroutine so it always shuts the session down afterward, preventing isolated test sessions from leaking.	2026-06-09 19:30:17 +08:00
RockChinQ	7330732f62	fix(ci): bump migration head assertion to 0004, apply prettier - Update test_migrations / test_migrations_postgres head assertion from 0003 to 0004 after adding the mcp readme migration. - Reformat MCPForm.tsx / MCPReadme.tsx to satisfy prettier/prettier.	2026-06-06 03:56:14 -04:00

1 2 3

110 Commits