Tencent Opens WeKnora RAG and Agent Knowledge Platform

What WeKnora Is

Tencent has published WeKnora as an open-source, LLM-powered knowledge framework targeting enterprise-grade document understanding, semantic retrieval, and autonomous reasoning [1]. The platform is organized around three core operational modes: RAG-based Quick Q&A for routine lookups, a ReAct Agent for complex multi-step tasks, and a Wiki Mode that autonomously builds and maintains a linked knowledge base from source documents [1].

The release positions WeKnora as infrastructure for organizations that manage large, scattered document repositories and need a self-hostable alternative to cloud-dependent knowledge tooling.

How the Three Modes Work

The RAG-based Quick Q&A mode handles everyday semantic lookups by retrieving relevant document chunks and returning grounded answers. It is designed for speed and directness, serving as the entry point for most user queries [1].

The ReAct Agent mode moves beyond single-pass retrieval. It autonomously orchestrates retrieval steps, invokes MCP tools, and runs web searches to resolve complex, multi-step questions that a single vector lookup cannot answer [1]. The agent reasons iteratively, selecting tools and intermediate steps before producing a final response.

Wiki Mode represents the most autonomous of the three. Agents in this mode distill raw source documents into a self-maintaining, interlinked markdown knowledge base accompanied by an interactive knowledge graph [1]. The system updates that knowledge base as source material changes, reducing the manual overhead of keeping internal documentation current.

Ingestion, Formats, and LLM Integrations

WeKnora handles more than ten document formats, including PDF, Word, images, and Excel [1]. Multi-source connectors support auto-syncing from Feishu, Notion, and Yuque, with additional data sources described as forthcoming [1].

Once ingested, knowledge bases can serve Q&A directly through instant-messaging channels including WeCom, Feishu, Slack, and Telegram [1], which reduces friction for teams that work primarily inside those platforms.

On the model side, WeKnora integrates with more than 20 LLM providers. Named integrations include OpenAI, DeepSeek, Qwen from Alibaba Cloud, Zhipu, Hunyuan, Gemini, MiniMax, NVIDIA, and Ollama [1]. The breadth of provider support means operators can route workloads to locally hosted models through Ollama or to commercial APIs depending on data-sensitivity requirements.

Enterprise and Deployment Architecture

Version 0.6.0 introduced a multi-tenant RBAC system built around a four-tier role matrix: Owner, Admin, Contributor, and Viewer [1]. The system adds per-resource knowledge-base ownership and per-tenant audit logging, giving compliance-focused organizations a traceable record of access and changes [1].

Observability is handled through a Langfuse integration that covers agent reasoning traces, token usage, and full pipeline tracing [1]. For teams operating WeKnora at scale, this provides visibility into where reasoning chains succeed or stall.

The architecture is fully modular. Operators can swap LLMs, vector databases, and storage backends independently [1]. Local and private cloud deployment are both supported, which the project describes as ensuring complete data sovereignty [1]. A command-line interface, weknora CLI v0.4, reached general availability alongside v0.6.0 and includes an mcp serve command [1].

Who the Platform Targets

WeKnora is aimed at enterprise teams managing large document repositories who need semantic search, autonomous reasoning, and a continuously updated internal knowledge base without routing sensitive documents through third-party services [1]. The self-hostable design and support for on-premises LLMs through Ollama make it applicable to organizations in regulated industries or those with strict data-residency requirements.

Teams already using Feishu or Notion as their primary documentation layer can connect those sources directly, reducing the ingestion burden during initial deployment [1].

FAQ

Q. Which document formats does WeKnora support at ingestion? WeKnora handles more than ten formats, with PDF, Word, images, and Excel listed explicitly in the project documentation [1]. Additional format support is not detailed in available sources.

Q. Can WeKnora run entirely on-premises without external API calls? The platform supports local deployment and integrates with Ollama for locally hosted models, which the project describes as enabling complete data sovereignty [1]. Operators choosing commercial LLM providers would still route inference traffic externally.

Q. How does the RBAC system handle access across multiple teams? Version 0.6.0 introduced a four-tier role matrix (Owner, Admin, Contributor, Viewer) with per-knowledge-base ownership and per-tenant audit logging [1]. Multi-workspace user experience and self-service workspaces were also added in that release [1].

Q. What observability tooling does WeKnora expose? WeKnora integrates with Langfuse to provide tracing across agent reasoning steps, token usage, and pipeline execution [1]. No other observability integrations are described in available sources.

Q. Does Wiki Mode require manual curation after initial setup? Wiki Mode is described as self-maintaining: agents distill and update the interlinked markdown knowledge base as source documents change [1]. The degree of human review required for accuracy is not specified in available sources.

Key takeaways

WeKnora provides three distinct operational modes (RAG Q&A, ReAct Agent, Wiki Mode) within a single open-source platform, allowing operators to match the reasoning depth to the task [1].
More than 20 LLM provider integrations, including Ollama for local inference, give teams flexibility to balance cost, latency, and data-residency requirements [1].
The four-tier RBAC system with per-tenant audit logging addresses enterprise compliance requirements that simpler RAG frameworks typically omit [1].
A fully modular architecture lets operators swap vector databases, storage backends, and LLMs without rebuilding the pipeline [1].
Direct connectors to Feishu, Notion, and Yuque, combined with IM-channel Q&A delivery, reduce integration overhead for teams already using those platforms [1].