How to evaluate TSDB data subscription, topics, filters, consumer groups, checkpoint resume, Kafka comparison, sharing scenarios, and POC tests?
In industrial IoT, intelligent operations, energy management, and financial trading, a time-series database often becomes the system of record for high-volume timestamped data. But storing and querying data is no longer enough. Modern platforms also need fresh time-series data to flow reliably from production systems to analytics, alerting, AI, archiving, and partner applications.
This is where data subscription and sharing capabilities become important. A TSDB with mature subscription support can decouple data producers from consumers, reduce repeated polling, and help organizations build real-time data pipelines without adding unnecessary architecture complexity.
This article explains the core value of data subscription, the capabilities to evaluate, and when built-in subscription is a better fit than an external message queue.
1. The Core Value of Data Subscription
Data subscription decouples production from consumption. In a traditional architecture, each consuming system queries the source database directly. As the number of consumers grows, the source system carries more query load, integration logic becomes duplicated, and applications become tightly coupled.
With subscription capability, the database can publish new data or data changes to downstream systems as a stream. This provides three practical benefits.
- Decouple producers and consumers. Business systems can focus on writing data, while downstream systems subscribe to the data they need. When consumers change or scale, production-side stability is less affected.
- Serve multiple downstream systems. A single time-series data source can feed BI platforms, AI training pipelines, real-time alerting engines, data archiving services, and operational applications. Each consumer can process the same source data for a different purpose.
- Build real-time data pipelines. Subscription mechanisms can deliver data with low latency, often in the millisecond-to-second range depending on implementation and workload. This supports real-time monitoring, anomaly detection, and event-driven operations.
2. Core Subscription Capabilities in a Time-Series Database
Four capabilities matter most when evaluating TSDB subscription support.
2.1 Topic-Based Subscription
Topics are the foundation of data distribution. A strong time-series database should support topic definitions at useful granularities, such as Database, Supertable, Subtable, or Tag conditions.
For example, a topic might include only data from a specific device type, production line, or region. This prevents consumers from receiving irrelevant data and reduces network, storage, and processing overhead.
2.2 Filter Conditions
Beyond topic granularity, row-level filtering is often necessary. A consumer may need only temperature readings above a threshold, error events with a specific status code, or Metrics from a selected device group.
SQL-like filters make subscription more precise. They also reduce the need for every downstream consumer to implement its own filtering logic.
2.3 Consumer Groups
Consumer groups provide load balancing and High Availability for subscription workloads. Multiple consumers can join the same group and process one topic together. This enables parallel consumption and failover.
When one consumer fails, its assigned partitions or workload should be redistributed to healthy consumers. The exact behavior depends on the product, so teams should test failover, rebalancing, and duplicate handling during a POC.
2.4 Checkpoint Resume
In distributed systems, restarts, network instability, and consumer failures are normal. Checkpoint resume allows a consumer to continue from its last processed position instead of starting over or skipping data.
This usually depends on persisted offsets. A reliable subscription mechanism should clearly define its delivery guarantees, such as at-least-once delivery. Exactly-once semantics, if claimed, should be tested carefully because it often depends on both the database and the downstream processing logic.
3. Built-In Subscription vs. External Message Queues
When designing a time-series data distribution architecture, teams often choose between the database’s built-in subscription mechanism and an external message queue such as Kafka or RabbitMQ.
| Dimension | Built-in Subscription | Kafka / RabbitMQ |
|---|---|---|
| Deployment complexity | Lower, no extra core component | Higher, separate deployment and maintenance |
| Data consistency | Directly tied to database storage | Requires synchronization into the queue |
| Latency | Often lower because there are fewer hops | May add a transmission and processing hop |
| Throughput | Depends on database implementation | Very high in mature streaming systems |
| Maximum consumers | Usually more limited | Horizontally scalable |
| Protocol compatibility | Often product-specific SDKs | Rich ecosystem and protocol support |
| Operations cost | Lower for simple scenarios | Higher, usually needs stream-platform expertise |
Built-in subscription is often the lighter choice for moderate data volume, limited consumer counts, and straightforward downstream processing. TDengine, for example, provides subscription functionality that captures data changes from the database layer without requiring an additional synchronization link.
External message queues remain valuable for very large-scale streams, complex stream processing, cross-system event buses, long retention in the messaging layer, or heterogeneous protocol requirements. The right choice depends on workload scale, consumer count, processing complexity, and team expertise.
4. Typical Data Sharing Scenarios
4.1 Cross-Department Data Sharing
In a manufacturing enterprise, equipment operating data may need to be shared with the quality team for process optimization and the energy team for consumption analysis. With subscription topics, each department can receive the data it is authorized to use without coordinating direct database access for every query.
4.2 Upstream-Downstream Data Exchange
In supply chain scenarios, a core enterprise may share order progress with suppliers and shipment tracking with customers. Controlled topics and API exposure can support governed data exchange while keeping access boundaries clear.
4.3 API Exposure and Ecosystem Development
Organizations can push anonymized or aggregated time-series data to API gateways through subscription mechanisms. This makes it easier to provide data services to partners, developers, or ecosystem applications while protecting raw operational data.
5. Key Dimensions for Evaluation
When evaluating data subscription and sharing capability, focus on five dimensions.
Subscription latency. Measure the time from data write to delivery at the consumer. Real-time monitoring may require second-level or lower latency, while analytical workflows may tolerate minute-level latency.
Throughput. Test how many data points or how much data can be delivered per unit of time. Validate peak volume, not just average volume.
Maximum consumer count. Check the number of active subscribers or consumer groups the system can support, and leave room for future growth.
Protocol compatibility. Review supported protocols and SDK languages. Standard interfaces such as MQTT, WebSocket, HTTP SSE, or multi-language SDKs can reduce integration cost, depending on the scenario.
Data security and access control. Enterprise deployments need topic-level access control, encrypted transmission, consumer authentication, audit logs, and clear controls over which consumers can access which data.
6. Reference Architecture
A typical time-series data subscription and sharing architecture has three layers:
Time-Series Database (Data Production Layer)
-> Subscription Mechanism (Data Distribution Layer)
-> BI Analytics Platform (historical analysis, reporting)
-> AI Training System (feature extraction, model training)
-> Real-Time Alerting Engine (anomaly detection, notifications)
-> Data Archiving Service (cold storage, compliance audit)
In this architecture, the time-series database acts as the source of truth. The subscription layer distributes data to multiple consumers, and each consumer can operate independently. This design helps preserve data consistency while allowing the consumption side to scale and evolve.
Data subscription and sharing capability is becoming a core requirement for enterprise time-series databases. A strong TSDB should not only support high-performance writes and queries, but also help teams move data reliably from collection to consumption.
When evaluating products, test subscription latency, throughput, consumer scalability, protocol compatibility, delivery guarantees, and access control against real business workloads. Built-in subscription can simplify architecture in many scenarios, while Kafka or other message queues remain better suited to large-scale or complex streaming architectures.
Run a POC with realistic data volume, consumer count, failure scenarios, and downstream processing logic before making the final selection.
To pick up a draggable item, press the space bar. While dragging, use the arrow keys to move the item. Press space again to drop the item in its new position, or press escape to cancel.


