Modernizing Oilfield Data Infrastructure with TDengine at Sinopec

TDengine Team

April 14, 2026 /

China Petroleum & Chemical Corporation (Sinopec) is one of the world’s largest integrated energy and chemical companies and a consistent member of the Fortune Global 500. The company operates across the full value chain, including upstream oil and gas exploration, refining, petrochemicals, and distribution. Sinopec produces hundreds of millions of tons of oil and gas annually and operates one of the largest refining capacities in the world, exceeding 300 million tons per year.

The oilfield described in this case study is the second-largest in Sinopec. It operates across a total licensed area of 15,405 square kilometers, with estimated resources of 2 billion tons of oil and 1.8 trillion cubic meters of natural gas. Proven geological reserves include 625.8 million tons of oil and 483.3 billion cubic meters of natural gas.

The oilfield relies on its Production Control System (PCS) as a core hub for ensuring safe operations and optimizing production. To address challenges such as high-concurrency write bottlenecks, high storage costs, complex real-time analytics requirements, and data silos across multiple business domains, TDengine was deployed in the data infrastructure, storing and managing data for core PCS modules and enabling real-time data synchronization from branch sites to headquarters.

Business Challenges

The PCS system covers multiple critical business domains, including oil production, water injection, gas storage management, and pipeline monitoring. Across the oilfield, tens of thousands of sensors and intelligent devices generate hundreds of thousands of time-series data records per second. And with the widespread adoption of IoT technologies and the acceleration of digital transformation, the number of connected devices and data types in the PCS system has grown exponentially. This massive volume of time-series data has placed significant pressure on the existing Oracle-based data architecture.

Under the original architecture, the system faced several key challenges:

  • Write Performance Bottlenecks: Oracle databases are not designed for high-frequency time-series data ingestion. Under heavy concurrent insert workloads, I/O pressure becomes significant, often leading to write latency and even impacting the timely delivery of front-end control commands.
  • High Storage Costs: Relational databases offer limited compression efficiency for time-series data, resulting in substantial storage consumption. To retain historical data over required periods, continuous investment in high-end storage infrastructure is needed, driving up both hardware and maintenance costs.
  • Limited Real-Time Analytics Capability: Performing operations such as time-window aggregations and cross-device correlation analysis in Oracle requires complex SQL queries with low execution efficiency. Query response times often exceed several minutes.
  • Rigid System Architecture: Different business modules (such as PCS and gas storage systems) store data independently, creating data silos. This makes it difficult to perform unified cross-domain analysis and extract broader business value.

These challenges have significantly constrained the oilfield’s progress toward digitalization and intelligent operations, making it essential to adopt a high-performance database purpose-built for time-series data.

Why TDengine

Sinopec required a system that could deliver extremely high write and query performance, excellent data compression, strong SQL compatibility, high availability, and a cluster architecture that is easy to operate and maintain, while also integrating seamlessly with the existing IoT ecosystem.

After rigorous testing, TDengine demonstrated decisive advantages in the following areas:

  • Superior Performance: Under the same hardware conditions, TDengine achieved significant improvements in both write throughput and query speed compared to other solutions.
  • Exceptional Compression: With a storage architecture purpose-built for time-series data, TDengine achieved a compression ratio of 1:10, significantly outperforming other tested alternatives.
  • SQL Compatibility: The team’s existing SQL expertise could be directly applied, greatly reducing development and learning costs. It also enabled seamless integration with third-party tools such as ThingsBoard via standard JDBC connections for data querying.
  • Comprehensive Architecture: TDengine integrates features such as data subscription, caching, and stream processing out of the box. This eliminates the need for additional components like Kafka or Redis, simplifying the system architecture and reducing operational complexity.
  • Tiered Storage: Cold data can be automatically migrated to lower-cost object storage, aligning well with the project’s long-term data retention requirements and further optimizing storage costs.

System Architecture

A cloud-edge collaborative model was used, with TDengine serving as the core data persistence layer and computation engine throughout the system.

Capabilities and Benefits

Data Ingestion and Real-Time Processing

A dual-track data ingestion strategy was created to support various business scenarios at the oilfield. For business data with flexible and frequently changing structures, such as gas storage operations, TDengine TSDB’s schemaless mode allowed the database to automatically create tables and significantly simplify data integration. For core PCS business data, which has a stable structure and more complex logic, traditional SQL-based ingestion was used to ensure precise control and high performance.

At the data ingestion layer, various field devices across the oilfield report data to the ThingsBoard IoT platform via MQTT or HTTP. For flexible and dynamic data structures, such as those in gas storage systems, the schemaless ingestion interface of TDengine TSDB enables ThingsBoard to write data directly, with tables automatically created by TDengine, greatly improving development efficiency.

Key Benefits

  • Breakthrough in Write Performance: TDengine’s write engine, purpose-built for time-series data, eliminates the I/O bottlenecks encountered with Oracle. The system can easily handle hundreds of thousands of concurrent data writes per second, reducing latency from seconds to milliseconds, and ensuring real-time monitoring and timely execution of control commands.

Data Synchronization and Distribution

To meet requirements for global data analysis, the project synchronizes selected tables and supertables from oilfield production sites to headquarters in real time, achieving unified data integration across operations. Data synchronization tasks are configured through a graphical interface in the database management web console.

Key Benefits

  • Unified Data Aggregation: A two-tier edge—headquarters data architecture has been established, effectively addressing the need for centralized, global data analysis at headquarters while supporting localized data processing at branch sites.
  • Reliable and Efficient Synchronization: TDengine ensures data consistency and reliability during synchronization, with minimal impact on the performance of the source cluster. Headquarters can access a real-time, unified view of enterprise-wide data, providing a solid foundation for strategic analysis and cross-regional benchmarking.

Data Query and Analytics

Business applications, real-time dashboards, and analytics systems can directly access TDengine via standard SQL, a RESTful API, or JDBC. This enables full use of built-in functions for efficient aggregation, interpolation, and downsampling. With the supertable model, unified query and analysis across devices and regions has become straightforward.

At the application layer, business systems, real-time dashboards, and alerting engines connect directly to the oilfield-side TDengine cluster, leveraging its real-time computing capabilities for fast data querying and analysis.

Key Benefits

  • Query Performance Improvement: Typical query scenarios see significant performance gains. For example, time-window aggregations such as “average pressure per minute over the past hour at a water injection station” can be executed much faster and more efficiently.
  • Breaking Down Data Silos: Using the supertable model, data from similar equipment (e.g., pumps) across different oil production plants and business domains (such as oil production and water injection) can be unified under a consistent schema. This enables cross-domain and cross-region analytics, unlocking new opportunities for coordinated production optimization.
  • Enhanced Real-Time Analytics: Millisecond-level query response times make real-time dashboards, interactive analysis, and instant decision-making feasible and significantly improving the responsiveness of production scheduling and emergency operations.

Data Storage and Compression Module

TDengine is purpose-built for time-series data, leveraging columnar storage and specialized compression algorithms to enable high-throughput ingestion and efficient storage of massive data points. This approach not only significantly reduces storage footprint but also improves query performance.

Key Benefits

  • Significant Reduction in Storage Costs: TDengine has proven highly effective at data compression, typically achieving ratios exceeding 10:1. For example, datasets that originally required around 50 TB of raw storage are reduced to approximately 4.2 TB with a single replica. Combined with tiered storage strategies, overall long-term storage costs have been reduced by more than 85%.
  • Automated Storage Management: Tiered storage enables automated data lifecycle management, eliminating the need for manual archiving. While ensuring historical data remains accessible for queries, it also frees up expensive high-performance storage for active workloads.

Future Expansion: Toward an AI-Native Data Platform with TDengine IDMP

Building on the solid foundation of TDengine TSDB, the oilfield is looking into further enhancing its data capabilities by introducing TDengine IDMP as the next layer of its industrial data platform.

With IDMP, the focus will shift from data storage and retrieval to data understanding and intelligent decision-making. By leveraging built-in capabilities such as AI-driven analysis, event insight, and advanced process analytics, the oilfield could enable faster root cause identification, more accurate anomaly detection, and deeper operational insights across production systems.

In practical terms, TDengine IDMP would support:

  • Unified data visualization and monitoring, enabling real-time dashboards and cross-system visibility
  • Event-driven analysis, allowing engineers to quickly correlate operational events and identify key influencing factors
  • AI-assisted diagnostics, reducing reliance on manual analysis and accelerating decision-making
  • Closed-loop optimization, where insights from historical and real-time data can be continuously fed back into production processes

By integrating TDengine IDMP into its existing architecture, the oilfield could move toward a more intelligent, data-driven operating model, where data is not only stored efficiently but actively drives optimization, automation, and innovation across the entire operation.