ClickHouse vs. GreptimeDB - Which Log Storage System Is Right for You?

Logs: A Key Player in Monitoring and Data Analytics

Logs play a crucial role in monitoring, operations, and data analytics. They are indispensable in troubleshooting online systems, analyzing user behavior, tracking ad clicks, and monitoring edge IoT devices. Log data includes server logs, application logs, and security records, often reaching terabytes or even petabytes per day for mid-to-large-scale enterprises. The challenge lies in efficiently and in real-time processing these vast amounts of log data within IT infrastructures.

ClickHouse has become a major choice for log storage systems worldwide, covering even specialized log types like traces. Its high compression rates and performance make it a strong alternative to ELK (Elasticsearch). Since 2024, GreptimeDB has been developing its log processing engine and has recently demonstrated outstanding results in ClickHouse’s official JSONBench benchmark, ranking first in Cold Run and fourth in Hot Run.

This article compares the features and performance of ClickHouse and GreptimeDB in log processing scenarios to assist users in choosing the right log storage system.

Core Differences: General-Purpose OLAP vs. Observability-Optimized Design

ClickHouse is a general-purpose OLAP database that efficiently handles a wide range of analytical queries. It excels in log analysis, financial processing, and IoT scenarios due to its flexible design and mature SQL ecosystem.

In contrast, GreptimeDB is a cloud-native database designed for observability data, making it well-suited for metric collection, log storage, and real-time monitoring. Its architecture is optimized for high-frequency time-stamped data ingestion and querying, such as metrics, logs and events.

Feature Comparison

Product	ClickHouse	GreptimeDB
Data Model	Columnar storage, table-based, general OLAP	Columnar storage, table-based, unified model for metrics, logs, and traces
Performance	Strong OLAP performance, balanced for time-series data	Optimized for observability workloads, excelling in read/write performance and compression
Scalability	Horizontal scaling with manual configuration	Cloud-native auto-scaling via Kubernetes
Query Language	SQL	SQL, native PromQL support
Cost Efficiency	Hardware-dependent	Primarily based on object storage such as S3, separation of storage and compute for efficiency
Ecosystem Compatibility	Rich plugin ecosystem; mature SQL support	Supports multiple ingestion protocols; compatible with MySQL/PostgreSQL protocols; Parquet storage for big data integration
Community Support	Large, stable community	Growing community focused on observability

(Figure 1: GreptimeDB in Observability Scenario) — （Figure 1: GreptimeDB in Observability Scenario）

Log-Specific Feature Comparison

Beyond the core features, log storage systems also require additional capabilities. Here’s how ClickHouse and GreptimeDB compare in this domain:

Product	ClickHouse	GreptimeDB
Visualization	Integrates with Grafana, Kibana	Built-in log views; integrates with Grafana, Kibana
Elasticsearch API Compatibility	Not natively supported; possible via third-party tools	Enterprise edition supports Elasticsearch’s REST API
Log ETL	No built-in ETL	Built-in pipeline for converting unstructured logs to structured logs
Logs to Metrics Conversion	Materialized views; pre-aggregated tables	Flow-based stream processing
Structured & Unstructured Logs	Supports both; experimental full-text search (high write impact)	Supports both; built-in full-text indexing
Traces Ingestion & Query	Requires manual modeling; SQL-based queries	Built-in Otel Traces modeling; supports SQL and Jaeger query protocols

Performance Analysis

The performance differences between the two can be summarized as follows. For detailed data, refer to our benchmark study: “67.5% Boost in Write Performance, 50% Lower Storage Cost —— GreptimeDB v0.12 Log Performance Benchmark.”

Product	GreptimeDB	ClickHouse
Version	v0.12	24.9.1.219
Data Model	Supports structured & unstructured logs (full-text indexing enabled)	Supports structured & unstructured logs (experimental full-text search)
Write Performance (TPS)	185,535 (structured); 159,502 (unstructured)	166,667 (structured); 136,612 (unstructured)
Write Throughput (HTTP Protocol)	470% of Elasticsearch	Better than Elasticsearch, slightly below GreptimeDB
Query Performance (Structured Data)	COUNT query: 6ms; Keyword search: 22.8ms	COUNT query: 46ms; Keyword search: 52ms
Query Performance (Unstructured Data)	COUNT query: 6ms; Keyword search: ~2547ms	COUNT query: 8ms; Keyword search: ~2080ms
Compression Rate	13%-33% (best efficiency)	26%-51%
CPU Usage (%)	Structured: 13.2%; Unstructured: 10.29%	Structured: 9.56%; Unstructured: 26.77%
Memory Usage (MB)	Structured: 408MB; Unstructured: 624MB	Structured: 611MB; Unstructured: 732MB
Object Storage (S3)	1%-2% performance loss	Natively supported

In summary, GreptimeDB achieves comparable read/write performance to ClickHouse while offering advantages in compression rates and resource efficiency. Notably, its object storage performance remains nearly unaffected.

Furthermore, GreptimeDB’s performance was validated in ClickHouse’s official JSONBench, showcasing its competitiveness in handling large-scale datasets:

Ranked 1st in 1-billion document Cold Run

(Figure 2: 1 billion docs, cold run) — （Figure 2: 1 billion docs, cold run）

(Figure 3: detailed comparison, 1 billion docs, cold run) — （Figure 3: detailed comparison, 1 billion docs, cold run）

Ranked 4th in 1-billion document Hot Run

(Figure 4: 1 billion docs, hot run) — （Figure 4: 1 billion docs, hot run）

(Figure 5: detailed comparison, 1 billion docs, hot run) — （Figure 5: detailed comparison, 1 billion docs, hot run）

Scalability: A Key Differentiator

Scalability is often a decisive factor in database selection. ClickHouse and GreptimeDB take fundamentally different approaches in this aspect. For more information about GreptimeDB storage architecture, please read this blog.

Complex Scaling with ClickHouse

ClickHouse supports horizontal scaling through sharding and replication, along with vertical scaling. However, horizontal scaling requires manual configuration, including adding nodes and rebalancing data. ClickHouse Cloud offers auto-scaling but lacks the flexibility of GreptimeDB’s open-source elastic scaling.

Seamless Auto-Scaling with GreptimeDB

As a cloud-native database, GreptimeDB employs a storage-compute separation architecture. Built on Kubernetes, it enables seamless elastic scaling, making it ideal for cloud environments. Independent resource scaling ensures cost efficiency and stable performance under high-demand workloads—requiring minimal manual intervention.

Storage-Compute Separation Architecture

While ClickHouse allows storing data on object storage like S3 (reference), this architecture has some limitations:

First, scaling still requires data replication, even though the data is already in S3. This results in a cumbersome expansion process and unnecessary storage overhead. Although ClickHouse provides Zero-Copy Replication, it still has many bugs and is not recommended for production use. In fact, ClickHouse officially recommends using the closed-source SharedMergeTree.

Second, ClickHouse only moves data files to object storage, while metadata (such as table schemas and data files metadata) is still managed locally and synchronized via Zookeeper. This easily leads to consistent issues between local and object storage metadata, along with various scalability and reliability challenges.

In contrast, GreptimeDB stores table metadata in MetaSrv and places data file metadata directly in object storage. This eliminates local storage dependencies, enhancing scalability and reliability in cloud-native environments.

Product	ClickHouse	GreptimeDB
Scaling Method	Supports both horizontal and vertical scaling, but horizontal scaling requires manual configuration (adding nodes, rebalancing data), making large-scale expansion complex.	Built on a cloud-native architecture with storage-compute separation, enabling seamless elastic scaling—ideal for cloud environments.
Cloud Support	ClickHouse Cloud offers auto-scaling options, but still has limitations compared to the elastic scalability of open-source GreptimeDB.	Kubernetes-native deployment with independent resource scaling, ensuring cost efficiency, flexibility, and stable performance without manual intervention.
Storage-Compute Separation	Data can be stored on object storage like S3, but scaling still requires data replication, making the process complex and leading to storage waste. Zero-Copy Replication has numerous issues and bugs.	Metadata and data are stored separately—table metadata in MetaSrv, while data file metadata is managed in object storage. This lightweight architecture offers better scalability.
Metadata Management	Table schemas and data files metadata are still stored locally and synchronized using Zookeeper, leading to inconsistencies and scalability challenges.	Centralized metadata management in MetaSrv, with data file metadata managed in object storage, ensuring better consistency and scalability.
Use Cases	Broad OLAP query and analysis, but limited scalability and cloud-native compatibility.	Excels in observability and IoT scenarios, making it ideal for high-scalability and elasticity-demanding applications.

Choosing Your Right Database: A Practical Guide

Choosing between ClickHouse and GreptimeDB depends on your specific workload requirements:

✅ When to Choose ClickHouse

Broad analytical queries across multiple domains
Requires a mature community and extensive documentation
Mixed OLAP workloads with observability use cases

✅ When to Choose GreptimeDB

High-frequency, time-stamped data (e.g., metrics, logs, observability data)
Cloud-native architectures requiring elastic scalability and cost efficiency
Workloads needing PromQL integration and compatibility with Prometheus

Conclusion for Database Selection

ClickHouse is a powerful general-purpose OLAP database with excellent performance, flexibility, and a large ecosystem. However, GreptimeDB excels in observability scenarios, offering optimized log and metric storage, high ingestion speed, and a scalable, cloud-native architecture tailored for monitoring and IoT workloads.

Both solutions are open-source, providing flexibility for different use cases. The choice ultimately depends on data characteristics, scalability needs, and cost considerations.

About Greptime

Greptime offers industry-leading time series database products and solutions to empower IoT and Observability scenarios, enabling enterprises to uncover valuable insights from their data with less time, complexity, and cost.

GreptimeDB is an open-source, high-performance time-series database offering unified storage and analysis for metrics, logs, and events. Try it out instantly with GreptimeCloud, a fully-managed DBaaS solution—no deployment needed!

The Edge-Cloud Integrated Solution combines multimodal edge databases with cloud-based GreptimeDB to optimize IoT edge scenarios, cutting costs while boosting data performance.

Star us on GitHub or join GreptimeDB Community on Slack to get connected.

ClickHouse vs. GreptimeDB - Which Log Storage System Is Right for You?

Logs: A Key Player in Monitoring and Data Analytics ​

Core Differences: General-Purpose OLAP vs. Observability-Optimized Design ​

Feature Comparison ​

Log-Specific Feature Comparison ​

Performance Analysis ​

Scalability: A Key Differentiator ​

Complex Scaling with ClickHouse ​

Seamless Auto-Scaling with GreptimeDB ​

Storage-Compute Separation Architecture ​

Choosing Your Right Database: A Practical Guide ​

Conclusion for Database Selection ​

About Greptime ​

Join our community