✕

Join us for a virtual meetup on Zoom at 8 PM, July 31 (PDT) about using One Time Series Database for Both Metrics and Logs 👉🏻 Register Now

✕
Skip to content
About Apache Pinot
Apache Pinot is a real-time distributed OLAP datastore designed for ultra-low latency analytics at scale. Originally built at LinkedIn, Pinot specializes in user-facing analytical applications that require sub-second response times on large datasets. It features columnar storage, pluggable indexing technologies, and supports both real-time streaming and batch ingestion. Pinot excels at slice-and-dice operations, drill-downs, and complex analytical queries on high-cardinality, multi-dimensional data.
GreptimeDB vs. Apache Pinot
Feature/AspectGreptimeDBApache Pinot
Data ModelUnified Observability DatabaseReal-time OLAP Analytics Database
Primary FocusTime-series data with observability (metrics, logs, traces)User-facing real-time analytics and reporting
Multi-model SupportMetrics, Logs & Traces in one databaseAnalytics data only (requires separate systems for observability)
Query Use CasesTime-series queries, monitoring, alerting, observabilityOLAP queries, slice-and-dice, drill-down, pivot operations
Ingestion ProtocolsSQL
gRPC
InfluxDB Line Protocol
Prometheus Remote Storage
OpenTelemetry
HTTP API
Kafka
Pulsar
Kinesis
Batch (Hadoop, Spark, S3)
REST API
Query LanguagesSQL & PromQL (dual interface)SQL & PromQL (via plugin, experimental in v1.3.0+)
Query PerformanceSub-second response optimized for time-series patternsSub-second response optimized for OLAP workloads
Indexing StrategyTime-series optimized (inverted, full-text, vector search)OLAP optimized (StarTree, Bloom filter, range, text, JSON, geospatial)
Storage FormatApache Parquet (time-series optimized)Columnar with dictionary encoding, compression
Real-time ProcessingNative time-series streaming with Pipeline engineReal-time OLAP with lambda architecture
Data RetentionFlexible TTL policies for observability dataTiered storage (hot, warm, cold)
Scalability ModelCompute-storage separation for time-series workloadsHorizontal scaling for high-concurrency analytics
ConcurrencyOptimized for monitoring and alerting queriesHundreds of thousands of concurrent analytical queries
Use CasesInfrastructure monitoring, application observability, IoT data analysis, real-time alertingUser-facing dashboards, business analytics, interactive reporting
ArchitectureCloud-native distributed with observability focusDistributed OLAP with Controller, Broker, Server architecture
Data CompressionTime-series aware compressionMultiple compression schemes (Run Length, LZ4, Snappy)
Schema ManagementSchema-on-write with time-series semanticsSchema-on-write with OLAP optimizations
Deployment ComplexitySingle unified system with simplified Kubernetes operationsComplex multi-component deployment for analytics
LicenseApache 2.0Apache 2.0
Written LanguageRust (memory safety, performance)Java (ecosystem compatibility)
Ecosystem IntegrationDeep observability ecosystem (Grafana, Prometheus, OpenTelemetry)Analytics ecosystem (Superset, Tableau, custom dashboards)

Join our community

Get the latest updates and discuss with other users.