Hydrolix Custom Metrics

A list of custom metrics used in Hydrolix

Hydrolix Metrics

This page lists the custom Hydrolix metrics available, and which components update them.

If more than one component uses a given metric, then querying it will return results from all relevant components. You can restrict results to a specific component by adding a service keyword to your query. For example, process_open_fds{service="stream-peer"}.

For more information about metric types, refer to Prometheus's documentation.

Custom general metrics

MetricTypeComponentsPurpose
bytes_writtenCounterBatch peer, Stream peerBytes written to the indexer.
partitions_createdCounterBatch peer, Stream peerCount of partitions created.
upload_durationSummaryAny intake peerTime spent uploading a file, in milliseconds.

Custom query metrics

MetricTypeComponentsPurpose
net_connect_attempts_totalHistogramHead/Query peerHistogram of TCP connection attempts to the storage service.
net_connect_secondsHistogramHead/Query peerHistogram of time to connect over TCP to the storage service, in seconds.
net_dns_resolve_secondsHistogramHead/Query peerHistogram of DNS resolution time for the storage service, in seconds.
net_http_response_timeHistogramHead/Query peerHistogram of HTTP response times from the storage service, in seconds.
net_http_response_bytesHistogramHead/Query peerHistogram of HTTP bytes downloaded from the storage service.
net_http_attempts_totalHistogramHead/Query peerHistogram of HTTP connection attempts to the storage service.
net_http_status_codeHistogramHead/Query peerHistogram of HTTP status codes from the storage service.
vfs_cache_hitmiss_totalHistogramHead/Query peerHistogram of cache status (bucket=0 cache miss, bucket=1 cache hit).
vfs_cache_read_bytesHistogramHead/Query peerHistogram of bytes read from the cache.
vfs_net_read_bytesHistogramHead/Query peerHistogram of bytes read from the network.
vfs_cache_lru_file_eviction_totalHistogramHead/Query peerHistogram of file evictions from the cache.
epoll_cpu_secondsHistogramHead/Query peerHistogram of CPU usage in seconds.
epoll_io_secondsHistogramHead/Query peerHistogram of I/O times in seconds.
epoll_poll_secondsHistogramHead/Query peerHistogram of wait times for file descriptors, in seconds.
hdx_storage_r_catalog_partitions_totalHistogramHead/Query peerHistogram of per-query catalog partition count.
hdx_storage_r_partitions_read_totalHistogramHead/Query peerHistogram of per-query partition read count.
hdx_storage_r_partitions_per_core_totalHistogramHead/Query peerHistogram of per-core partition usage count.
hdx_storage_r_peers_used_totalHistogramQuery peerHistogram of storage used total.
hdx_storage_r_cores_used_totalHistogramQuery peerHistogram of cores used total.
hdx_storage_r_catalog_timerangeHistogramHead/Query peerHistogram of query time-range distribution.
hdx_partition_columns_read_totalHistogramHead/Query peerHistogram of columns read.
hdx_partition_block_decode_secondsHistogramHead/Query peerHistogram of time spent decoding HDX blocks, in seconds.
hdx_partition_open_secondsHistogramHead/Query peerHistogram of time spent opening HDX partitions, in seconds.
hdx_partition_read_secondsHistogramHead/Query peerHistogram of time spent reading HDX partitions, in seconds.
hdx_partition_skipped_totalHistogramHead/Query peerHistogram of partitions skipped due to no matching columns.
hdx_partition_blocks_read_totalHistogramHead/Query peerHistogram of partition-block read counts.
hdx_partition_blocks_avail_totalHistogramHead/Query peerHistogram of partition blocks available.
hdx_partition_index_decisionHistogramHead/Query peerHistogram of partition decisions (bucket=0 full scan, 1 partial scan, 2 no match).
hdx_partition_index_lookup_secondsHistogramHead/Query peerHistogram of index lookup times, in seconds.
hdx_partition_index_blocks_skipped_percentHistogramHead/Query peerHistogram of skipped index blocks, in percentage.
hdx_partition_index_blocks_skipped_totalHistogramHead/Query peerHistogram of total skipped index blocks.
hdx_partition_rd_w_err_totalHistogramHead/Query peerHistogram of read/write errors (bucket=0 read error, 1 write error, 3 error).
query_iowait_secondsHistogramHead/Query peerHistogram of query I/O wait times, in seconds.
query_cpuwait_secondsHistogramHead/Query peerHistogram of query CPU wait times, in seconds.
query_hdx_ch_conv_secondsHistogramHead/Query peerHistogram of time spent converting HDX blocks to ClickHouse, in seconds.
query_healthHistogramHead/Query peerHistogram of query health (bucket=0 initiated error, 1 succeeded, 2 error).
query_peer_availabilityHistogramHead/Query peerHistogram of peer availability (bucket=0 primary_peer_available, 1 secondary_peer_available, 2 no_reachable_peers).
query_attempts_totalHistogramHead/Query peerHistogram of total query attempts.
query_response_secondsHistogramHead/Query peerHistogram of total query response times, in seconds.
query_rows_read_totalHistogramHead/Query peerHistogram of total query rows read.
query_read_bytesHistogramHead/Query peerHistogram of total query read bytes.
query_rows_written_totalHistogramHead/Query peerHistogram of total query rows written.

Custom batch metrics

MetricTypeComponentsPurpose
processed_countCounterBatch peerCount of items processed.
processed_failureCounterBatch peerCount of processing failures.
processing_duration_histoHistogramBatch peerHistogram of Batch processing durations (milliseconds).
processing_duration_summarySummaryBatch peerSummary of Batch processing durations (milliseconds).
rows_readCounterBatch peerCount of rows read.

Custom merge metrics

MetricTypeComponentsPurpose
merge_duration_summarySummaryMerge peerMerge processing duration, in milliseconds.
merge_duration_histoHistogramMerge peerMerge processing duration, in milliseconds.
merge_sdk_duration_summarySummaryMerge peerMerge SDK processing duration, in milliseconds.
merge_sdk_duration_histoHistogramMerge peerMerge SDK processing duration, in milliseconds.
merge_candidate_histoHistogramMerge peerPartitions per merge candidate.
merge_candidate_inactiveCounterMerge peerMerge candidates skipped due to an inactive partition within the candidate.
merge_candidate_construction_summarySummaryMerge headTime spent building merge candidates, in milliseconds.
merge_queue_fullCounterMerge headTimes candidate generation was skipped due to a full queue.
merge_successCounterMerge peerCount of merge successes.
merge_failureCounterMerge peerCount of merge failures.

Custom streaming metrics

MetricTypeComponentsPurpose
hdx_sink_backlog_bytes_countGaugeIntake headTotal bytes of all partition buckets in the sink backlog waiting to be indexed (requires intake_head_index_backlog_enabled = true).
hdx_sink_backlog_items_countGaugeIntake headTotal count of partition buckets in the backlog (requires intake_head_index_backlog_enabled = true).
hdx_sink_backlog_dropped_bytes_countCounterIntake headTotal bytes of partition buckets dropped due to backlog overflow (requires intake_head_index_backlog_enabled = true).
hdx_sink_backlog_dropped_items_countCounterIntake headCount of partition buckets dropped due to backlog overflow (requires intake_head_index_backlog_enabled = true).
hdx_sink_backlog_delivery_countCounterIntake headCount of backlog buckets successfully handed off to indexing (requires intake_head_index_backlog_enabled = true).
hdx_sink_backlog_trim_duration_nsHistogramIntake headTime to trim the backlog in nanoseconds (requires intake_head_index_backlog_enabled = true).
http_source_byte_countCounterStream headCount of bytes processed.
http_source_request_countCounterStream headCount of HTTP requests.
http_source_request_duration_nsHistogramStream headHistogram of HTTP request durations in nanoseconds.
http_source_request_error_countCounterStream headCount of HTTP request failures.
http_source_row_countCounterStream headCount of rows processed.
http_source_value_countCounterStream headCount of values processed.

Custom Kinesis metrics

MetricTypeComponentsPurpose
kinesis_source_byte_countCounterStream peerCount of bytes read from Kinesis.
kinesis_source_checkpoint_countCounterStream peerCount of Kinesis checkpoint operations.
kinesis_source_checkpoint_duration_nsHistogramStream peerDuration of Kinesis checkpoint operations, in nanoseconds.
kinesis_source_checkpoint_error_countCounterStream peerCount of Kinesis checkpoint operation errors.
kinesis_source_error_countCounterStream peerCount of errors reading from Kinesis.
kinesis_source_lag_msGaugeStream peerMeasure of lag in Kinesis, in milliseconds.
kinesis_source_operation_countCounterStream peerCount of Kinesis operations.
kinesis_source_operation_duration_nsHistogramStream peerHistogram of Kinesis operation durations, in nanoseconds.
kinesis_source_record_countCounterStream peerCount of records read from Kinesis.
kinesis_source_row_countCounterStream peerCount of rows read from Kinesis.
kinesis_source_value_countCounterStream peerCount of values read from Kinesis.

Custom Kafka metrics

MetricTypeComponentsPurpose
kafka_source_byte_countCounterStream peerCount of bytes read from Kafka.
kafka_source_commit_duration_nsHistogramStream peerKafka commit duration, in nanoseconds.
kafka_source_read_countCounterStream peerCount of Kafka reads.
kafka_source_read_duration_nsHistogramStream peerKafka read duration, in nanoseconds.
kafka_source_read_error_countCounterStream peerCount of Kafka errors.
kafka_source_row_countCounterStream peerCount of rows processed from Kafka.
kafka_source_value_countCounterStream peerCount of values processed from Kafka.

Custom DNS metrics

MetricTypeComponentsPurpose
dns_num_ips_in_cacheHistogram(ingest)The size of the IP pool used in the DNS system.
dns_lookup_timeHistogram(ingest)Milliseconds per lookup.
dns_ttlHistogram(ingest)TTLs received per lookup.

Custom PostgreSQL metrics

MetricTypePurpose
pgx_pool_total_acquire_countCountThe cumulative count of successful acquires from the pool.
pgx_pool_total_acquire_duration_ns_countCountThe total duration (ns) of all successful acquires from the pool.
pgx_pool_total_acquire_cancel_countCountThe cumulative count of acquires from the pool that were canceled by a context.
pgx_pool_total_acquire_empty_countCountThe cumulative count of successful acquires from the pool that had to wait for a resource because the pool was empty.
pgx_pool_total_conns_opened_countCountThe cumulative count of new connections opened.
pgx_pool_total_destroyed_max_lifetime_countCountThe cumulative count of connections destroyed because they exceeded MaxConnLifetime.
pgx_pool_total_destroyed_max_idle_countCountThe cumulative count of connections destroyed because they exceeded MaxConnIdleTime.
pgx_pool_current_sizeGaugeThe total number of resources currently in the pool.
pgx_pool_current_constructingGaugeThe number of connections currently being constructed.
pgx_pool_current_acquiredGaugeThe number of connections currently acquired.
pgx_pool_current_idleGaugeThe number of currently idle connections in the pool.
pgx_pool_maxGaugeThe maximum size of the pool.