datasketches

datasketches

datasketches : Approximate analytics sketches and aggregates for PostgreSQL

Overview

IDExtensionPackageVersionCategoryLicenseLanguage
4690
datasketches
datasketches
1.7.0
FUNC
Apache-2.0
C++
AttributeHas BinaryHas LibraryNeed LoadHas DDLRelocatableTrusted
--s-d-r
No
Yes
No
Yes
yes
no

Built against Apache DataSketches C++ core 5.0.0.

Packages

TypeRepoVersionPG Major CompatibilityPackage PatternDependencies
EXT
PIGSTY
1.7.0
18
17
16
15
14
datasketches-
RPM
PIGSTY
1.7.0
18
17
16
15
14
datasketches_$v-
DEB
PIGSTY
1.7.0
18
17
16
15
14
postgresql-$v-datasketches-
Linux / PGPG18PG17PG16PG15PG14
el8.x86_64
PIGSTY 1.7.0
PIGSTY 1.7.0
PIGSTY 1.7.0
PIGSTY 1.7.0
PIGSTY 1.7.0
el8.aarch64
PIGSTY 1.7.0
PIGSTY 1.7.0
PIGSTY 1.7.0
PIGSTY 1.7.0
PIGSTY 1.7.0
el9.x86_64
PIGSTY 1.7.0
PIGSTY 1.7.0
PIGSTY 1.7.0
PIGSTY 1.7.0
PIGSTY 1.7.0
el9.aarch64
PIGSTY 1.7.0
PIGSTY 1.7.0
PIGSTY 1.7.0
PIGSTY 1.7.0
PIGSTY 1.7.0
el10.x86_64
PIGSTY 1.7.0
PIGSTY 1.7.0
PIGSTY 1.7.0
PIGSTY 1.7.0
PIGSTY 1.7.0
el10.aarch64
PIGSTY 1.7.0
PIGSTY 1.7.0
PIGSTY 1.7.0
PIGSTY 1.7.0
PIGSTY 1.7.0
d12.x86_64
PIGSTY 1.7.0
PIGSTY 1.7.0
PIGSTY 1.7.0
PIGSTY 1.7.0
PIGSTY 1.7.0
d12.aarch64
PIGSTY 1.7.0
PIGSTY 1.7.0
PIGSTY 1.7.0
PIGSTY 1.7.0
PIGSTY 1.7.0
d13.x86_64
PIGSTY 1.7.0
PIGSTY 1.7.0
PIGSTY 1.7.0
PIGSTY 1.7.0
PIGSTY 1.7.0
d13.aarch64
PIGSTY 1.7.0
PIGSTY 1.7.0
PIGSTY 1.7.0
PIGSTY 1.7.0
PIGSTY 1.7.0
u22.x86_64
PIGSTY 1.7.0
PIGSTY 1.7.0
PIGSTY 1.7.0
PIGSTY 1.7.0
PIGSTY 1.7.0
u22.aarch64
PIGSTY 1.7.0
PIGSTY 1.7.0
PIGSTY 1.7.0
PIGSTY 1.7.0
PIGSTY 1.7.0
u24.x86_64
PIGSTY 1.7.0
PIGSTY 1.7.0
PIGSTY 1.7.0
PIGSTY 1.7.0
PIGSTY 1.7.0
u24.aarch64
PIGSTY 1.7.0
PIGSTY 1.7.0
PIGSTY 1.7.0
PIGSTY 1.7.0
PIGSTY 1.7.0
PackageVersionOSORGSIZEFile URL
datasketches_181.7.0el8.x86_64pigsty324.4 KiBdatasketches_18-1.7.0-1PIGSTY.el8.x86_64.rpm
datasketches_181.7.0el8.aarch64pigsty314.1 KiBdatasketches_18-1.7.0-1PIGSTY.el8.aarch64.rpm
datasketches_181.7.0el9.x86_64pigsty309.4 KiBdatasketches_18-1.7.0-1PIGSTY.el9.x86_64.rpm
datasketches_181.7.0el9.aarch64pigsty315.1 KiBdatasketches_18-1.7.0-1PIGSTY.el9.aarch64.rpm
datasketches_181.7.0el10.x86_64pigsty319.1 KiBdatasketches_18-1.7.0-1PIGSTY.el10.x86_64.rpm
datasketches_181.7.0el10.aarch64pigsty319.4 KiBdatasketches_18-1.7.0-1PIGSTY.el10.aarch64.rpm
postgresql-18-datasketches1.7.0d12.x86_64pigsty918.1 KiBpostgresql-18-datasketches_1.7.0-1PIGSTY~bookworm_amd64.deb
postgresql-18-datasketches1.7.0d12.aarch64pigsty920.0 KiBpostgresql-18-datasketches_1.7.0-1PIGSTY~bookworm_arm64.deb
postgresql-18-datasketches1.7.0d13.x86_64pigsty943.3 KiBpostgresql-18-datasketches_1.7.0-1PIGSTY~trixie_amd64.deb
postgresql-18-datasketches1.7.0d13.aarch64pigsty944.0 KiBpostgresql-18-datasketches_1.7.0-1PIGSTY~trixie_arm64.deb
postgresql-18-datasketches1.7.0u22.x86_64pigsty1017.0 KiBpostgresql-18-datasketches_1.7.0-1PIGSTY~jammy_amd64.deb
postgresql-18-datasketches1.7.0u22.aarch64pigsty1020.8 KiBpostgresql-18-datasketches_1.7.0-1PIGSTY~jammy_arm64.deb
postgresql-18-datasketches1.7.0u24.x86_64pigsty977.8 KiBpostgresql-18-datasketches_1.7.0-1PIGSTY~noble_amd64.deb
postgresql-18-datasketches1.7.0u24.aarch64pigsty991.3 KiBpostgresql-18-datasketches_1.7.0-1PIGSTY~noble_arm64.deb
PackageVersionOSORGSIZEFile URL
datasketches_171.7.0el8.x86_64pigsty324.4 KiBdatasketches_17-1.7.0-1PIGSTY.el8.x86_64.rpm
datasketches_171.7.0el8.aarch64pigsty314.1 KiBdatasketches_17-1.7.0-1PIGSTY.el8.aarch64.rpm
datasketches_171.7.0el9.x86_64pigsty309.4 KiBdatasketches_17-1.7.0-1PIGSTY.el9.x86_64.rpm
datasketches_171.7.0el9.aarch64pigsty315.0 KiBdatasketches_17-1.7.0-1PIGSTY.el9.aarch64.rpm
datasketches_171.7.0el10.x86_64pigsty319.1 KiBdatasketches_17-1.7.0-1PIGSTY.el10.x86_64.rpm
datasketches_171.7.0el10.aarch64pigsty319.4 KiBdatasketches_17-1.7.0-1PIGSTY.el10.aarch64.rpm
postgresql-17-datasketches1.7.0d12.x86_64pigsty918.3 KiBpostgresql-17-datasketches_1.7.0-1PIGSTY~bookworm_amd64.deb
postgresql-17-datasketches1.7.0d12.aarch64pigsty919.2 KiBpostgresql-17-datasketches_1.7.0-1PIGSTY~bookworm_arm64.deb
postgresql-17-datasketches1.7.0d13.x86_64pigsty942.9 KiBpostgresql-17-datasketches_1.7.0-1PIGSTY~trixie_amd64.deb
postgresql-17-datasketches1.7.0d13.aarch64pigsty943.8 KiBpostgresql-17-datasketches_1.7.0-1PIGSTY~trixie_arm64.deb
postgresql-17-datasketches1.7.0u22.x86_64pigsty1.1 MiBpostgresql-17-datasketches_1.7.0-1PIGSTY~jammy_amd64.deb
postgresql-17-datasketches1.7.0u22.aarch64pigsty1.1 MiBpostgresql-17-datasketches_1.7.0-1PIGSTY~jammy_arm64.deb
postgresql-17-datasketches1.7.0u24.x86_64pigsty977.8 KiBpostgresql-17-datasketches_1.7.0-1PIGSTY~noble_amd64.deb
postgresql-17-datasketches1.7.0u24.aarch64pigsty991.2 KiBpostgresql-17-datasketches_1.7.0-1PIGSTY~noble_arm64.deb
PackageVersionOSORGSIZEFile URL
datasketches_161.7.0el8.x86_64pigsty324.4 KiBdatasketches_16-1.7.0-1PIGSTY.el8.x86_64.rpm
datasketches_161.7.0el8.aarch64pigsty314.1 KiBdatasketches_16-1.7.0-1PIGSTY.el8.aarch64.rpm
datasketches_161.7.0el9.x86_64pigsty309.4 KiBdatasketches_16-1.7.0-1PIGSTY.el9.x86_64.rpm
datasketches_161.7.0el9.aarch64pigsty315.0 KiBdatasketches_16-1.7.0-1PIGSTY.el9.aarch64.rpm
datasketches_161.7.0el10.x86_64pigsty319.1 KiBdatasketches_16-1.7.0-1PIGSTY.el10.x86_64.rpm
datasketches_161.7.0el10.aarch64pigsty319.3 KiBdatasketches_16-1.7.0-1PIGSTY.el10.aarch64.rpm
postgresql-16-datasketches1.7.0d12.x86_64pigsty918.1 KiBpostgresql-16-datasketches_1.7.0-1PIGSTY~bookworm_amd64.deb
postgresql-16-datasketches1.7.0d12.aarch64pigsty919.5 KiBpostgresql-16-datasketches_1.7.0-1PIGSTY~bookworm_arm64.deb
postgresql-16-datasketches1.7.0d13.x86_64pigsty943.1 KiBpostgresql-16-datasketches_1.7.0-1PIGSTY~trixie_amd64.deb
postgresql-16-datasketches1.7.0d13.aarch64pigsty943.8 KiBpostgresql-16-datasketches_1.7.0-1PIGSTY~trixie_arm64.deb
postgresql-16-datasketches1.7.0u22.x86_64pigsty1.1 MiBpostgresql-16-datasketches_1.7.0-1PIGSTY~jammy_amd64.deb
postgresql-16-datasketches1.7.0u22.aarch64pigsty1.1 MiBpostgresql-16-datasketches_1.7.0-1PIGSTY~jammy_arm64.deb
postgresql-16-datasketches1.7.0u24.x86_64pigsty977.8 KiBpostgresql-16-datasketches_1.7.0-1PIGSTY~noble_amd64.deb
postgresql-16-datasketches1.7.0u24.aarch64pigsty991.2 KiBpostgresql-16-datasketches_1.7.0-1PIGSTY~noble_arm64.deb
PackageVersionOSORGSIZEFile URL
datasketches_151.7.0el8.x86_64pigsty342.1 KiBdatasketches_15-1.7.0-1PIGSTY.el8.x86_64.rpm
datasketches_151.7.0el8.aarch64pigsty332.3 KiBdatasketches_15-1.7.0-1PIGSTY.el8.aarch64.rpm
datasketches_151.7.0el9.x86_64pigsty323.5 KiBdatasketches_15-1.7.0-1PIGSTY.el9.x86_64.rpm
datasketches_151.7.0el9.aarch64pigsty329.1 KiBdatasketches_15-1.7.0-1PIGSTY.el9.aarch64.rpm
datasketches_151.7.0el10.x86_64pigsty325.9 KiBdatasketches_15-1.7.0-1PIGSTY.el10.x86_64.rpm
datasketches_151.7.0el10.aarch64pigsty325.2 KiBdatasketches_15-1.7.0-1PIGSTY.el10.aarch64.rpm
postgresql-15-datasketches1.7.0d12.x86_64pigsty932.6 KiBpostgresql-15-datasketches_1.7.0-1PIGSTY~bookworm_amd64.deb
postgresql-15-datasketches1.7.0d12.aarch64pigsty933.7 KiBpostgresql-15-datasketches_1.7.0-1PIGSTY~bookworm_arm64.deb
postgresql-15-datasketches1.7.0d13.x86_64pigsty957.8 KiBpostgresql-15-datasketches_1.7.0-1PIGSTY~trixie_amd64.deb
postgresql-15-datasketches1.7.0d13.aarch64pigsty957.9 KiBpostgresql-15-datasketches_1.7.0-1PIGSTY~trixie_arm64.deb
postgresql-15-datasketches1.7.0u22.x86_64pigsty1.1 MiBpostgresql-15-datasketches_1.7.0-1PIGSTY~jammy_amd64.deb
postgresql-15-datasketches1.7.0u22.aarch64pigsty1.1 MiBpostgresql-15-datasketches_1.7.0-1PIGSTY~jammy_arm64.deb
postgresql-15-datasketches1.7.0u24.x86_64pigsty984.6 KiBpostgresql-15-datasketches_1.7.0-1PIGSTY~noble_amd64.deb
postgresql-15-datasketches1.7.0u24.aarch64pigsty998.8 KiBpostgresql-15-datasketches_1.7.0-1PIGSTY~noble_arm64.deb
PackageVersionOSORGSIZEFile URL
datasketches_141.7.0el8.x86_64pigsty342.1 KiBdatasketches_14-1.7.0-1PIGSTY.el8.x86_64.rpm
datasketches_141.7.0el8.aarch64pigsty332.3 KiBdatasketches_14-1.7.0-1PIGSTY.el8.aarch64.rpm
datasketches_141.7.0el9.x86_64pigsty323.9 KiBdatasketches_14-1.7.0-1PIGSTY.el9.x86_64.rpm
datasketches_141.7.0el9.aarch64pigsty328.8 KiBdatasketches_14-1.7.0-1PIGSTY.el9.aarch64.rpm
datasketches_141.7.0el10.x86_64pigsty325.9 KiBdatasketches_14-1.7.0-1PIGSTY.el10.x86_64.rpm
datasketches_141.7.0el10.aarch64pigsty325.2 KiBdatasketches_14-1.7.0-1PIGSTY.el10.aarch64.rpm
postgresql-14-datasketches1.7.0d12.x86_64pigsty932.6 KiBpostgresql-14-datasketches_1.7.0-1PIGSTY~bookworm_amd64.deb
postgresql-14-datasketches1.7.0d12.aarch64pigsty933.4 KiBpostgresql-14-datasketches_1.7.0-1PIGSTY~bookworm_arm64.deb
postgresql-14-datasketches1.7.0d13.x86_64pigsty957.3 KiBpostgresql-14-datasketches_1.7.0-1PIGSTY~trixie_amd64.deb
postgresql-14-datasketches1.7.0d13.aarch64pigsty957.5 KiBpostgresql-14-datasketches_1.7.0-1PIGSTY~trixie_arm64.deb
postgresql-14-datasketches1.7.0u22.x86_64pigsty1.1 MiBpostgresql-14-datasketches_1.7.0-1PIGSTY~jammy_amd64.deb
postgresql-14-datasketches1.7.0u22.aarch64pigsty1.1 MiBpostgresql-14-datasketches_1.7.0-1PIGSTY~jammy_arm64.deb
postgresql-14-datasketches1.7.0u24.x86_64pigsty984.5 KiBpostgresql-14-datasketches_1.7.0-1PIGSTY~noble_amd64.deb
postgresql-14-datasketches1.7.0u24.aarch64pigsty998.7 KiBpostgresql-14-datasketches_1.7.0-1PIGSTY~noble_arm64.deb

Source

pig build pkg datasketches;		# build rpm/deb

Install

Make sure PGDG and PIGSTY repo available:

pig repo add pgsql -u   # add both repo and update cache

Install this extension with pig:

pig install datasketches;		# install via package name, for the active PG version

pig install datasketches -v 18;   # install for PG 18
pig install datasketches -v 17;   # install for PG 17
pig install datasketches -v 16;   # install for PG 16
pig install datasketches -v 15;   # install for PG 15
pig install datasketches -v 14;   # install for PG 14

Create this extension with:

CREATE EXTENSION datasketches;

Usage

Sources: README, Apache DataSketches site PostgreSQL extension for approximate analytics sketches and aggregates.

CREATE EXTENSION datasketches;

The extension supports CPC, HLL, Theta, Array Of Doubles, KLL, Quantiles, and Frequent Strings sketches.

Sketch Families

  • CPC for compact distinct counting.
  • HLL for HyperLogLog-style distinct counting.
  • Theta for distinct counting with set operations such as union, intersection, and A-not-B.
  • Array Of Doubles for tuple sketches with arrays of double values per key.
  • KLL for quantiles, ranks, PMF, and CDF estimation.
  • Quantiles sketch for long-term support of distribution estimates.
  • Frequent strings for tracking the heaviest items by count or weight.

Examples

SELECT cpc_sketch_to_string(cpc_sketch_build(1));
SELECT cpc_sketch_distinct(id) FROM random_ints_100m;
SELECT cpc_sketch_get_estimate(cpc_sketch_union(sketch)) FROM cpc_sketch_test;
SELECT theta_sketch_get_estimate(theta_sketch_union(sketch)) FROM theta_sketch_test;
SELECT theta_sketch_get_estimate(theta_sketch_intersection(sketch1, sketch2)) FROM theta_set_op_test;
SELECT hll_sketch_get_estimate(hll_sketch_union(sketch)) FROM hll_sketch_test;
SELECT hll_sketch_get_estimate(hll_sketch_union(hll_sketch_build(1), hll_sketch_build(2)));
SELECT kll_float_sketch_get_quantile(kll_float_sketch_merge(sketch), 0.5) FROM kll_float_sketch_test;
SELECT frequent_strings_sketch_result_no_false_negatives(frequent_strings_sketch_build(9, value), 1000000) FROM zipf_1p1_8k_100m;

Core Operations

  • Build sketches with *_sketch_build(...).
  • Merge or aggregate them with *_sketch_union(...), *_sketch_merge(...), and sketch-specific set-operation helpers.
  • Read estimates with *_sketch_get_estimate(...) and distribution helpers such as kll_float_sketch_get_quantile(...).

Notes

  • The README says the extension targets PostgreSQL 9.6 and higher and depends on Boost 1.75 and DataSketches C++ core 5.0.0 or later.
  • The upstream examples emphasize additive analytics in data cubes, not exact replacement for normal aggregates.
Last updated on