pg_kazsearch
pg_kazsearch: Kazakh full-text search extension for PostgreSQL
Overview
| ID | Extension | Package | Version | Category | License | Language |
|---|---|---|---|---|---|---|
| 2200 | pg_kazsearch | pg_kazsearch | 0.1.0 | FTS | LGPL-3.0 | Rust |
| Attribute | Has Binary | Has Library | Need Load | Has DDL | Relocatable | Trusted |
|---|---|---|---|---|---|---|
| --s-d-- | No | Yes | No | Yes | No | No |
Upstream release/package version is 2.0.0; extension control version is 0.1.0.
Packages
| Type | Repo | Version | PG Major Compatibility | Package Pattern | Dependencies |
|---|---|---|---|---|---|
| EXT | PIGSTY | 0.1.0 | 18 17 16 15 14 | pg_kazsearch | - |
| RPM | PIGSTY | 2.0.0 | 18 17 16 15 14 | pg_kazsearch_$v | - |
| DEB | PIGSTY | 2.0.0 | 18 17 16 15 14 | postgresql-$v-pg-kazsearch | - |
| Linux / PG | PG18 | PG17 | PG16 | PG15 | PG14 |
|---|---|---|---|---|---|
| el8.x86_64 | PIGSTY 2.0.0 | PIGSTY 2.0.0 | PIGSTY 2.0.0 | MISS | MISS |
| el8.aarch64 | PIGSTY 2.0.0 | PIGSTY 2.0.0 | PIGSTY 2.0.0 | MISS | MISS |
| el9.x86_64 | PIGSTY 2.0.0 | PIGSTY 2.0.0 | PIGSTY 2.0.0 | MISS | MISS |
| el9.aarch64 | PIGSTY 2.0.0 | PIGSTY 2.0.0 | PIGSTY 2.0.0 | MISS | MISS |
| el10.x86_64 | PIGSTY 2.0.0 | PIGSTY 2.0.0 | PIGSTY 2.0.0 | MISS | MISS |
| el10.aarch64 | PIGSTY 2.0.0 | PIGSTY 2.0.0 | PIGSTY 2.0.0 | MISS | MISS |
| d12.x86_64 | PIGSTY 2.0.0 | PIGSTY 2.0.0 | PIGSTY 2.0.0 | MISS | MISS |
| d12.aarch64 | PIGSTY 2.0.0 | PIGSTY 2.0.0 | PIGSTY 2.0.0 | MISS | MISS |
| d13.x86_64 | PIGSTY 2.0.0 | PIGSTY 2.0.0 | PIGSTY 2.0.0 | MISS | MISS |
| d13.aarch64 | PIGSTY 2.0.0 | PIGSTY 2.0.0 | PIGSTY 2.0.0 | MISS | MISS |
| u22.x86_64 | PIGSTY 2.0.0 | PIGSTY 2.0.0 | PIGSTY 2.0.0 | MISS | MISS |
| u22.aarch64 | PIGSTY 2.0.0 | PIGSTY 2.0.0 | PIGSTY 2.0.0 | MISS | MISS |
| u24.x86_64 | PIGSTY 2.0.0 | PIGSTY 2.0.0 | PIGSTY 2.0.0 | MISS | MISS |
| u24.aarch64 | PIGSTY 2.0.0 | PIGSTY 2.0.0 | PIGSTY 2.0.0 | MISS | MISS |
Source
pig build pkg pg_kazsearch; # build rpm/deb
Install
Make sure the PGDG and PIGSTY repos are available:
pig repo add pgsql -u # add both repos and update cache
Install this extension with pig:
pig install pg_kazsearch; # install via package name, for the active PG version
pig install pg_kazsearch -v 18; # install for PG 18
pig install pg_kazsearch -v 17; # install for PG 17
pig install pg_kazsearch -v 16; # install for PG 16
Create this extension with:
CREATE EXTENSION pg_kazsearch;
Usage
Sources: README and project repo.
pg_kazsearch is a PostgreSQL full-text search extension for the Kazakh language.
The upstream README describes it as a Rust extension built with pgrx that plugs into PostgreSQL’s text search pipeline.
It creates a ready-to-use configuration named kazakh_cfg and the supporting dictionary pg_kazsearch_dict.
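Once the extension has been created, the two objects named above can be checked with PostgreSQL's standard catalogs and the built-in ts_debug function. This is a minimal sketch that relies only on the names kazakh_cfg and pg_kazsearch_dict from this page; the sample phrase is the one used in the quick start, and no particular output is assumed.

```sql
-- Confirm that the text search configuration and dictionary exist.
SELECT cfgname FROM pg_ts_config WHERE cfgname = 'kazakh_cfg';
SELECT dictname, dictinitoption FROM pg_ts_dict WHERE dictname = 'pg_kazsearch_dict';

-- Show how each token is classified and which dictionary produced its lexemes.
SELECT alias, token, dictionaries, lexemes
FROM ts_debug('kazakh_cfg', 'президенттің жарлығы');
```

If either catalog query returns no rows, the extension has probably not been created in the current database.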
Quick Start
CREATE EXTENSION pg_kazsearch;
SELECT to_tsvector('kazakh_cfg', 'президенттің жарлығы');
-- 'жарлық':2 'президент':1
SELECT ts_lexize('pg_kazsearch_dict', 'алмаларымыздағы');
-- {алма}
Use Cases
The README shows the extension being used for:
- stemming individual Kazakh words
- building tsvector values with to_tsvector('kazakh_cfg', ...)
- adding generated tsvector columns to a table
- indexing those columns with GIN
- searching with websearch_to_tsquery('kazakh_cfg', ...)
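The table workflow in this section assumes an articles table with title and body columns, which this page does not define. A minimal hypothetical definition, with the id column, column types, and sample row chosen here purely for illustration, might be:

```sql
-- Hypothetical table; only the names articles, title, and body come from this page.
CREATE TABLE articles (
    id    bigint GENERATED ALWAYS AS IDENTITY PRIMARY KEY,
    title text NOT NULL,
    body  text NOT NULL DEFAULT ''
);

-- One sample row that reuses the phrase from the quick start.
INSERT INTO articles (title, body)
VALUES ('президенттің жарлығы', 'президенттің жарлығы');
```

The NOT NULL constraints are a deliberate choice: to_tsvector returns NULL for a NULL input, and concatenating a NULL tsvector makes the whole generated value NULL. If the columns must be nullable, wrapping them in coalesce(title, '') is a common alternative.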
Example table workflow:
ALTER TABLE articles ADD COLUMN fts tsvector
GENERATED ALWAYS AS (
setweight(to_tsvector('kazakh_cfg', title), 'A') ||
setweight(to_tsvector('kazakh_cfg', body), 'B')
) STORED;
CREATE INDEX idx_fts ON articles USING GIN (fts);
SELECT title FROM articles
WHERE fts @@ websearch_to_tsquery('kazakh_cfg', 'президенттің жарлығы')
ORDER BY ts_rank_cd(fts, websearch_to_tsquery('kazakh_cfg', 'президенттің жарлығы')) DESC
LIMIT 10;
Tuning
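Before changing any dictionary option, it may help to record the current settings and a baseline stemming result to compare against afterwards. The sketch below uses only the standard pg_ts_dict catalog and the ts_lexize call already shown on this page; whether dictinitoption is empty before any change depends on how the extension defines the dictionary, which is not stated here.

```sql
-- Current option string of the dictionary (may be NULL or empty if defaults are in use).
SELECT dictinitoption FROM pg_ts_dict WHERE dictname = 'pg_kazsearch_dict';

-- Baseline stem for a known word; rerun after any option change and compare.
SELECT ts_lexize('pg_kazsearch_dict', 'алмаларымыздағы');
```

Note that tsvector values computed and stored earlier are not refreshed automatically when a dictionary's options change, so stored columns and their indexes reflect the settings in effect when each row was written.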
Penalty weights are adjustable at runtime:
ALTER TEXT SEARCH DICTIONARY pg_kazsearch_dict (w_deriv = 3.5, w_short_char = 100.0);
Deployment
The README documents three supported paths:
- pre-built Debian/Ubuntu packages
- a Docker image based on ghcr.io/darkhanakh/pg-kazsearch
- source builds with cargo pgrx install
The repository metadata in this project matches PostgreSQL 16-18.
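Whichever deployment path is used, a quick check that the server can see the extension, and at which version, might look like the following. It uses only the standard pg_available_extensions view and makes no assumption about the values returned.

```sql
-- Is the extension visible to this server, and is it installed in this database?
SELECT name, default_version, installed_version
FROM pg_available_extensions
WHERE name = 'pg_kazsearch';

-- Server version, for comparison with the supported PostgreSQL range.
SHOW server_version;
```

A NULL installed_version means the package files are present but CREATE EXTENSION has not yet been run in the current database.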