address_standardizer
postgis : Used to parse an address into constituent elements. Generally used to support geocoding address normalization step.
Overview
| ID | Extension | Package | Version | Category | License | Language |
|---|---|---|---|---|---|---|
| 1505 | address_standardizer | postgis | 3.6.2 | GIS | GPL-2.0 | C |
| Attribute | Has Binary | Has Library | Need Load | Has DDL | Relocatable | Trusted |
|---|---|---|---|---|---|---|
--s-d-r | No | Yes | No | Yes | yes | no |
| Relationships | |
|---|---|
| See Also | pgrouting pointcloud pointcloud_postgis h3 h3_postgis q3c ogr_fdw geoip |
| Siblings | postgis postgis_topology postgis_raster postgis_sfcgal postgis_tiger_geocoder address_standardizer_data_us |
Packages
| Type | Repo | Version | PG Major Compatibility | Package Pattern | Dependencies |
|---|---|---|---|---|---|
| EXT | PGDG | 3.6.2 | 18 17 16 15 14 | postgis | - |
| RPM | PGDG | 3.6.2 | 18 17 16 15 14 | postgis36_$v | - |
| DEB | PGDG | 3.6.2 | 18 17 16 15 14 | postgresql-$v-postgis-3 | - |
| Linux / PG | PG18 | PG17 | PG16 | PG15 | PG14 |
|---|---|---|---|---|---|
el8.x86_64 | PGDG 3.6.1 | PGDG 3.6.1 | PGDG 3.6.1 | PGDG 3.6.1 | PGDG 3.6.1 |
el8.aarch64 | PGDG 3.6.1 | PGDG 3.6.1 | PGDG 3.6.1 | PGDG 3.6.1 | PGDG 3.6.1 |
el9.x86_64 | PGDG 3.6.2 | PGDG 3.6.2 | PGDG 3.6.2 | PGDG 3.6.2 | PGDG 3.6.2 |
el9.aarch64 | PGDG 3.6.2 | PGDG 3.6.2 | PGDG 3.6.2 | PGDG 3.6.2 | PGDG 3.6.2 |
el10.x86_64 | PGDG 3.6.2 | PGDG 3.6.2 | PGDG 3.6.2 | PGDG 3.6.2 | PGDG 3.6.2 |
el10.aarch64 | PGDG 3.6.2 | PGDG 3.6.2 | PGDG 3.6.2 | PGDG 3.6.2 | PGDG 3.6.2 |
d12.x86_64 | PGDG 3.6.2 | PGDG 3.6.2 | PGDG 3.6.2 | PGDG 3.6.2 | PGDG 3.6.2 |
d12.aarch64 | PGDG 3.6.2 | PGDG 3.6.2 | PGDG 3.6.2 | PGDG 3.6.2 | PGDG 3.6.2 |
d13.x86_64 | PGDG 3.6.2 | PGDG 3.6.2 | PGDG 3.6.2 | PGDG 3.6.2 | PGDG 3.6.2 |
d13.aarch64 | PGDG 3.6.2 | PGDG 3.6.2 | PGDG 3.6.2 | PGDG 3.6.2 | PGDG 3.6.2 |
u22.x86_64 | PGDG 3.6.2 | PGDG 3.6.2 | PGDG 3.6.2 | PGDG 3.6.2 | PGDG 3.6.2 |
u22.aarch64 | PGDG 3.6.2 | PGDG 3.6.2 | PGDG 3.6.2 | PGDG 3.6.2 | PGDG 3.6.2 |
u24.x86_64 | PGDG 3.6.2 | PGDG 3.6.2 | PGDG 3.6.2 | PGDG 3.6.2 | PGDG 3.6.2 |
u24.aarch64 | PGDG 3.6.2 | PGDG 3.6.2 | PGDG 3.6.2 | PGDG 3.6.2 | PGDG 3.6.2 |
Source
Install
Make sure PGDG repo available:
pig repo add pgdg -u # add pgdg repo and update cacheInstall this extension with pig:
pig install postgis; # install via package name, for the active PG version
pig install address_standardizer; # install by extension name, for the current active PG version
pig install address_standardizer -v 18; # install for PG 18
pig install address_standardizer -v 17; # install for PG 17
pig install address_standardizer -v 16; # install for PG 16
pig install address_standardizer -v 15; # install for PG 15
pig install address_standardizer -v 14; # install for PG 14Create this extension with:
CREATE EXTENSION address_standardizer;Usage
Address Standardizer: Address parsing and standardization for PostGIS
The Address Standardizer is a PostGIS extension that parses a single-line address string into a structured form using configurable lexicon, grammar, and rules tables. It is a more flexible alternative to the built-in normalize_address function in the TIGER geocoder.
Setup
CREATE EXTENSION address_standardizer;Standardizing Addresses
The core function takes an address string and three table references (lex, gaz, rules):
SELECT *
FROM standardize_address(
'us_lex', -- lexicon table
'us_gaz', -- gazetteer table
'us_rules', -- rules table
'1600 Pennsylvania Ave NW, Washington, DC 20500'
);The result contains structured fields:
| Field | Description |
|---|---|
building | Building name or identifier |
house_num | Street number |
predir | Prefix direction (N, S, E, W) |
qual | Qualifier |
pretype | Prefix type |
name | Street name |
suftype | Suffix type (St, Ave, Blvd) |
sufdir | Suffix direction |
ruralroute | Rural route |
extra | Extra information |
city | City name |
state | State |
country | Country |
postcode | ZIP/postal code |
box | PO Box |
unit | Unit/apartment number |
Lexicon, Gazetteer, and Rules Tables
The standardizer is driven by three user-configurable tables:
Lexicon (lex) – Maps input tokens to standardized forms and token classes:
CREATE TABLE us_lex (
id serial PRIMARY KEY,
seq integer,
word text,
stdword text,
token integer
);Gazetteer (gaz) – Maps place names (cities, states) to standard forms:
CREATE TABLE us_gaz (
id serial PRIMARY KEY,
seq integer,
word text,
stdword text,
token integer
);Rules (rules) – Defines grammar rules for parsing addresses:
CREATE TABLE us_rules (
id serial PRIMARY KEY,
rule text
);For US addresses, the address_standardizer_data_us extension provides pre-built data for these tables.