Geocoding Addresses in CARTO Workflows
Converts street addresses or place names into geographic coordinates (point geometries). This is an essential first step when working with tabular data that has an address column but no spatial column.
Prerequisites: Load
for the development process, JSON structure, and validation commands.
Instructions
A geocoding workflow follows this pipeline:
Source Table (with address column) -> Geocode -> (Filter successful) -> Save
Step 1: Load Source Data
Use
. The input table must contain a column with address strings (e.g.
,
,
).
Success: Node outputs a table with at least one text column containing address data.
Step 2: Geocode
| Input | Description | Required |
|---|
| Table with address data | Yes |
| Column containing the address string | Yes |
| Country filter to improve accuracy (e.g. , ) | No, but strongly recommended |
The address column can contain full addresses (
"123 Main St, Springfield, IL 60001"
) or composite values built from multiple columns (concatenate street + city + postal code in a prior
step).
Two output handles: The geocode component produces two separate outputs:
- : Rows where geocoding succeeded -- a column with point geometry is added, plus a JSON column with quality info (confidence score, match type).
- : Rows where geocoding failed -- is NULL.
Connect downstream nodes to the correct handle based on your needs.
Success: The geocode node is configured with the address column and (ideally) a country filter. Edges connect to the
and/or
output handles.
Step 3: Filter or Review Results (optional)
For the match output:
- Optionally filter by confidence using the metadata column (e.g. extract confidence from via ).
For the unmatch output:
- Save to a separate table for review and manual correction.
- Common failure causes: typos, incomplete addresses, PO boxes, ambiguous place names.
Success: High-confidence geocoded rows are isolated; failed rows are captured for review.
Step 4: Save Results
Use
to persist the geocoded output. The
column contains WGS84 (EPSG:4326) point geometries, ready for visualization in CARTO Builder or further spatial analysis.
Success: Validated workflow that can be uploaded via
.
Geocoding Under the Hood
The workflow component wraps the CARTO Analytics Toolbox function
, which:
- Adds a column with point geometry to each row
- Adds a JSON column with quality information (confidence, match type)
- Uses CARTO Location Data Services (LDS) -- each geocoded row consumes LDS quota
Check available quota by querying the Analytics Toolbox
function. The fully-qualified name is provider-specific (BigQuery:
`carto-un.carto`.LDS_QUOTA_INFO()
; Snowflake:
CARTO.CARTO.LDS_QUOTA_INFO()
; Databricks: stored procedure in the dedicated AT schema). Load
and consult
references/providers/<provider>.md
for the AT path on your warehouse.
Gotchas
- Geocoding consumes LDS quota. Each row geocoded counts against the account's Location Data Services quota. Check quota availability before bulk operations, especially on large tables.
- Two output handles: and . Don't connect to the wrong one -- has geometries, has NULLs. If you connect the handle to a spatial operation, it will fail.
- Country filter is strongly recommended. Without it, ambiguous addresses may resolve to the wrong country (e.g. "Springfield" exists in 30+ US states and in other countries). The country parameter improves both accuracy and speed.
- Address formatting matters. Well-formatted addresses produce better results:
"123 Main St, Springfield, IL 60001"
works better than "123 main street springfield"
. Include city, state/region, and postal code when available.
- Provider casing & SQL dialect. Examples in this skill use lowercase column names (BigQuery / Databricks / Postgres / Redshift convention); on Snowflake unquoted identifiers surface UPPERCASE (e.g. , ). When writing dialect-specific SQL or referencing the AT path, see
carto-create-workflow/references/providers/<provider>.md
.
- For large tables, consider batching. Geocoding hundreds of thousands of rows in a single run can exhaust quota or time out. Split into batches if needed.
- Output geometry is always WGS84 points. The column contains EPSG:4326 point geometries regardless of the input address format or country.
- Failed geocodes deserve review. The output is not just noise -- it often reveals data quality issues (missing postal codes, abbreviated city names, non-standard formatting) that can be fixed and re-geocoded.
Reference Templates
Academy tutorials and workflow templates covering geocoding:
| Resource | Description | URL |
|---|
| Geocoding (BigQuery AT) | Step-by-step geocoding with Analytics Toolbox for BigQuery | Academy link |
| Geocoding (Snowflake AT) | Step-by-step geocoding with Analytics Toolbox for Snowflake | Academy link |
| Workflow template: Geocode street addresses | Generating new spatial data from addresses | Academy link |
Common Variations
| Variation | How |
|---|
| Composite address from multiple columns | Add a step before geocoding to concatenate street, city, state, zip into one column |
| Geocode + spatial join | Chain: Geocode -> match -> Spatial Join (e.g. point-in-polygon to assign regions) |
| Geocode + enrichment | Chain: Geocode -> match -> Buffer/Isochrone -> Enrich (add demographics around each geocoded point) |
| Capture failures for re-processing | Connect both and handles to separate nodes |
| Filter by confidence | After geocode, use to extract confidence from the metadata JSON, then filter |