Skip to content

Parquet files: spatial query speedups #26

@njhenry

Description

@njhenry

We partition the output parquet files for OSM and Overture Maps by top-level tag, which is likely the most common filtering use case.

We should also optimize the parquet files to speed up spatial queries. Some options could include one or more of the following:

  • Hilbert-sorting each partitioned file
  • Spatial sub-partitions
  • File-level bounding boxes
  • H3 or S2 metadata for each row

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions