VLM Streetscape

Vision-Language ModelsComputer Vision

Project Overview

Using Vision-Language Models to extract transport-relevant streetscape attributes from open street-level imagery (Mapillary) and aggregate them into interactive maps.

VLM streetscape overview The pipeline from road-network sampling to VLM extraction, mapping, and attribute distributions.

Structured extraction to JSON A single schema turns each street image into nine validated attributes plus a pedestrian-safety score.

GitHub repository →