CLIPSeg-Map

Open-Vocabulary Top-Down Semantic Mapping from RGB-D Sequences

Generates top-down semantic maps from RGB-D sequences using CLIPSeg for zero-shot, open-vocabulary segmentation.

Installation

# Clone
git clone https://github.com/cukurovaai/ClipSegMap.git
cd ClipSegMap

# Create conda environment
conda env create -f environment.yml
conda activate clipseg-map

# Install package
pip install -e .

Quick Start

Command Line

# Edit config/default.yaml with your scene path, then run:
clipseg-map

# Or specify scene directly:
clipseg-map --scene /path/to/scene --output results/

Python API

from clipseg_map import SemanticMapper, MapConfig

# Load configuration
config = MapConfig.from_yaml("config/default.yaml")

# Create mapper and build map
mapper = SemanticMapper(config.data_dir, config)
result = mapper.build_map()

# Access outputs
predictions = result["predictions"]      # (H, W) class indices
color_topdown = result["color_topdown"]  # (H, W, 3) RGB top-down

# Save results
mapper.save_results(result, config.output_path)
mapper.visualize(result, f"{config.output_path}/semantic_map.png")

Configuration

Edit config/default.yaml:

# Scene configuration
scene_data_root: "/path/to/dataset"
scene_name: "scene_001"
output_dir: "output"

# Map parameters
grid_size: 500
cell_size: 0.05
camera_height: 1.5
fov: 90.0

# Processing
frame_skip: 10
depth_sample_rate: 100

# Semantic labels
labels:
  - floor
  - wall
  - chair
  - table
  - door

Override via CLI:

clipseg-map --grid-size 800 --cell-size 0.03 --frame-skip 5

Data Format

scene_dir/
├── rgb/          # RGB images: 000000.png, 000001.png, ...
├── depth/        # Depth maps: 000000.npy, 000001.npy, ... (meters, float32)
└── pose/         # Camera poses: 000000.txt, 000001.txt, ...

Pose format: 7 values x y z qx qy qz qw or 16 values (4×4 matrix, row-major)

Output

File	Description
`predictions.npy`	Class indices (H, W)
`scores.npy`	Class scores (H, W, N)
`obstacles.npy`	Obstacle map (H, W)
`color_topdown.npy`	RGB projection (H, W, 3)
`semantic_map.png`	Visualization

Project Structure

ClipSegMap/
├── clipseg_map/
│   ├── __init__.py      # Package exports
│   ├── cli.py           # CLI entry point
│   ├── config.py        # MapConfig dataclass
│   ├── model.py         # CLIPSeg model wrapper
│   ├── mapper.py        # SemanticMapper class
│   ├── data/
│   │   └── loader.py    # Data loading utilities
│   └── utils/
│       ├── geometry.py      # Point cloud utilities
│       └── visualization.py # Visualization functions
├── config/
│   └── default.yaml     # Default configuration
├── scripts/
│   └── run_mapping.py   # Example script
└── tests/

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
clipseg_map		clipseg_map
config		config
scripts		scripts
tests		tests
.gitignore		.gitignore
README.md		README.md
environment.yml		environment.yml
pyproject.toml		pyproject.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

CLIPSeg-Map

Installation

Quick Start

Command Line

Python API

Configuration

Data Format

Output

Project Structure

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

CLIPSeg-Map

Installation

Quick Start

Command Line

Python API

Configuration

Data Format

Output

Project Structure

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages