Instance segmentation model framework for Formula Student cone detection.
All models are fine-tuned on the FSOCO segmentation dataset (CVAT XML format, converted to COCO JSON).
| Model | Architecture | mAP@50 | mAP@50:95 | Precision | Recall | Status |
|---|---|---|---|---|---|---|
| RF-DETR Small | DINOv2 backbone | 80.4% | 55.7% | 89.9% | 75.8% | Trained |
Download pre-trained weights and place them in the corresponding model directory.
| Model | Weights | Size | Location |
|---|---|---|---|
| RF-DETR | checkpoint_best_ema.pth | 388 MB | models/rfdetr/weights/segmentation/ |
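A quick way to confirm the downloaded weights landed in the expected place before running inference (a minimal sketch; the path comes from the table above, and the helper name is illustrative):

```python
from pathlib import Path

# Expected weight locations, keyed by model name (from the table above)
EXPECTED_WEIGHTS = {
    "rfdetr": Path("models/rfdetr/weights/segmentation/checkpoint_best_ema.pth"),
}

def missing_weights(root: Path) -> list[str]:
    """Return the names of models whose weight files are absent under root."""
    return [name for name, rel in EXPECTED_WEIGHTS.items()
            if not (root / rel).is_file()]
```

Running this from the repository root lists any models that still need their weights downloaded.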
```bash
# Default model (rfdetr)
python inference/realtime.py

# Specify model explicitly
python inference/realtime.py --model rfdetr

# List available models
python inference/realtime.py --list-models

# Custom settings
python inference/realtime.py --model rfdetr --camera 1 --resolution 1920x1080 --conf-threshold 0.7
```

Keyboard Controls:

- `q` - Quit
- `s` - Save screenshot
- `m` - Toggle mask overlay
- `b` - Toggle bounding boxes
- `c` - Toggle confidence scores
- `l` - Toggle legend
- `d` - Toggle detection counts
- `+` / `-` - Adjust confidence threshold
- `SPACE` - Pause/Resume
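The toggle keys above map naturally onto a small dispatch table. A minimal sketch of how such key handling could look (the `state` and `TOGGLES` names are illustrative, not the viewer's actual implementation):

```python
# Display state flipped by the toggle keys listed above
state = {"masks": True, "boxes": True, "scores": True,
         "legend": True, "counts": True, "paused": False}

# Key -> state flag for the single-key toggles
TOGGLES = {"m": "masks", "b": "boxes", "c": "scores", "l": "legend", "d": "counts"}

def handle_key(key: str, state: dict, conf: float) -> float:
    """Apply one keypress to the display state; return the (possibly adjusted)
    confidence threshold."""
    if key in TOGGLES:
        state[TOGGLES[key]] = not state[TOGGLES[key]]
    elif key == "+":
        conf = min(1.0, round(conf + 0.05, 2))
    elif key == "-":
        conf = max(0.0, round(conf - 0.05, 2))
    elif key == " ":  # SPACE pauses/resumes
        state["paused"] = not state["paused"]
    return conf
```

In a real loop the key would come from something like OpenCV's `cv2.waitKey(1)`, decoded to a character before dispatch.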
```bash
# Single image
python inference/image.py --model rfdetr --image path/to/image.jpg

# Folder of images
python inference/image.py --model rfdetr --image path/to/folder/ --output output/

# Options
python inference/image.py --model rfdetr --image test.jpg --no-masks   # Bounding boxes only
python inference/image.py --model rfdetr --image test.jpg --no-boxes   # Masks only
```

```
Segmentation_models/
├── inference/                       # Unified inference system
│   ├── base.py                      # Abstract model interface
│   ├── registry.py                  # Model loader
│   ├── realtime.py                  # Real-time webcam inference
│   └── image.py                     # Single/batch image inference
│
├── common/                          # Shared utilities
│   ├── visualization.py             # Drawing masks, boxes, legends
│   ├── data/
│   │   ├── convert_yolo_to_coco.py  # YOLO → COCO conversion
│   │   └── convert_cvat_to_coco.py  # CVAT XML (RLE) → COCO conversion
│   └── tools/
│       └── visualize_dataset.py     # Dataset visualization
│
├── models/                          # Model implementations
│   └── rfdetr/
│       ├── adapter.py               # RF-DETR adapter
│       ├── training/
│       │   ├── train_local.py
│       │   └── train_modal.py
│       └── weights/
│           └── segmentation/
│               └── checkpoint_best_ema.pth
│
├── results/                         # Model evaluation metrics (JSON)
│
├── output/                          # Inference output images
│
└── config/
    └── models.yaml                  # Model registry
```
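The `results/` directory holds per-model evaluation metrics as JSON. A sketch of writing one such file (the schema here is an assumption, mirroring the metrics table above, not the project's actual format):

```python
import json
from pathlib import Path

# Hypothetical metrics record mirroring the evaluation table above
metrics = {
    "model": "rfdetr",
    "mAP@50": 0.804,
    "mAP@50:95": 0.557,
    "precision": 0.899,
    "recall": 0.758,
}

def save_metrics(metrics: dict, results_dir: Path) -> Path:
    """Write one model's evaluation metrics to results/<model>_eval.json."""
    results_dir.mkdir(parents=True, exist_ok=True)
    path = results_dir / f"{metrics['model']}_eval.json"
    path.write_text(json.dumps(metrics, indent=2))
    return path
```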
Training is model-specific:

```bash
# RF-DETR local training
python models/rfdetr/training/train_local.py

# RF-DETR cloud training (Modal)
modal run models/rfdetr/training/train_modal.py
```

```bash
# Convert CVAT XML segmentation dataset (RLE masks) to COCO format
python common/data/convert_cvat_to_coco.py

# Convert YOLO segmentation dataset to COCO format
python common/data/convert_yolo_to_coco.py
```
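At its core, the YOLO → COCO step denormalizes polygon coordinates and derives a bounding box. A minimal sketch of that transform (the conversion script's actual interface may differ):

```python
def yolo_polygon_to_coco(poly_norm, img_w, img_h):
    """Convert a YOLO-normalized polygon [x1, y1, x2, y2, ...] (values in 0..1)
    into a COCO segmentation (absolute pixels) and an [x, y, w, h] bbox."""
    # Even indices are x (scale by width), odd indices are y (scale by height)
    seg = [round(v * (img_w if i % 2 == 0 else img_h), 2)
           for i, v in enumerate(poly_norm)]
    xs, ys = seg[0::2], seg[1::2]
    x0, y0 = min(xs), min(ys)
    bbox = [x0, y0, round(max(xs) - x0, 2), round(max(ys) - y0, 2)]
    return seg, bbox
```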
```bash
# Visualize dataset annotations
python common/tools/visualize_dataset.py --dataset-dir /path/to/dataset --split train
```

To add a new model:

1. Create the model directory: `models/<model_name>/`

2. Implement the adapter in `models/<model_name>/adapter.py`:

   ```python
   from inference.base import BaseSegmentationModel

   class MyModelAdapter(BaseSegmentationModel):
       @property
       def name(self) -> str:
           return "mymodel"

       def load(self, weights_path, device="cuda", num_classes=4):
           # Load your model
           pass

       def predict(self, image, conf_threshold=0.5):
           # Return dict with: boxes, masks, class_ids, scores, inference_time_ms
           pass
   ```

3. Register it in `config/models.yaml`:

   ```yaml
   models:
     mymodel:
       module: models.mymodel.adapter
       class: MyModelAdapter
       default_weights: models/mymodel/weights/best.pth
       num_classes: 4
       description: My custom segmentation model
   ```

4. Add training scripts in `models/<model_name>/training/`
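A registry entry like the one above can be resolved with a dynamic import. A sketch of the idea (the actual `inference/registry.py` code may differ):

```python
import importlib

def load_adapter_class(entry: dict):
    """Resolve a models.yaml entry ({'module': ..., 'class': ...}) to a class object."""
    module = importlib.import_module(entry["module"])
    return getattr(module, entry["class"])
```

For example, `load_adapter_class({"module": "models.mymodel.adapter", "class": "MyModelAdapter"})` would return the adapter class, which the registry can then instantiate and `load()` with the configured default weights.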
```bash
pip install torch torchvision opencv-python numpy pyyaml

# Model-specific
pip install rfdetr  # For RF-DETR
```

The models detect four cone types:
- Blue Cone (class 0) - Track boundary
- Yellow Cone (class 1) - Track boundary
- Large Orange Cone (class 2) - Special marker
- Orange Cone (class 3) - Track boundary
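For visualization, the class ids above can be mapped to overlay colors. A small sketch (the BGR values are illustrative, not the project's actual palette in `common/visualization.py`):

```python
# Illustrative BGR colors for the four cone classes (class id -> color)
CONE_COLORS = {
    0: (255, 0, 0),    # Blue Cone
    1: (0, 255, 255),  # Yellow Cone
    2: (0, 100, 255),  # Large Orange Cone
    3: (0, 165, 255),  # Orange Cone
}

def color_for(class_id: int) -> tuple:
    """Return the overlay color for a class id; grey for unknown ids."""
    return CONE_COLORS.get(class_id, (128, 128, 128))
```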