Dataset Acquisition and Preparation

Instance Segmentation Dataset [Roboflow]

Step 1: Annotate Dataset

The Pallet Dataset contains 519 images, which were annotated as the first step. To expedite the labeling process, I used auto-labeling with Grounding DINO, integrated directly in Roboflow for object detection dataset and SAM2 for segmentation dataset, which significantly improved efficiency.

Note: While Grounding DINO performed well in identifying the ground, it encountered challenges in accurately labeling pallets. It often inferred both the pallet and the payload as a single entity. For SAM2, while it is near to accurate for gound I found using basic polygon tool more efficient for pallets.

Step 2: Dataset Split

For the initial testing phase, the dataset was split as follows:

Training: 70%
Validation: 20%
Testing: 10%

Note: In the future, we aim to implement k-fold cross-validation for more robust model evaluation.

Step 3: Data Augmentation

Using the dataset export feature, the 519 images were augmented with the following parameter variations:

Saturation: Between -25% to +25%
Brightness: Between -25% to +25%
Exposure: Between -15% to +15%

Note: These ranges were set to the maximum permissible limits before Roboflow suggested adjustments, to establish a baseline model. Future enhancements may involve additional augmentation to improve model robustness.

Step 4: Dataset for Training

Dataset consists following distribution:

Training set: 1092
Validation set: 103
Test set: 52