Dataset Acquisition and Preparation


Step 1: Annotate Dataset
The Pallet Dataset contains 519 images, which were annotated as the first step. To expedite the labeling process, I used auto-labeling with Grounding DINO, integrated directly in Roboflow for object detection dataset and SAM2 for segmentation dataset, which significantly improved efficiency.
Note: While Grounding DINO performed well in identifying the ground, it encountered challenges in accurately labeling pallets. It often inferred both the pallet and the payload as a single entity. For SAM2, while it is near to accurate for gound I found using basic polygon tool more efficient for pallets.
Step 2: Dataset Split
For the initial testing phase, the dataset was split as follows:
- Training: 70%
- Validation: 20%
- Testing: 10%
Note: In the future, we aim to implement k-fold cross-validation for more robust model evaluation.
Step 3: Data Augmentation
Using the dataset export feature, the 519 images were augmented with the following parameter variations:
- Saturation: Between -25% to +25%
- Brightness: Between -25% to +25%
- Exposure: Between -15% to +15%
Note: These ranges were set to the maximum permissible limits before Roboflow suggested adjustments, to establish a baseline model. Future enhancements may involve additional augmentation to improve model robustness.
Step 4: Dataset for Training
Dataset consists following distribution:
- Training set: 1092
- Validation set: 103
- Test set: 52