Vanilla License Plate CNN

This repo is where I iterated on compact CNNs for license plate localisation, building the preprocessing, training, and evaluation stack from scratch in PyTorch. It let me benchmark ResNet, MobileNetV3, EfficientNet, and my own depthwise network on the same unified datasets I curated. Repo here: github.com/zhaojinchu/vanilla_LPR_CNN.

Experiment goals

Treat plate localisation as a single-object regression problem with (x_min, y_min, x_max, y_max) outputs.
Compare backbone trade-offs (accuracy vs. parameter count vs. throughput) across ResNet, MobileNetV3, EfficientNet, and custom CNNs.
Stress-test mixed-precision training to keep GPU utilisation high while experimenting on laptops and lab machines.
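
To make the regression framing and the mixed-precision goal concrete, here is a minimal sketch of one training step. It is not the code from train.py: TinyBoxRegressor, train_step, the sigmoid output, and the smooth L1 loss are all assumptions chosen for illustration. Autocast and gradient scaling are only switched on when a CUDA device is in use, so the same step also runs on a CPU-only laptop.

```python
import torch
from torch import nn


class TinyBoxRegressor(nn.Module):
    """Hypothetical stand-in for a backbone plus a 4-value regression head."""

    def __init__(self) -> None:
        super().__init__()
        self.features = nn.Sequential(
            nn.Conv2d(3, 8, kernel_size=3, stride=2, padding=1),
            nn.ReLU(),
            nn.AdaptiveAvgPool2d(1),
            nn.Flatten(),
        )
        self.head = nn.Linear(8, 4)  # (x_min, y_min, x_max, y_max)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # Assumption: targets are normalised to [0, 1], so squash outputs too.
        return torch.sigmoid(self.head(self.features(x)))


def train_step(model, optimizer, scaler, images, targets, device) -> float:
    """Run one optimisation step; mixed precision is used only on CUDA."""
    use_amp = device.type == "cuda"
    amp_dtype = torch.float16 if use_amp else torch.bfloat16
    optimizer.zero_grad(set_to_none=True)
    with torch.autocast(device_type=device.type, dtype=amp_dtype, enabled=use_amp):
        preds = model(images)
        # Compute the loss in float32 for numerical stability.
        loss = nn.functional.smooth_l1_loss(preds.float(), targets)
    # With the scaler disabled these calls reduce to a plain backward/step.
    scaler.scale(loss).backward()
    scaler.step(optimizer)
    scaler.update()
    return loss.item()
```

A caller would build the scaler once with torch.cuda.amp.GradScaler(enabled=torch.cuda.is_available()) (newer PyTorch releases also expose it as torch.amp.GradScaler) and reuse it across steps.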

Model tooling

Configurable training scripts (train.py, test_train.py, test_train2.py) that resume from checkpoints and log IoU/F1 to CSV.
Factory-driven MODEL_MAP makes swapping backbones a one-line change while keeping a shared regression head.
Automatic checkpoint rotation that snapshots best_model_epoch_*.pth and captures hyperparameters alongside metrics.
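
The factory idea can be sketched roughly as below. Only the MODEL_MAP name comes from the repo; the two toy backbones, BoxRegressor, and build_model are hypothetical stand-ins (the real map points at ResNet, MobileNetV3, EfficientNet, and the custom depthwise network). Each factory returns a feature extractor plus its output channel count, so the shared head can be attached without knowing which backbone it sits on.

```python
from typing import Callable, Dict, Tuple

import torch
from torch import nn


def plain_backbone() -> Tuple[nn.Module, int]:
    """Hypothetical plain conv stack; returns (feature extractor, channels)."""
    layers = nn.Sequential(
        nn.Conv2d(3, 16, 3, stride=2, padding=1), nn.ReLU(),
        nn.Conv2d(16, 32, 3, stride=2, padding=1), nn.ReLU(),
    )
    return layers, 32


def depthwise_backbone() -> Tuple[nn.Module, int]:
    """Hypothetical depthwise-separable stack with the same output width."""
    layers = nn.Sequential(
        nn.Conv2d(3, 16, 3, stride=2, padding=1), nn.ReLU(),
        nn.Conv2d(16, 16, 3, stride=2, padding=1, groups=16),  # depthwise
        nn.Conv2d(16, 32, 1), nn.ReLU(),                        # pointwise
    )
    return layers, 32


# Swapping backbones means changing only the key passed to build_model.
MODEL_MAP: Dict[str, Callable[[], Tuple[nn.Module, int]]] = {
    "plain": plain_backbone,
    "depthwise": depthwise_backbone,
}


class BoxRegressor(nn.Module):
    """Any backbone followed by one shared 4-value regression head."""

    def __init__(self, backbone: nn.Module, channels: int) -> None:
        super().__init__()
        self.backbone = backbone
        self.head = nn.Sequential(
            nn.AdaptiveAvgPool2d(1), nn.Flatten(), nn.Linear(channels, 4)
        )

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return self.head(self.backbone(x))


def build_model(name: str) -> BoxRegressor:
    if name not in MODEL_MAP:
        raise KeyError(f"Unknown backbone {name!r}; choose from {sorted(MODEL_MAP)}")
    backbone, channels = MODEL_MAP[name]()
    return BoxRegressor(backbone, channels)
```

With this shape, build_model("depthwise") and build_model("plain") produce models with identical output contracts, which is what keeps the comparison between backbones fair.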

Data & evaluation

Preprocessing pipeline merges CCPD2019 and Kaggle YOLO datasets, letterboxes imagery, and normalises annotations.
Iterable dataset streams samples from CSV, applies augmentations, and feeds 640×480 tensors to the trainers.
Evaluation scripts surface IoU, precision/recall, FPS, and model size to contextualise each experiment.
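
The geometry behind the letterboxing and the IoU metric is short enough to sketch in plain Python. The function names and the 1280×720 source size in the usage note are assumptions for illustration, not the repo's actual helpers; boxes are (x_min, y_min, x_max, y_max) throughout, and the target canvas is 640 wide by 480 high.

```python
from typing import Tuple

Box = Tuple[float, float, float, float]  # (x_min, y_min, x_max, y_max)


def letterbox_box(box: Box, src_w: float, src_h: float,
                  dst_w: float = 640, dst_h: float = 480) -> Box:
    """Map a box from the source image into the letterboxed canvas."""
    # One uniform scale preserves aspect ratio; leftover space is split
    # evenly into padding on both sides.
    scale = min(dst_w / src_w, dst_h / src_h)
    pad_x = (dst_w - src_w * scale) / 2
    pad_y = (dst_h - src_h * scale) / 2
    x_min, y_min, x_max, y_max = box
    return (x_min * scale + pad_x, y_min * scale + pad_y,
            x_max * scale + pad_x, y_max * scale + pad_y)


def normalise_box(box: Box, width: float = 640, height: float = 480) -> Box:
    """Express pixel coordinates as fractions of the canvas size."""
    x_min, y_min, x_max, y_max = box
    return (x_min / width, y_min / height, x_max / width, y_max / height)


def iou(a: Box, b: Box) -> float:
    """Intersection over union of two axis-aligned boxes."""
    inter_w = max(0.0, min(a[2], b[2]) - max(a[0], b[0]))
    inter_h = max(0.0, min(a[3], b[3]) - max(a[1], b[1]))
    inter = inter_w * inter_h
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0
```

For example, a box at (200, 100, 600, 300) in a 1280×720 frame is scaled by 0.5 and shifted down by 60 px of vertical padding, landing at (100, 110, 300, 210) on the 640×480 canvas.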

Next up: port the best-performing backbone into a multi-object detector and bolt on OCR so it can run alongside the YOLO pipeline.