The dataset was collected from fixed-position traffic cameras deployed in Indiana, USA, with footage provided by an industrial partner under a collaborative agreement. The video was captured under natural driving conditions, without experimental interference, so that it reflects realistic urban traffic patterns. All annotations were manually curated with a custom semi-automated labeling toolkit developed for this project. The toolkit improved annotation efficiency while maintaining labeling accuracy. The annotations comprise object detections, tracks, and identity associations across multiple cameras.
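To make the three annotation levels concrete, the following is a minimal sketch of what a single annotation record could look like and how cross-camera identities might be queried. The record layout, the field names (`camera_id`, `frame`, `track_id`, `global_id`, `bbox`), and the helper `cameras_per_identity` are all assumptions made for illustration; the actual storage format of the dataset is not specified here, and the sample values are invented.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class Annotation:
    """One labeled box; every field name here is hypothetical."""
    camera_id: str   # which fixed camera produced the frame
    frame: int       # frame index within that camera's video
    track_id: int    # per-camera track label (tracking)
    global_id: int   # identity shared across cameras (association)
    bbox: tuple      # (x, y, width, height) in pixels (detection)


def cameras_per_identity(annotations):
    """Map each global identity to the sorted list of cameras that saw it."""
    seen = defaultdict(set)
    for ann in annotations:
        seen[ann.global_id].add(ann.camera_id)
    return {gid: sorted(cams) for gid, cams in seen.items()}


# Made-up records for illustration only; not taken from the dataset.
example = [
    Annotation("cam_a", 10, 1, 100, (50.0, 60.0, 40.0, 30.0)),
    Annotation("cam_a", 11, 1, 100, (54.0, 61.0, 40.0, 30.0)),
    Annotation("cam_b", 95, 7, 100, (300.0, 120.0, 38.0, 29.0)),
    Annotation("cam_b", 95, 8, 101, (10.0, 200.0, 45.0, 33.0)),
]

print(cameras_per_identity(example))
```

In this sketch, a track label is only meaningful within one camera, while the global identity links the same object across cameras; keeping the two as separate fields lets single-camera tracking and multi-camera association be evaluated independently.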