TY - GEN
T1 - Speaking the Same Traffic Language
T2 - 12th International Conference on Future Data and Security Engineering, FDSE 2025
AU - Shahin, Mahtab
AU - Saeidi, Soheila
AU - Malhi, Avleen
AU - Kaisar, Evangelos I.
AU - Bauk, Sanja
AU - Soe, Ralf Martin
N1 - Publisher Copyright:
© The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2026.
PY - 2026
Y1 - 2026
N2 - Traffic incident prediction is a key function of smart city management, allowing operators to proactively mitigate congestion and enhance roadway safety. While prior studies report promising results in single-city contexts, the generalizability of models across cities with distinct infrastructures, traffic dynamics, and data standards remains underexplored. This paper presents a cross-continental case study comparing Florida, USA (Traffic Message Channel data), and Tallinn, Estonia (DATEX II data). We introduce a harmonization framework to align structurally different datasets and evaluate three machine learning models: Extreme Gradient Boosting (XGBoost), Random Forest, and Long Short-Term Memory (LSTM) networks under both intra-city and cross-city settings. Intra-city experiments achieve accuracies of up to 94%, with XGBoost consistently outperforming alternatives. However, direct cross-city transfer without retraining results in a 13–15% F1-score decline, evidencing substantial domain shift. Feature-level analysis reveals that speed deviations and travel time anomalies generalize across contexts, whereas flow and temporal encodings are city-specific. The proposed harmonization and evaluation methodology establishes a reproducible basis for benchmarking across heterogeneous environments, underscoring the importance of local adaptation while guiding the development of transferable Intelligent Transportation System (ITS) solutions.
AB - Traffic incident prediction is a key function of smart city management, allowing operators to proactively mitigate congestion and enhance roadway safety. While prior studies report promising results in single-city contexts, the generalizability of models across cities with distinct infrastructures, traffic dynamics, and data standards remains underexplored. This paper presents a cross-continental case study comparing Florida, USA (Traffic Message Channel data), and Tallinn, Estonia (DATEX II data). We introduce a harmonization framework to align structurally different datasets and evaluate three machine learning models: Extreme Gradient Boosting (XGBoost), Random Forest, and Long Short-Term Memory (LSTM) networks under both intra-city and cross-city settings. Intra-city experiments achieve accuracies of up to 94%, with XGBoost consistently outperforming alternatives. However, direct cross-city transfer without retraining results in a 13–15% F1-score decline, evidencing substantial domain shift. Feature-level analysis reveals that speed deviations and travel time anomalies generalize across contexts, whereas flow and temporal encodings are city-specific. The proposed harmonization and evaluation methodology establishes a reproducible basis for benchmarking across heterogeneous environments, underscoring the importance of local adaptation while guiding the development of transferable Intelligent Transportation System (ITS) solutions.
KW - DATEX II
KW - LSTM networks
KW - cross-city model transfer
KW - data harmonization
KW - gradient boosting
KW - intelligent transportation systems (ITS)
KW - machine learning
KW - traffic incident prediction
KW - traffic message channel (TMC)
UR - https://www.scopus.com/pages/publications/105023638970
U2 - 10.1007/978-981-95-4721-0_21
DO - 10.1007/978-981-95-4721-0_21
M3 - Conference contribution
AN - SCOPUS:105023638970
SN - 9789819547203
T3 - Communications in Computer and Information Science
SP - 311
EP - 324
BT - Future Data and Security Engineering - 12th International Conference, FDSE 2025, Proceedings
A2 - Dang, Tran Khanh
A2 - Küng, Josef
A2 - Chung, Tai M.
PB - Springer Science and Business Media Deutschland GmbH
Y2 - 27 November 2025 through 29 November 2025
ER -