Abstract
The emergence of 6G-enabled Vehicle-to-Everything (V2X) networks has created unprecedented demand for ultra-reliable, low-latency spectrum allocation across heterogeneous entities including vehicles, IoT devices, and industrial systems. Current spectrum allocation methods suffer from exponential computational complexity, extensive information sharing requirements, and poor scalability in dense networks. This paper proposes AMADRL (Attention-based Multi-Agent Deep Reinforcement Learning), a novel framework employing dual critic networks with multi-head self-attention mechanisms for intelligent spectrum allocation. The dual critic architecture resolves individual-collective optimization conflicts through local critics for independent entity optimization and a global critic with attention-based coordination. Our approach significantly reduces information sharing requirements while handling heterogeneous QoS demands across diverse entity types. Comprehensive experimental evaluation comparing AMADRL against state-of-the-art baselines including MADDPG, MAAC, QMIX, attention-based methods (A-DDPG, MHA-DQN), and game-theoretic approaches reveals that AMADRL achieves superior performance across multiple metrics including spectrum utilization efficiency, interference mitigation, and network scalability, while preserving user privacy and satisfying strict latency constraints required by safety-critical and industrial use cases.
| Original language | English |
|---|---|
| Journal | IEEE Internet of Things Journal |
| DOIs | |
| Publication status | Accepted/In press - 2025 |
| Externally published | Yes |
Keywords
- 6G Networks
- Attention Mechanisms
- Deep Reinforcement Learning
- IoT
- Privacy-Preserving Networks
- Spectrum Allocation
- V2X
- Vehicular Communications
ASJC Scopus subject areas
- Signal Processing
- Information Systems
- Hardware and Architecture
- Computer Science Applications
- Computer Networks and Communications