Abstract
In this paper, the minimization of the weighted sum average age of information (AoI) in a multi-source status update communication system is studied. Multiple independent sources send update packets to a common destination node in a time-slotted manner under the limit of maximum retransmission rounds. Different multiple access schemes, i.e., time-division multiple access (TDMA) and non-orthogonal multiple access (NOMA), are exploited here over a block-fading multiple access channel (MAC). Constrained Markov decision process (CMDP) problems are formulated to describe the AoI minimization problems considering both transmission schemes. The Lagrangian method is used to convert CMDP problems to unconstrained Markov decision process (MDP) problems, and corresponding algorithms are designed to derive the power allocation policies. Also, a suboptimal threshold-based policy is proposed. On the other hand, for the case of unknown environments, two online reinforcement learning approaches considering both multiple access schemes are proposed to achieve near-optimal age performance. Numerical simulations validate the improvement of the proposed policy in terms of weighted sum AoI compared to the fixed power transmission policy and illustrate that NOMA is more favorable in the case of larger packet sizes.
| Original language | English |
|---|---|
| Pages (from-to) | 5531-5545 |
| Number of pages | 15 |
| Journal | IEEE Transactions on Vehicular Technology |
| Volume | 73 |
| Issue number | 4 |
| DOIs | |
| State | Published - 1 Apr 2024 |
Keywords
- Age of information (AoI)
- constrained Markov decision process (CMDP)
- non-orthogonal multiple access (NOMA)
- power allocation
- reinforcement learning