Reinforcement learning based adaptive rate control for delay-constrained communications over fading channels

Xiaochen Li, Haibo He, Yu Dong Yao

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

5 Scopus citations

Abstract

In this paper, we study efficient rate control schemes for delay sensitive communications over wireless fading channels based on reinforcement learning. Our objective is to find a rate control scheme that optimizes the link layer performance, specifically, maximizes the system throughput subject to a fixed bit error rate (BER) constraint and longterm average power constraint. We assume the buffer at the transmitter is finite; hence packet drop happens when the buffer is full. We assume the fading channel under our study can be modeled as a finite state Markov chain, however the transition probability of channel states is not known, and the only information available about the wireless channel is the instantaneous channel gain, which is estimated and fed back from receiver side to the transmitter side on the fly. In this paper, we use reinforcement learning approach to learn the time-varying channel environment and search for the optimal control policy on line. Simulation results show that starting from an arbitrary control policy, the learning agent gradually modifies its estimation about the system model and adjusts the control policy to its optimality.

Original languageEnglish
Title of host publication2010 IEEE World Congress on Computational Intelligence, WCCI 2010 - 2010 International Joint Conference on Neural Networks, IJCNN 2010
DOIs
StatePublished - 2010
Event2010 6th IEEE World Congress on Computational Intelligence, WCCI 2010 - 2010 International Joint Conference on Neural Networks, IJCNN 2010 - Barcelona, Spain
Duration: 18 Jul 201023 Jul 2010

Publication series

NameProceedings of the International Joint Conference on Neural Networks

Conference

Conference2010 6th IEEE World Congress on Computational Intelligence, WCCI 2010 - 2010 International Joint Conference on Neural Networks, IJCNN 2010
Country/TerritorySpain
CityBarcelona
Period18/07/1023/07/10

Fingerprint

Dive into the research topics of 'Reinforcement learning based adaptive rate control for delay-constrained communications over fading channels'. Together they form a unique fingerprint.

Cite this