privacy protection; Markov decision process; quality of service; SARSA reinforcement learning