Abstract: Reinforcement learning (RL)’s powerful optimization capabilities have been extensively applied in the field of wireless communication jamming decision-making. However, the generalization of ...