For reinforcement learning, an agent receives feedback in the form of( ). A、RewardB、StateC、PheromoneD、Fitness 发布时间:2025-07-31 17:46:44