14.2.3. Setting the reward

When TurtleBot3 takes an action in a state, it receives a reward. Reward design is very important for learning. A reward can be positive or negative. When TurtleBot3 reaches the goal, it receives a large positive reward. When TurtleBot3 collides with an obstacle, it receives a large negative reward. To apply your own reward design, modify the setReward function in /turtlebot3_machine_learning/turtlebot3_dqn/src/turtlebot3_dqn/environment_stage_#.py.
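The reward scheme described above can be illustrated with a minimal, standalone Python sketch. It is not the actual setReward implementation from the repository. The function name `compute_reward`, its parameters, and the constants `GOAL_REWARD` and `COLLISION_REWARD` are hypothetical, and the numeric values are assumptions chosen only to show a large positive reward at the goal and a large negative reward on collision.

```python
# Hypothetical reward sketch. All names and values here are illustrative
# assumptions and are not taken from environment_stage_#.py.

GOAL_REWARD = 200.0        # assumed large positive reward for reaching the goal
COLLISION_REWARD = -200.0  # assumed large negative reward for hitting an obstacle


def compute_reward(reached_goal, collided, step_reward=0.0):
    """Return the reward for a single step.

    reached_goal: True if the robot arrived at the goal in this step.
    collided:     True if the robot hit an obstacle in this step.
    step_reward:  reward for an ordinary step (neither goal nor collision).
    """
    if collided:
        # A collision is checked first, so it always yields the penalty.
        return COLLISION_REWARD
    if reached_goal:
        return GOAL_REWARD
    return step_reward


if __name__ == "__main__":
    print(compute_reward(reached_goal=True, collided=False))
    print(compute_reward(reached_goal=False, collided=True))
    print(compute_reward(reached_goal=False, collided=False, step_reward=0.5))
```

In a custom design, the `step_reward` argument is where a shaping term would go, for example one that grows as the robot gets closer to the goal. How that term is computed is a design choice, and the sketch leaves it to the caller.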


