<th id="5nh9l"></th><strike id="5nh9l"></strike><th id="5nh9l"><noframes id="5nh9l"><th id="5nh9l"></th><strike id="5nh9l"></strike>
<progress id="5nh9l"><noframes id="5nh9l"><th id="5nh9l"><noframes id="5nh9l">
<th id="5nh9l"></th> <strike id="5nh9l"><noframes id="5nh9l"><span id="5nh9l"></span>
<progress id="5nh9l"><noframes id="5nh9l"><span id="5nh9l"><noframes id="5nh9l"><span id="5nh9l"></span><strike id="5nh9l"><noframes id="5nh9l"><strike id="5nh9l"></strike>
<span id="5nh9l"><noframes id="5nh9l">
<span id="5nh9l"><noframes id="5nh9l">
<span id="5nh9l"></span><span id="5nh9l"><video id="5nh9l"></video></span>
<th id="5nh9l"><noframes id="5nh9l"><th id="5nh9l"></th>
<progress id="5nh9l"><noframes id="5nh9l">

基于深度強化學習算法的純電動礦用汽車再生制動策略研究

Regenerative braking strategy based on deep reinforcement learning for an electric mining truck

  • 摘要: 以載重50 t純電動礦用汽車為研究對象,提出了一種基于深度強化學習優化算法的再生制動回饋策略. 首先建立了純電動礦用自卸車的數學模型. 隨后提出了一種考慮載重和坡度變化的基于自動熵調節Soft actor-critic (SAC)和深度確定性策略梯度算法(DDPG)的能量管理策略. 其中車速、加速度、車輛質量與道路坡度、動力電池荷電狀態(SOC)及充放電倍率作為狀態變量;變速箱擋位作為動作變量;動力電池SOC及電池壽命作為獎勵函數. 仿真結果表明,基于動態規劃的控制策略和所提出的基于SAC算法與基于DDPG算法的優化控制策略回饋效率分別提高了18.15%、17.18%和16.63%,電池壽命分別提升了57.31%、56.87%和57.38%. 最后通過比較兩種基于深度強化學習算法策略的獎勵曲線,可以看出與基于DDPG算法的控制策略相比,所提出的基于SAC的能量管理控制策略的收斂速度提升了166.7%.

     

    Abstract: With the promotion of national “carbon neutral” and “green mine” strategies, pure electric mining vehicles are crucial in promoting energy conservation and emission reduction in the mining industry. However, “mileage anxiety” is the primary problem limiting their promotion and application. Regenerative braking is an essential technology for improving energy efficiency and reducing the life-cycle costs of pure electric vehicles. However, because of harsh driving conditions and substantial changes in load capacity and road slope, the scale and fluctuation characteristics of energy demand vary sharply during operation, affecting the feedback efficiency and battery life of an electric mining dump truck. Therefore, designing reasonable regenerative braking strategies for pure electric mining dump trucks is crucial. This paper uses a 50-ton pure electric mining truck as the research object and proposes a regenerative braking feedback strategy based on the deep reinforcement learning optimization algorithm. First, a mathematical model of a pure electric mining dump truck was established, which included a permanent magnet synchronous motor, power battery, four-speed automated mechanical transmission, and vehicle longitudinal dynamic model. Furthermore, power performance verification based on the Matlab/Simulink simulation platform was performed. Subsequently, an energy management strategy was proposed based on the soft actor–critic (SAC) algorithm and the deep deterministic strategy gradient (DDPG) deep reinforcement learning algorithm considering load and slope changes. In particular, the state variables include vehicle speed, acceleration, vehicle mass, road slope, battery state of charge (SOC), and battery charge–discharge rate. The transmission gear is selected as the action variable of the proposed strategy. Battery SOC and battery lifetime are used as reward functions. Furthermore, an automatic entropy adjustment mechanism is introduced to improve the adaptability of the proposed control strategy to different operating conditions. Simulation results show that compared to the rule-based control strategy, the energy efficiency of the control strategy based on dynamic programming and the proposed optimization control strategy based on the SAC and DDPG algorithms are improved by 18.15%, 17.18%, and 16.63%, respectively, and the battery lifetime is improved by 57.31%, 56.87%, and 57.38%, respect ively. Finally, the proposed energy management strategy is compared with the control strategy based on DDPG to further verify its superiority by comparing the reward curves. The results demonstrate the feasibility of the proposed control strategy based on the SAC algorithm, which has improved convergence speed by 166.7%.

     

/

返回文章
返回
<th id="5nh9l"></th><strike id="5nh9l"></strike><th id="5nh9l"><noframes id="5nh9l"><th id="5nh9l"></th><strike id="5nh9l"></strike>
<progress id="5nh9l"><noframes id="5nh9l"><th id="5nh9l"><noframes id="5nh9l">
<th id="5nh9l"></th> <strike id="5nh9l"><noframes id="5nh9l"><span id="5nh9l"></span>
<progress id="5nh9l"><noframes id="5nh9l"><span id="5nh9l"><noframes id="5nh9l"><span id="5nh9l"></span><strike id="5nh9l"><noframes id="5nh9l"><strike id="5nh9l"></strike>
<span id="5nh9l"><noframes id="5nh9l">
<span id="5nh9l"><noframes id="5nh9l">
<span id="5nh9l"></span><span id="5nh9l"><video id="5nh9l"></video></span>
<th id="5nh9l"><noframes id="5nh9l"><th id="5nh9l"></th>
<progress id="5nh9l"><noframes id="5nh9l">
259luxu-164