L2RPN Hackathon 2020 - Robustness Track
[ PwC US ] Vishakha Bansal, Prasang Gupta
We achieved $28^{th}$ rank in the hackathon. ( $21^{st}$ highest score )
The aim of L2RPN (Learning to Run a Power Network) 2020 hackathon - Robustness Track was to build a robust agent that would be keep delivering reliable electricity everywhere and also operate the grid safely when an agent takes unknown adversarial actions at regular intervals.
The dataset contained episodes spanning different adversarial actions taken by the agent. The agent could terminate any 1 of the 10 possible power lines (some of them being high voltage lines). Our agent was to be evaluated on how long it can provide reliable power to consumers without causing blackout. Once a blackout occurs, it is game over for that episode. The operation cost to be minimized included powerline losses, redispatch cost and blackout cost.
Every substation in the competition grid had a “double busbar layout”. Hence, there was a choice of bus for making a connection from one of the bus to a grid object. Due to this and the medium sized grid used, the action space was of the order of 10^5 with the combinatorial action space reaching infinity.
Our approach was derived from a previous public solution for this competition. THe idea was to reduce the action space and train 2 A3C models with different training parameters.
We managed to improve upon the baseline set and managed to achieve a score of 10.84, which was the $21^{st}$ highest score on the leaderboard, gaining us $28^{th}$ rank.