Originally published 31 Aug 20
Here is an excerpt:
The AlphaDogfight simulation on Aug. 20 was an important milestone for AI and its potential military uses. While this achievement shows that AI can master increasingly difficult combat skills at warp speed, the Pentagon’s futurists still must remain mindful of its limitations and risks — both because AI remains long away from eclipsing the human mind in many critical decision-making roles, despite what the likes of Elon Musk have warned, and to make sure we don’t race ahead of ourselves and inadvertently leave the military exposed to new threats.
That’s not to minimize this latest development. Within the scope of the simulation, the AI pilot exceeded human limitations in the tournament: It was able to consistently execute accurate shots in very short timeframes; consistently push the airframe’s tolerance of the force of gravity to its maximum potential without going beyond that; and remain unaffected by the crushing pressure exerted by violent maneuvers the way a human pilot would.
All the more remarkable, Heron’s AI pilot was self-taught using deep reinforcement learning, a method in which an AI runs a combat simulation over and over again and is “rewarded” for rapidly successful behaviors and “punished” for failure.
I emboldened the last sentence because of its importance.