filmov
tv
AI Learns Parallel Parking - Deep Reinforcement Learning
Показать описание
Subscribe for more content like this:
Follow me on Twitter for more frequent updates on my projects:
Basically, the input of the Neural Network are the readings of eight depth sensors, the car's current speed and position, as well as its relative position to the target. The outputs of the Neural Network are interpreted as engine force, braking force and turning force. These outputs can be seen at the top right corner of the zoomed out camera shots.
The AI starts off with random behaviour, i.e. the Neural Network is initialized with random weights. It then gradually learns to solve the task by reacting to environment feedback accordingly. The environment tells the AI whether it is doing good or bad with positive or negative reward signals.
The training was done on a computer with an i5 (7th or 8th gen) and a GTX 1070 with 100x simulation speed, using 6 instances of the environment and up to 6 processes running in parallel.
Timelapse Music: "The Elevator Bossa Nova"
Outro: "All That"
#ArtificialIntelligence #MachineLearning #ReinforcementLearning #AI #NeuralNetworks #hostinger #inspeedwebelieve #speedfreak
Комментарии