{"id":763,"date":"2019-06-14T12:11:04","date_gmt":"2019-06-14T10:11:04","guid":{"rendered":"https:\/\/blog.besharp.it\/deepracer-our-journey-to-the-top-ten\/"},"modified":"2021-03-29T17:43:52","modified_gmt":"2021-03-29T15:43:52","slug":"deepracer-our-journey-to-the-top-ten","status":"publish","type":"post","link":"https:\/\/blog.besharp.it\/deepracer-our-journey-to-the-top-ten\/","title":{"rendered":"DeepRacer: our journey to the top ten!"},"content":{"rendered":"


In the last few years, Las Vegas<\/strong> has become the reference point for AWS Cloud events. We have seen first-hand re:Invent grow from 6,000 participants in 2012 to over 40,000 last year: an immense event, in which simply choosing which sessions to attend has become a challenge! It must also be for this reason that this year AWS decided to complement its main event with conferences with a more specific focus. The first of these, AWS re:MARS,<\/strong> was created around the hottest topics of the moment: M<\/strong>achine Learning, A<\/strong>utomation, R<\/strong>obotics and S<\/strong>pace.<\/p>\n

beSharp – obviously – could not miss it.<\/strong><\/p>\n

Many big names were present as keynote speakers: Jeff Bezos, Werner Vogels,<\/strong> Coursera co-founder Andrew Ng,<\/strong> iRobot CEO and founder Colin M. Angle<\/strong> and … Robert Downey Jr.!<\/strong> Who better than “Iron Man” to talk about the technological wonders that will radically change our lives in the coming years? Robert himself is, among other things, the co-financier of Footprint Coalition, a private organization created with the aim of cleaning up our planet through robotics and cutting-edge technologies.<\/p>\n

Many sessions<\/strong> were held by disruptive companies presenting innovations made possible by artificial intelligence: oil & gas companies, private space companies launching artificial satellites and, above all, the incredible Amazon GO, the chain of Amazon stores where you can shop and leave without going through a cash register. As the motto says, “no lines, no checkout. No, seriously!”<\/strong>: thanks to machine learning techniques and simulations in 3D environments, anyone who enters a store is tracked from the entrance, so that their actions and the items taken from the shelves are recorded; upon exiting the store, the Amazon GO<\/strong> system processes the “cart” and sends the invoice directly to the user’s personal Amazon profile. An incredible experience!<\/p>\n

While the official sessions only started on June 5th, right from the first day it was possible to participate in\u00a0workshops<\/strong>\u00a0on some specific topics; we immediately identified one that particularly excited our nerd fantasies:\u00a0a deep-dive on AWS DeepRacer!<\/strong><\/p>\n

The workshop really impressed us: introduced during Andy Jassy’s re:Invent 2018 keynote, this 4WD model<\/strong> with monster-truck axles is able to learn how to move autonomously on predetermined tracks through Reinforcement Learning.<\/strong> Described by AWS as the easiest way to learn Machine Learning, AWS DeepRacer keeps all its promises: the series of steps needed to get on track and watch your car run is truly minimal. It is possible to have a model trained for driving in just under an hour,<\/strong> although, obviously, more experiments and much more time are needed to get good results.<\/p>\n

We immediately experimented with as many options as possible, improving our lap time iteration after iteration. Among other things, re:MARS is one of the stops of the DeepRacer League,<\/strong> a competition that takes place in conjunction with the main AWS events.<\/p>\n

What better opportunity to learn directly in the field?<\/p>\n

How AWS DeepRacer and Reinforcement Learning work<\/strong><\/h2>\n

Before starting to talk about racing and record times, it is worth taking a look at the interface of the AWS DeepRacer service,<\/strong> which is the model training tool. It seems silly to point out, but it is essential to have an AWS account!<\/p>\n

As soon as you enter your console, click on the services bar and search for “DeepRacer”.<\/p>\n


From the home screen, you can see your models, check the status of their training, and create new ones.<\/p>\n


To begin, let\u2019s create a new model by clicking on\u00a0“Create model”.<\/strong><\/p>\n

This screen presents the features of the model and also checks whether the account has all the permissions needed to save it correctly.<\/p>\n


In case there is anything to fix, AWS will notify you and help you correct it.<\/p>\n

We enter a name and a description:<\/strong> choose a name that is easy to remember and, above all, unique because, if you want to compete in an official race, you will be asked to transfer your model to the scale race car via a USB key and then to identify it, among those loaded, through an app on the track marshal’s iPad.<\/p>\n

We choose a track on which to train the model:<\/strong> we selected the first one, “re:Invent 2018”, which is the official circuit of the DeepRacer League. You can try any available track.<\/p>\n


Once the training track has been selected, it is time to create the reward function<\/strong> with which we will train the model. This step is essential to obtain a well-performing car and good scores in the races.<\/p>\n

Before telling you about our experience, it is useful to briefly reiterate how\u00a0Reinforcement Learning works.<\/strong><\/p>\n

Reinforcement Learning is a training technique for unsupervised neural networks,<\/strong> that is, neural networks that do not need an initial ground truth against which to adapt their weights. Instead, the agent repeatedly measures the surrounding environment and acts so as to maximize its reward function. During this process, which is repeated until a cutoff threshold is reached, the weights of the network are updated at each iteration, thus optimizing the network itself.<\/p>\n
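The loop described above can be sketched in a few lines of Python. This is a toy illustration only: the ten integer “states”, the made-up reward and the epsilon-greedy update below are our own placeholders, not the algorithm AWS DeepRacer actually runs under the hood.<\/p>\n

```python
import random

random.seed(0)  # make the toy run deterministic

def toy_reward(state, action):
    # Made-up reward: action 1 is best in odd states, action 0 in even ones
    return 1.0 if action == state % 2 else 0.0

def train(episodes=2000, epsilon=0.3, lr=0.5):
    q = {}  # action-value estimates: the "weights" being optimized
    for _ in range(episodes):
        state = random.randint(0, 9)               # measure the environment
        if random.random() < epsilon:              # sometimes explore...
            action = random.randint(0, 1)
        else:                                      # ...otherwise exploit
            action = max((0, 1), key=lambda a: q.get((state, a), 0.0))
        r = toy_reward(state, action)              # collect the reward signal
        old = q.get((state, action), 0.0)
        q[(state, action)] = old + lr * (r - old)  # nudge the estimates
    return q

q = train()
# The learned greedy policy per state, after training
best = {s: max((0, 1), key=lambda a: q.get((s, a), 0.0)) for s in range(10)}
```

Even this tiny loop shows the key ingredients: measuring the environment, acting, collecting a reward and updating the weights toward whatever maximizes it.<\/p>\n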

In the case of the DeepRacer car, we started with a very simple reward function whose goal is to teach the car to stay in the middle of the track: at the time of each measurement, the smaller the distance from the center of the roadway relative to half the width of the road, the higher the returned reward value; in all other cases, the reward is reduced.<\/p>\n

Below is an example of how to construct the function:<\/p>\n

import math\r\n\r\ndef reward_function(params):\r\n    '''\r\n    Use square root for center line\r\n    '''\r\n    track_width = params['track_width']\r\n    distance_from_center = params['distance_from_center']\r\n\r\n    # Reward decays with the square root of the normalized distance from center\r\n    reward = 1 - math.sqrt(distance_from_center \/ (track_width \/ 2))\r\n    if reward < 0:\r\n        reward = 0\r\n\r\n    return float(reward)<\/pre>\n

We choose the degrees of freedom of our 4WD: maximum speed, steering angle and possible speed levels.<\/strong> The combinations of these values define the action space: how many distinct steering and speed actions the car is able to choose from.<\/p>\n
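To make the idea concrete, here is a small sketch of how such settings could expand into a discrete action space; the numbers (a 30-degree maximum steering angle, 3 m/s top speed, granularities of 5 and 3) are illustrative values of our own, not settings taken from the article.<\/p>\n

```python
from itertools import product

def build_action_space(max_steering=30, steering_levels=5,
                       max_speed=3.0, speed_levels=3):
    # Steering angles spread symmetrically from -max to +max
    step = 2 * max_steering / (steering_levels - 1)
    angles = [-max_steering + i * step for i in range(steering_levels)]
    # Speeds spread evenly from max/levels up to max
    speeds = [max_speed * (i + 1) / speed_levels for i in range(speed_levels)]
    # Every (angle, speed) pair becomes one discrete action
    return [{"steering_angle": a, "speed": s} for a, s in product(angles, speeds)]

actions = build_action_space()
print(len(actions))  # 5 steering levels x 3 speed levels = 15 actions
```

A bigger action space gives the car finer control but also makes training slower, since there are more actions to explore.<\/p>\n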


The action space is strongly dependent on the reward function and vice versa: the same reward function, with different degrees of freedom, often produces very different results.<\/p>\n


Once this information is entered, you can decide how many hours to train the model, up to a maximum of 8 hours per single training job.<\/p>\n

It is useful to know that it is possible to further re-train the same model<\/strong> to increase its degree of confidence. What we have verified is that, with a total training time of around 8-10 hours, it is possible to give the car a certain confidence on the track, provided you keep the model simple.<\/p>\n

We perform some confidence tests on the function described above: from the main screen of the model, we click on “Start new evaluation”<\/strong> and choose the number of “trials” on the track; with three trials, the results are the following:<\/p>\n


Not bad as a first result, but we certainly could not stop at 23 seconds! Here, then, are the different variables that DeepRacer provides for shaping the reward function:<\/p>\n

{\r\n    "all_wheels_on_track": Boolean,     # flag to indicate if the vehicle is on the track\r\n    "x": float,                         # vehicle's x-coordinate in meters\r\n    "y": float,                         # vehicle's y-coordinate in meters\r\n    "distance_from_center": float,      # distance in meters from the track center\r\n    "is_left_of_center": Boolean,       # flag to indicate if the vehicle is left of the track center\r\n    "heading": float,                   # vehicle's yaw in degrees\r\n    "progress": float,                  # percentage of track completed\r\n    "steps": int,                       # number of steps completed\r\n    "speed": float,                     # vehicle's speed in meters per second (m\/s)\r\n    "steering_angle": float,            # vehicle's steering angle in degrees\r\n    "track_width": float,               # width of the track\r\n    "waypoints": [[float, float], ...], # list of [x, y] milestones along the track center\r\n    "closest_waypoints": [int, int]     # indices of the two nearest waypoints\r\n}<\/pre>\n

Let\u2019s try to add some of this information to our reward function:<\/p>\n

def reward_function(params):\r\n    '''\r\n    Penalize distance from the center line, sharp steering and going off track\r\n    '''\r\n    track_width = params['track_width']\r\n    distance_from_center = params['distance_from_center']\r\n    steering = abs(params['steering_angle'])\r\n    speed = params['speed']\r\n    all_wheels_on_track = params['all_wheels_on_track']\r\n    ABS_STEERING_THRESHOLD = 15\r\n\r\n    # Quartic falloff: the reward drops quickly as the car leaves the center\r\n    reward = 1 - (distance_from_center \/ (track_width \/ 2)) ** 4\r\n    if reward < 0:\r\n        reward = 0\r\n\r\n    # Penalize excessive steering to discourage zig-zagging\r\n    if steering > ABS_STEERING_THRESHOLD:\r\n        reward *= 0.8\r\n\r\n    # No reward at all if the car leaves the track\r\n    if not all_wheels_on_track:\r\n        reward = 0\r\n\r\n    return float(reward)<\/pre>\n

In particular, we added the “steering_angle”, the “speed” and the Boolean variable “all_wheels_on_track”, which tells us whether, at a given moment, the car still has all of its wheels on the track.<\/p>\n
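To see how these penalties interact, we can call the function with a few hand-built params dictionaries; the measurement values are made up for illustration, and the reward logic is repeated in condensed form so the snippet runs on its own.<\/p>\n

```python
# Same reward logic as the function above, condensed to be self-contained
def reward_function(params):
    reward = 1 - (params['distance_from_center'] / (params['track_width'] / 2)) ** 4
    reward = max(reward, 0.0)
    if abs(params['steering_angle']) > 15:
        reward *= 0.8          # penalize sharp steering
    if not params['all_wheels_on_track']:
        reward = 0.0           # going off track zeroes the reward
    return float(reward)

# Hand-built sample measurements (illustrative values only)
centered = {'track_width': 0.6, 'distance_from_center': 0.0,
            'steering_angle': 0.0, 'all_wheels_on_track': True}
swerving = dict(centered, steering_angle=25.0)
off_track = dict(centered, all_wheels_on_track=False)

print(reward_function(centered))   # 1.0  (perfectly centered, gentle steering)
print(reward_function(swerving))   # 0.8  (centered but steering too sharply)
print(reward_function(off_track))  # 0.0  (off the track: no reward at all)
```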

If we look at the code, we see that the reward function, after being calculated with respect to the position relative to the center of the track, is modified as follows:<\/p>\n