Training Started: 20251211_110643
Number of Episodes: 1000
Print Frequency: 10
Target Gap Height: 16.491741 mm
======================================================================

Ep   10 | Reward: -7123.0 | Length:  457 | R/step: -15.580 (-1558.0% of max) | Gap Error: 14.704 mm
Ep   20 | Reward: -6697.0 | Length:  418 | R/step: -16.025 (-1602.5% of max) | Gap Error: 11.138 mm
Ep   30 | Reward: -6050.2 | Length:  377 | R/step: -16.031 (-1603.1% of max) | Gap Error: 12.985 mm
Ep   40 | Reward: -7891.0 | Length:  500 | R/step: -15.782 (-1578.2% of max) | Gap Error: 11.726 mm
Ep   50 | Reward: -8254.2 | Length:  500 | R/step: -16.508 (-1650.8% of max) | Gap Error:  4.602 mm
Ep   60 | Reward: -8255.5 | Length:  500 | R/step: -16.511 (-1651.1% of max) | Gap Error:  4.537 mm
Ep   70 | Reward: -8185.8 | Length:  500 | R/step: -16.372 (-1637.2% of max) | Gap Error:  5.910 mm
Ep   80 | Reward: -7920.1 | Length:  500 | R/step: -15.840 (-1584.0% of max) | Gap Error: 11.089 mm
Ep   90 | Reward: -8262.8 | Length:  500 | R/step: -16.526 (-1652.6% of max) | Gap Error:  4.395 mm
Ep  100 | Reward: -8164.6 | Length:  500 | R/step: -16.329 (-1632.9% of max) | Gap Error:  6.340 mm
Ep  110 | Reward: -5294.0 | Length:  332 | R/step: -15.960 (-1596.0% of max) | Gap Error: 14.124 mm
Ep  120 | Reward: -6081.3 | Length:  375 | R/step: -16.230 (-1623.0% of max) | Gap Error: 11.482 mm
Ep  130 | Reward: -7346.6 | Length:  458 | R/step: -16.058 (-1605.8% of max) | Gap Error: 10.391 mm
Ep  140 | Reward: -7065.4 | Length:  458 | R/step: -15.427 (-1542.7% of max) | Gap Error: 15.974 mm
Ep  150 | Reward: -6516.7 | Length:  415 | R/step: -15.695 (-1569.5% of max) | Gap Error: 13.902 mm
Ep  160 | Reward: -6601.1 | Length:  417 | R/step: -15.822 (-1582.2% of max) | Gap Error: 13.542 mm
Ep  170 | Reward: -6440.3 | Length:  416 | R/step: -15.463 (-1546.3% of max) | Gap Error: 16.043 mm
Ep  180 | Reward: -5308.5 | Length:  336 | R/step: -15.799 (-1579.9% of max) | Gap Error: 15.176 mm
Ep  190 | Reward: -5847.7 | Length:  378 | R/step: -15.458 (-1545.8% of max) | Gap Error: 16.860 mm
Ep  200 | Reward: -5325.4 | Length:  333 | R/step: -16.007 (-1600.7% of max) | Gap Error: 14.364 mm
Ep  210 | Reward: -5893.1 | Length:  376 | R/step: -15.652 (-1565.2% of max) | Gap Error: 15.352 mm
Ep  220 | Reward: -7743.7 | Length:  500 | R/step: -15.487 (-1548.7% of max) | Gap Error: 14.608 mm
Ep  230 | Reward: -7180.3 | Length:  460 | R/step: -15.596 (-1559.6% of max) | Gap Error: 14.406 mm
Ep  240 | Reward: -7221.9 | Length:  459 | R/step: -15.741 (-1574.1% of max) | Gap Error: 13.200 mm
Ep  250 | Reward: -5970.0 | Length:  377 | R/step: -15.852 (-1585.2% of max) | Gap Error: 14.499 mm
Ep  260 | Reward: -7481.9 | Length:  462 | R/step: -16.212 (-1621.2% of max) | Gap Error:  8.666 mm
Ep  270 | Reward: -7189.7 | Length:  457 | R/step: -15.732 (-1573.2% of max) | Gap Error: 13.117 mm
Ep  280 | Reward: -6560.4 | Length:  418 | R/step: -15.710 (-1571.0% of max) | Gap Error: 14.139 mm
Ep  290 | Reward: -7423.1 | Length:  459 | R/step: -16.186 (-1618.6% of max) | Gap Error:  9.175 mm
Ep  300 | Reward: -7137.2 | Length:  458 | R/step: -15.594 (-1559.4% of max) | Gap Error: 14.544 mm
Ep  310 | Reward: -6457.1 | Length:  416 | R/step: -15.522 (-1552.2% of max) | Gap Error: 15.529 mm
Ep  320 | Reward: -7154.6 | Length:  459 | R/step: -15.598 (-1559.8% of max) | Gap Error: 14.440 mm
Ep  330 | Reward: -7814.2 | Length:  500 | R/step: -15.628 (-1562.8% of max) | Gap Error: 13.236 mm
Ep  340 | Reward: -7199.5 | Length:  457 | R/step: -15.744 (-1574.4% of max) | Gap Error: 13.286 mm
Ep  350 | Reward: -7234.7 | Length:  460 | R/step: -15.738 (-1573.8% of max) | Gap Error: 13.107 mm
Ep  360 | Reward: -7819.8 | Length:  500 | R/step: -15.640 (-1564.0% of max) | Gap Error: 13.160 mm
Ep  370 | Reward: -7148.2 | Length:  459 | R/step: -15.580 (-1558.0% of max) | Gap Error: 14.240 mm
Ep  380 | Reward: -7888.0 | Length:  500 | R/step: -15.776 (-1577.6% of max) | Gap Error: 11.788 mm
Ep  390 | Reward: -6321.7 | Length:  416 | R/step: -15.186 (-1518.6% of max) | Gap Error: 18.744 mm
Ep  400 | Reward: -7809.0 | Length:  500 | R/step: -15.618 (-1561.8% of max) | Gap Error: 13.359 mm
Ep  410 | Reward: -7004.8 | Length:  459 | R/step: -15.274 (-1527.4% of max) | Gap Error: 17.361 mm
Ep  420 | Reward: -7003.1 | Length:  455 | R/step: -15.391 (-1539.1% of max) | Gap Error: 15.613 mm
Ep  430 | Reward: -7280.0 | Length:  458 | R/step: -15.906 (-1590.6% of max) | Gap Error: 11.408 mm
Ep  440 | Reward: -7168.7 | Length:  460 | R/step: -15.594 (-1559.4% of max) | Gap Error: 13.981 mm
Ep  450 | Reward: -7605.7 | Length:  500 | R/step: -15.211 (-1521.1% of max) | Gap Error: 17.346 mm
Ep  460 | Reward: -6611.7 | Length:  417 | R/step: -15.871 (-1587.1% of max) | Gap Error: 12.581 mm
Ep  470 | Reward: -7086.7 | Length:  459 | R/step: -15.429 (-1542.9% of max) | Gap Error: 15.629 mm
Ep  480 | Reward: -7674.5 | Length:  500 | R/step: -15.349 (-1534.9% of max) | Gap Error: 16.008 mm
Ep  490 | Reward: -6481.8 | Length:  417 | R/step: -15.529 (-1552.9% of max) | Gap Error: 15.462 mm
Ep  500 | Reward: -7959.4 | Length:  500 | R/step: -15.919 (-1591.9% of max) | Gap Error: 10.456 mm
Ep  510 | Reward: -7088.4 | Length:  459 | R/step: -15.436 (-1543.6% of max) | Gap Error: 15.514 mm
Ep  520 | Reward: -7600.0 | Length:  500 | R/step: -15.200 (-1520.0% of max) | Gap Error: 17.460 mm
Ep  530 | Reward: -5979.7 | Length:  378 | R/step: -15.819 (-1581.9% of max) | Gap Error: 14.469 mm
Ep  540 | Reward: -6667.2 | Length:  416 | R/step: -16.038 (-1603.8% of max) | Gap Error: 11.886 mm
Ep  550 | Reward: -7165.7 | Length:  460 | R/step: -15.595 (-1559.5% of max) | Gap Error: 14.504 mm
Ep  560 | Reward: -7209.9 | Length:  458 | R/step: -15.739 (-1573.9% of max) | Gap Error: 12.949 mm
Ep  570 | Reward: -7745.6 | Length:  500 | R/step: -15.491 (-1549.1% of max) | Gap Error: 14.593 mm
Ep  580 | Reward: -7678.6 | Length:  500 | R/step: -15.357 (-1535.7% of max) | Gap Error: 15.895 mm
Ep  590 | Reward: -7107.0 | Length:  456 | R/step: -15.579 (-1557.9% of max) | Gap Error: 14.736 mm
Ep  600 | Reward: -7189.8 | Length:  458 | R/step: -15.712 (-1571.2% of max) | Gap Error: 13.386 mm
Ep  610 | Reward: -6914.9 | Length:  458 | R/step: -15.115 (-1511.5% of max) | Gap Error: 18.915 mm
Ep  620 | Reward: -7748.9 | Length:  500 | R/step: -15.498 (-1549.8% of max) | Gap Error: 14.607 mm
Ep  630 | Reward: -7894.4 | Length:  500 | R/step: -15.789 (-1578.9% of max) | Gap Error: 11.719 mm
Ep  640 | Reward: -6970.0 | Length:  457 | R/step: -15.255 (-1525.5% of max) | Gap Error: 17.182 mm
Ep  650 | Reward: -7746.9 | Length:  500 | R/step: -15.494 (-1549.4% of max) | Gap Error: 14.609 mm
Ep  660 | Reward: -7604.6 | Length:  500 | R/step: -15.209 (-1520.9% of max) | Gap Error: 17.373 mm
Ep  670 | Reward: -7747.7 | Length:  500 | R/step: -15.495 (-1549.5% of max) | Gap Error: 14.560 mm
Ep  680 | Reward: -6720.6 | Length:  419 | R/step: -16.043 (-1604.3% of max) | Gap Error: 11.076 mm
Ep  690 | Reward: -6998.9 | Length:  458 | R/step: -15.275 (-1527.5% of max) | Gap Error: 17.079 mm
Ep  700 | Reward: -7746.6 | Length:  500 | R/step: -15.493 (-1549.3% of max) | Gap Error: 14.625 mm
Ep  710 | Reward: -7894.9 | Length:  500 | R/step: -15.790 (-1579.0% of max) | Gap Error: 11.707 mm
Ep  720 | Reward: -7966.2 | Length:  500 | R/step: -15.932 (-1593.2% of max) | Gap Error: 10.268 mm
Ep  730 | Reward: -7966.1 | Length:  500 | R/step: -15.932 (-1593.2% of max) | Gap Error: 10.285 mm
Ep  740 | Reward: -7604.9 | Length:  500 | R/step: -15.210 (-1521.0% of max) | Gap Error: 17.376 mm
Ep  750 | Reward: -7673.6 | Length:  500 | R/step: -15.347 (-1534.7% of max) | Gap Error: 16.016 mm
Ep  760 | Reward: -7964.8 | Length:  500 | R/step: -15.930 (-1593.0% of max) | Gap Error: 10.311 mm
Ep  770 | Reward: -7747.0 | Length:  500 | R/step: -15.494 (-1549.4% of max) | Gap Error: 14.583 mm
Ep  780 | Reward: -7963.6 | Length:  500 | R/step: -15.927 (-1592.7% of max) | Gap Error: 10.346 mm
Ep  790 | Reward: -7209.1 | Length:  458 | R/step: -15.747 (-1574.7% of max) | Gap Error: 13.169 mm
Ep  800 | Reward: -7879.7 | Length:  500 | R/step: -15.759 (-1575.9% of max) | Gap Error: 11.951 mm
Ep  810 | Reward: -7960.0 | Length:  500 | R/step: -15.920 (-1592.0% of max) | Gap Error: 10.372 mm
Ep  820 | Reward: -7818.1 | Length:  500 | R/step: -15.636 (-1563.6% of max) | Gap Error: 13.177 mm
Ep  830 | Reward: -7965.3 | Length:  500 | R/step: -15.931 (-1593.1% of max) | Gap Error: 10.322 mm
Ep  840 | Reward: -7891.4 | Length:  500 | R/step: -15.783 (-1578.3% of max) | Gap Error: 11.716 mm
Ep  850 | Reward: -7883.5 | Length:  500 | R/step: -15.767 (-1576.7% of max) | Gap Error: 11.861 mm
Ep  860 | Reward: -7531.1 | Length:  500 | R/step: -15.062 (-1506.2% of max) | Gap Error: 18.853 mm
Ep  870 | Reward: -7601.2 | Length:  500 | R/step: -15.202 (-1520.2% of max) | Gap Error: 17.472 mm
Ep  880 | Reward: -7747.0 | Length:  500 | R/step: -15.494 (-1549.4% of max) | Gap Error: 14.594 mm
Ep  890 | Reward: -7600.6 | Length:  500 | R/step: -15.201 (-1520.1% of max) | Gap Error: 17.470 mm
Ep  900 | Reward: -7676.1 | Length:  500 | R/step: -15.352 (-1535.2% of max) | Gap Error: 15.984 mm
Ep  910 | Reward: -7750.4 | Length:  500 | R/step: -15.501 (-1550.1% of max) | Gap Error: 14.519 mm
Ep  920 | Reward: -7820.0 | Length:  500 | R/step: -15.640 (-1564.0% of max) | Gap Error: 13.170 mm
Ep  930 | Reward: -7884.7 | Length:  500 | R/step: -15.769 (-1576.9% of max) | Gap Error: 11.879 mm
Ep  940 | Reward: -7604.0 | Length:  500 | R/step: -15.208 (-1520.8% of max) | Gap Error: 17.410 mm
Ep  950 | Reward: -7677.5 | Length:  500 | R/step: -15.355 (-1535.5% of max) | Gap Error: 15.963 mm
Ep  960 | Reward: -7883.3 | Length:  500 | R/step: -15.767 (-1576.7% of max) | Gap Error: 11.895 mm
Ep  970 | Reward: -7818.1 | Length:  500 | R/step: -15.636 (-1563.6% of max) | Gap Error: 13.216 mm
Ep  980 | Reward: -7743.4 | Length:  500 | R/step: -15.487 (-1548.7% of max) | Gap Error: 14.655 mm
Ep  990 | Reward: -7960.6 | Length:  500 | R/step: -15.921 (-1592.1% of max) | Gap Error: 10.375 mm
Ep 1000 | Reward: -7601.5 | Length:  500 | R/step: -15.203 (-1520.3% of max) | Gap Error: 17.483 mm

======================================================================
Training Completed: 20251211_112339
