Training Started: 20251211_102102
Number of Episodes: 1000
Print Frequency: 10
Target Gap Height: 16.491741 mm
======================================================================

Ep   10 | Reward: -7172.7 | Length:  500 | R/step: -14.345 (-1434.5% of max) | Gap Error: 16.068 mm
Ep   20 | Reward: -7031.0 | Length:  500 | R/step: -14.062 (-1406.2% of max) | Gap Error: 18.962 mm
Ep   30 | Reward: -7101.5 | Length:  500 | R/step: -14.203 (-1420.3% of max) | Gap Error: 17.537 mm
Ep   40 | Reward: -7031.1 | Length:  500 | R/step: -14.062 (-1406.2% of max) | Gap Error: 18.972 mm
Ep   50 | Reward: -7030.8 | Length:  500 | R/step: -14.062 (-1406.2% of max) | Gap Error: 18.949 mm
Ep   60 | Reward: -7031.1 | Length:  500 | R/step: -14.062 (-1406.2% of max) | Gap Error: 18.975 mm
Ep   70 | Reward: -7031.1 | Length:  500 | R/step: -14.062 (-1406.2% of max) | Gap Error: 18.974 mm
Ep   80 | Reward: -7031.2 | Length:  500 | R/step: -14.062 (-1406.2% of max) | Gap Error: 18.979 mm
Ep   90 | Reward: -7031.1 | Length:  500 | R/step: -14.062 (-1406.2% of max) | Gap Error: 18.962 mm
Ep  100 | Reward: -7031.1 | Length:  500 | R/step: -14.062 (-1406.2% of max) | Gap Error: 18.978 mm
Ep  110 | Reward: -7248.5 | Length:  500 | R/step: -14.497 (-1449.7% of max) | Gap Error: 14.626 mm
Ep  120 | Reward: -7164.7 | Length:  500 | R/step: -14.329 (-1432.9% of max) | Gap Error: 16.227 mm
Ep  130 | Reward: -7239.3 | Length:  500 | R/step: -14.479 (-1447.9% of max) | Gap Error: 14.752 mm
Ep  140 | Reward: -7611.2 | Length:  500 | R/step: -15.222 (-1522.2% of max) | Gap Error:  7.389 mm
Ep  150 | Reward: -7316.7 | Length:  500 | R/step: -14.633 (-1463.3% of max) | Gap Error: 13.257 mm
Ep  160 | Reward: -7657.2 | Length:  500 | R/step: -15.314 (-1531.4% of max) | Gap Error:  6.463 mm
Ep  170 | Reward: -7564.9 | Length:  500 | R/step: -15.130 (-1513.0% of max) | Gap Error:  8.263 mm
Ep  180 | Reward: -7320.2 | Length:  500 | R/step: -14.640 (-1464.0% of max) | Gap Error: 13.172 mm
Ep  190 | Reward: -7601.5 | Length:  500 | R/step: -15.203 (-1520.3% of max) | Gap Error:  7.545 mm
Ep  200 | Reward: -7311.8 | Length:  500 | R/step: -14.624 (-1462.4% of max) | Gap Error: 13.311 mm
Ep  210 | Reward: -7100.9 | Length:  500 | R/step: -14.202 (-1420.2% of max) | Gap Error: 17.572 mm
Ep  220 | Reward: -7170.8 | Length:  500 | R/step: -14.342 (-1434.2% of max) | Gap Error: 16.143 mm
Ep  230 | Reward: -7241.1 | Length:  500 | R/step: -14.482 (-1448.2% of max) | Gap Error: 14.729 mm
Ep  240 | Reward: -7310.3 | Length:  500 | R/step: -14.621 (-1462.1% of max) | Gap Error: 13.338 mm
Ep  250 | Reward: -7244.1 | Length:  500 | R/step: -14.488 (-1448.8% of max) | Gap Error: 14.699 mm
Ep  260 | Reward: -7172.7 | Length:  500 | R/step: -14.345 (-1434.5% of max) | Gap Error: 16.127 mm
Ep  270 | Reward: -7031.2 | Length:  500 | R/step: -14.062 (-1406.2% of max) | Gap Error: 18.962 mm
Ep  280 | Reward: -7031.2 | Length:  500 | R/step: -14.062 (-1406.2% of max) | Gap Error: 18.977 mm
Ep  290 | Reward: -7031.3 | Length:  500 | R/step: -14.063 (-1406.3% of max) | Gap Error: 18.967 mm
Ep  300 | Reward: -7031.4 | Length:  500 | R/step: -14.063 (-1406.3% of max) | Gap Error: 18.965 mm
Ep  310 | Reward: -7031.3 | Length:  500 | R/step: -14.063 (-1406.3% of max) | Gap Error: 18.927 mm
Ep  320 | Reward: -7031.3 | Length:  500 | R/step: -14.063 (-1406.3% of max) | Gap Error: 18.980 mm
Ep  330 | Reward: -7031.4 | Length:  500 | R/step: -14.063 (-1406.3% of max) | Gap Error: 18.957 mm
Ep  340 | Reward: -7101.4 | Length:  500 | R/step: -14.203 (-1420.3% of max) | Gap Error: 17.557 mm
Ep  350 | Reward: -7102.1 | Length:  500 | R/step: -14.204 (-1420.4% of max) | Gap Error: 17.547 mm
Ep  360 | Reward: -7312.6 | Length:  500 | R/step: -14.625 (-1462.5% of max) | Gap Error: 13.311 mm
Ep  370 | Reward: -7315.5 | Length:  500 | R/step: -14.631 (-1463.1% of max) | Gap Error: 13.255 mm
Ep  380 | Reward: -7380.4 | Length:  500 | R/step: -14.761 (-1476.1% of max) | Gap Error: 11.917 mm
Ep  390 | Reward: -7171.6 | Length:  500 | R/step: -14.343 (-1434.3% of max) | Gap Error: 16.108 mm
Ep  400 | Reward: -7244.8 | Length:  500 | R/step: -14.490 (-1449.0% of max) | Gap Error: 14.659 mm
Ep  410 | Reward: -7030.6 | Length:  500 | R/step: -14.061 (-1406.1% of max) | Gap Error: 18.943 mm
Ep  420 | Reward: -7169.8 | Length:  500 | R/step: -14.340 (-1434.0% of max) | Gap Error: 16.169 mm
Ep  430 | Reward: -7381.3 | Length:  500 | R/step: -14.763 (-1476.3% of max) | Gap Error: 11.908 mm
Ep  440 | Reward: -7306.4 | Length:  500 | R/step: -14.613 (-1461.3% of max) | Gap Error: 13.356 mm
Ep  450 | Reward: -7099.3 | Length:  500 | R/step: -14.199 (-1419.9% of max) | Gap Error: 17.564 mm
Ep  460 | Reward: -7031.0 | Length:  500 | R/step: -14.062 (-1406.2% of max) | Gap Error: 18.969 mm
Ep  470 | Reward: -7030.5 | Length:  500 | R/step: -14.061 (-1406.1% of max) | Gap Error: 18.934 mm
Ep  480 | Reward: -7100.7 | Length:  500 | R/step: -14.201 (-1420.1% of max) | Gap Error: 17.549 mm
Ep  490 | Reward: -7169.3 | Length:  500 | R/step: -14.339 (-1433.9% of max) | Gap Error: 16.184 mm
Ep  500 | Reward: -7101.3 | Length:  500 | R/step: -14.203 (-1420.3% of max) | Gap Error: 17.559 mm
Ep  510 | Reward: -7168.6 | Length:  500 | R/step: -14.337 (-1433.7% of max) | Gap Error: 16.185 mm
Ep  520 | Reward: -7385.4 | Length:  500 | R/step: -14.771 (-1477.1% of max) | Gap Error: 11.860 mm
Ep  530 | Reward: -7386.8 | Length:  500 | R/step: -14.774 (-1477.4% of max) | Gap Error: 11.816 mm
Ep  540 | Reward: -7173.0 | Length:  500 | R/step: -14.346 (-1434.6% of max) | Gap Error: 16.121 mm
Ep  550 | Reward: -7244.3 | Length:  500 | R/step: -14.489 (-1448.9% of max) | Gap Error: 14.693 mm
Ep  560 | Reward: -7242.9 | Length:  500 | R/step: -14.486 (-1448.6% of max) | Gap Error: 14.731 mm
Ep  570 | Reward: -7315.1 | Length:  500 | R/step: -14.630 (-1463.0% of max) | Gap Error: 13.236 mm
Ep  580 | Reward: -7243.6 | Length:  500 | R/step: -14.487 (-1448.7% of max) | Gap Error: 14.684 mm
Ep  590 | Reward: -7031.1 | Length:  500 | R/step: -14.062 (-1406.2% of max) | Gap Error: 18.979 mm
Ep  600 | Reward: -7168.0 | Length:  500 | R/step: -14.336 (-1433.6% of max) | Gap Error: 16.203 mm
Ep  610 | Reward: -7169.4 | Length:  500 | R/step: -14.339 (-1433.9% of max) | Gap Error: 16.183 mm
Ep  620 | Reward: -7102.4 | Length:  500 | R/step: -14.205 (-1420.5% of max) | Gap Error: 17.536 mm
Ep  630 | Reward: -7166.3 | Length:  500 | R/step: -14.333 (-1433.3% of max) | Gap Error: 16.216 mm
Ep  640 | Reward: -7030.7 | Length:  500 | R/step: -14.061 (-1406.1% of max) | Gap Error: 18.949 mm
Ep  650 | Reward: -7031.0 | Length:  500 | R/step: -14.062 (-1406.2% of max) | Gap Error: 18.966 mm
Ep  660 | Reward: -7098.9 | Length:  500 | R/step: -14.198 (-1419.8% of max) | Gap Error: 17.581 mm
Ep  670 | Reward: -7236.5 | Length:  500 | R/step: -14.473 (-1447.3% of max) | Gap Error: 14.809 mm
Ep  680 | Reward: -7174.4 | Length:  500 | R/step: -14.349 (-1434.9% of max) | Gap Error: 16.106 mm
Ep  690 | Reward: -7314.2 | Length:  500 | R/step: -14.628 (-1462.8% of max) | Gap Error: 13.294 mm
Ep  700 | Reward: -7102.0 | Length:  500 | R/step: -14.204 (-1420.4% of max) | Gap Error: 17.539 mm
Ep  710 | Reward: -7314.9 | Length:  500 | R/step: -14.630 (-1463.0% of max) | Gap Error: 13.277 mm
Ep  720 | Reward: -7310.9 | Length:  500 | R/step: -14.622 (-1462.2% of max) | Gap Error: 13.335 mm
Ep  730 | Reward: -7315.2 | Length:  500 | R/step: -14.630 (-1463.0% of max) | Gap Error: 13.275 mm
Ep  740 | Reward: -7386.0 | Length:  500 | R/step: -14.772 (-1477.2% of max) | Gap Error: 11.842 mm
Ep  750 | Reward: -7300.9 | Length:  500 | R/step: -14.602 (-1460.2% of max) | Gap Error: 13.480 mm
Ep  760 | Reward: -7311.7 | Length:  500 | R/step: -14.623 (-1462.3% of max) | Gap Error: 13.314 mm
Ep  770 | Reward: -7382.8 | Length:  500 | R/step: -14.766 (-1476.6% of max) | Gap Error: 11.885 mm
Ep  780 | Reward: -7529.6 | Length:  500 | R/step: -15.059 (-1505.9% of max) | Gap Error:  8.946 mm
Ep  790 | Reward: -7523.1 | Length:  500 | R/step: -15.046 (-1504.6% of max) | Gap Error:  9.063 mm
Ep  800 | Reward: -7391.4 | Length:  500 | R/step: -14.783 (-1478.3% of max) | Gap Error: 11.758 mm
Ep  810 | Reward: -7531.9 | Length:  500 | R/step: -15.064 (-1506.4% of max) | Gap Error:  8.915 mm
Ep  820 | Reward: -7462.2 | Length:  500 | R/step: -14.924 (-1492.4% of max) | Gap Error: 10.318 mm
Ep  830 | Reward: -7214.0 | Length:  500 | R/step: -14.428 (-1442.8% of max) | Gap Error: 15.251 mm
Ep  840 | Reward: -7426.1 | Length:  500 | R/step: -14.852 (-1485.2% of max) | Gap Error: 10.944 mm
Ep  850 | Reward: -7460.9 | Length:  500 | R/step: -14.922 (-1492.2% of max) | Gap Error: 10.270 mm
Ep  860 | Reward: -7663.9 | Length:  500 | R/step: -15.328 (-1532.8% of max) | Gap Error:  6.237 mm
Ep  870 | Reward: -7437.2 | Length:  500 | R/step: -14.874 (-1487.4% of max) | Gap Error: 10.765 mm
Ep  880 | Reward: -6530.8 | Length:  421 | R/step: -15.524 (-1552.4% of max) | Gap Error:  7.413 mm
Ep  890 | Reward: -7696.3 | Length:  500 | R/step: -15.393 (-1539.3% of max) | Gap Error:  5.736 mm
Ep  900 | Reward: -7732.4 | Length:  500 | R/step: -15.465 (-1546.5% of max) | Gap Error:  5.083 mm
Ep  910 | Reward: -7746.3 | Length:  500 | R/step: -15.493 (-1549.3% of max) | Gap Error:  4.839 mm
Ep  920 | Reward: -7745.4 | Length:  500 | R/step: -15.491 (-1549.1% of max) | Gap Error:  4.833 mm
Ep  930 | Reward: -7749.5 | Length:  500 | R/step: -15.499 (-1549.9% of max) | Gap Error:  4.838 mm
Ep  940 | Reward: -5944.1 | Length:  383 | R/step: -15.536 (-1553.6% of max) | Gap Error:  9.080 mm
Ep  950 | Reward: -4103.1 | Length:  261 | R/step: -15.739 (-1573.9% of max) | Gap Error: 12.558 mm
Ep  960 | Reward: -4697.1 | Length:  303 | R/step: -15.517 (-1551.7% of max) | Gap Error: 12.987 mm
Ep  970 | Reward: -4455.6 | Length:  288 | R/step: -15.455 (-1545.5% of max) | Gap Error: 14.009 mm
Ep  980 | Reward: -3854.4 | Length:  251 | R/step: -15.356 (-1535.6% of max) | Gap Error: 14.805 mm
Ep  990 | Reward: -5979.5 | Length:  415 | R/step: -14.405 (-1440.5% of max) | Gap Error: 17.115 mm
Ep 1000 | Reward: -7031.2 | Length:  500 | R/step: -14.062 (-1406.2% of max) | Gap Error: 18.909 mm

======================================================================
Training Completed: 20251211_103854
