Autonomous Driving Model Training Report

Report Date: May 20, 2024 | Week 20

1. First Generated Dataset: Centered Driving + Turns

The main cause of the high repetition of images was the frequency at which images were generated from the driving log file. Once this parameter was reduced, the images were properly generated at a speed of 15 km/h and 20 fps (originally generated at 400x and reduced to 1x). Therefore, images were regenerated and the dataset was built with careful attention to avoid repetition in the system's input images.

The model training has just been completed with the following distribution into 5 categories: hard left, left, centered, right, and hard right as shown in Figure 1.

Figure 1: Distribution into 5 categories

Figure 1: Steering distribution in 5 categories

When this same distribution is displayed in 50 categories, as in Enrique's work, it appears as shown in Figure 2. It appears unbalanced because driving tends to the right. However, two improvements are noticeable:

Very few oscillations compared to previous versions (the lowest so far).
Correct lane driving for a longer period.

However, the model still does not learn to take curves adequately; generally, the steering command is issued before the curve begins.

Figure 2: Distribution into 50 categories

Figure 2: Raw steering distribution in 50 bins (unbalanced due to right-side driving tendency)

Video 1. Autonomos driver trained with dataset on Figure 1.

PilotNet Carla agent for 20,100 samples on Town04: good performance in two‑lane roads but less stable on highways.

2. Second Generated Dataset: Centered Driving + Augmentation (Turns + Lane Recovery)

Based on previous results, to mitigate the early curve-taking effect, lane recovery samples were added. Additionally, dataset augmentation was performed using mirroring for turn and recovery samples. As a result, the dataset profile shown in Figure 3 was obtained. The dataset thus achieves balancing and distribution close to that of Enrique Shirahara's work.

Figure 3: Balanced dataset distribution after augmentation

Figure 3: Balanced dataset distribution after mirror augmentation

Video 2. Autonomous driver trained with dataset on figure 2.

In this case, the following results were observed:

The system improves how turns are taken.
It corrects its position.
It maintains the central lane with low oscillations.

However, the following issues remain:

Oscillations cause driving in the center of the road, between the two lanes; this is possibly due to the mirroring process in dataset augmentation, as the system "seems confused" about which is the correct lane. One proposed solution is to avoid mirroring for recovery maneuvers.
The steering command is still issued early, causing the vehicle to turn and subsequently perform lane recovery, but resulting in curb invasion. This may be corrected by increasing the number of samples.

3. Proposed Work for Current Week

For the work of the current week, the following actions are proposed:

Refinement of the dataset by adjusting mirroring strategies (especially for recovery maneuvers).
Increase the number of samples for early-turn scenarios to reduce curb invasion.
Evaluation of models using BehaviorMetrics for quantitative performance analysis.
Implementation of the "Noise Injection" methodology to improve model robustness.

                Pending Tasks:
                Complete BehaviorMetrics integration for model comparison.
Implement and test Noise Injection in the training pipeline.
Validate refined dataset with additional driving scenarios.

            

Summary

Significant progress has been made in dataset generation and model training for autonomous driving. The first dataset showed improvements in oscillation reduction and lane keeping but suffered from early curve-taking. The second dataset, augmented with mirrored turns and lane recovery, achieved better balance and improved turn execution but introduced oscillation-related lane confusion. The upcoming work focuses on dataset refinement, model evaluation with BehaviorMetrics, and implementation of noise injection to enhance model generalization and robustness.