MG Portfolio

Projects

Delve into a collection of my notable projects, each representing a fusion of creativity and skill.

2023

Perception-Enabled Pure Pursuit for Small Scale Racing Cars

TUe-logo-scarlet-L-1
Associated with TU/e
Feb 2023 - Jun 2023
Reinforcement Learning
Object Detection
Python
R.O.S

This research project delves into the creation of an autonomous agent for a 1:10 scale racing car, designed to efficiently navigate unknown racetracks for the best lap times. The agent uses YOLO for cone detection and a 2D cumulative map for environment representation. Two navigation methods were examined: the Pure Pursuit algorithm, modified to adapt to track curvature for improved speed and resilience, and the Deep Deterministic Policy Gradient (DDPG) method, which is explored theoretically. Experiments revealed that the real-time cone detection was efficient enough to replace hand-placed waypoints without performance loss. The enhanced Pure Pursuit achieved faster speeds, pushing the vehicle to its full potential. The study paves the way for future advancements in small-scale autonomous racing.

Depth-Aware Video Panoptic Segmentation & Monocular Depth Estimation

TUe-logo-scarlet-L-1
Associated with TU/e
Apr 2023 - Jun 2023
Deep Learning
Multi-task Neural Networks
Depth Estimation
Video Panoptic Segmentation
Python
High Performance Computing
In this study, we explore depth-aware video panoptic segmentation (DVPS) focusing on MonoDVPS, a state-of-the-art architecture. Specifically, in our quest of reproducing this architecture, we introduce MonoDVPSLite, a sub-network that excludes pose and instance tracking modules, focusing on depth estimation for single image frames. This paper’s primary contributions include investigating the impact of geometric reprojections on depth estimation by contrasting MonoDVPSLite and MonoDVPS, and implementing an enhanced Gaussian center heatmap for instance masks in MonoDVPSLite. Results highlight challenges in depth estimation and class imbalance issues in panoptic masks. This project was implemented in the context of the Advanced Sensing using Deep Learning course at TU/e.

Semantic Segmentation using a UNet-based Architecture

TUe-logo-scarlet-L-1
Associated with TU/e
Feb 2023 - May 2023
Deep Learning
Sematic Segmentation
Python
One of the most popular tasks in the computer vision discipline utilizing deep learning techniques is semantic segmentation. However, visual understanding of complex urban street scenes may be challenging regarding both correctly semantically classifying innate imbalanced datasets and supporting mechanisms for making robust neural network architectures against image quality degradation and generalization on inference time. In this paper, both of these challenges are addressed on the Cityscapes dataset by providing solutions in terms of data augmentation, external datasets, transfer learning, and penalization class weights techniques on a U-Net-based architecture. We evaluate these mechanisms, firstly, by implementing an off-the-shelf U-Net architecture serving as a baseline model, and secondly, by making slight adjustments to that architecture to create an ensemble model comprising various decoders where our proposed techniques are applied during the training phase. The results show that our solution achieves satisfactory performance when being evaluated on the Cityscapes dataset as well as on the internal challenge of the CodaLab competition server.
Scroll to Top