Pavement Distress Detection From Orthophotos With Two-Stream Convolutional Neural Networks

Student Roland Lõuk
Supervisor Aleksei Tepljakov
Keywords deep learning, computer vision, pavement distress
Degree MSc
Thesis language English
Defense date January 22, 2020
Talk slides Download Presentation Slides
Document link Download Thesis Document

Abstract

Automated pavement distress detection is an important but challenging step toward timely road maintenance. Given the vastness of road networks across the world, manual defect detection for roads is highly labor-intensive. In recent years, however, convolutional neural networks have been shown to achieve groundbreaking results in the field of image classification. This thesis researches and develops methods for applying convolutional neural networks to pavement distress detection in high-resolution sections of orthophotos (orthoframes). To address GPU memory limitations and improve the localization of detections, a sliding-window approach is used to partition the orthoframe into 224x224-pixel segments, each of which is subject to binary classification. However, the sliding-window approach does not allow the model to account for the context surrounding a segment, and results may suffer due to the small window size. This thesis proposes a ResNet-based convolutional neural network that takes two input streams: the 224x224-pixel content segment, which is the subject of classification, and a downscaled context view of the area around the content segment. Experiments on two different datasets show increased classification accuracy for the two-stream approach compared to the single-stream approach.
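The window geometry described above can be illustrated with a minimal sketch. It only computes pixel coordinates for each 224x224 content segment and a larger context region around it. The function names, the non-overlapping stride, the context scale factor of 3, and clipping the context region at the frame border are all assumptions made for illustration; the abstract does not specify them. Downscaling the context region to the network input size and the two-stream network itself are left out.

```python
from typing import Iterator, List, NamedTuple, Tuple


class Box(NamedTuple):
    """Axis-aligned pixel box; right and bottom are exclusive."""
    left: int
    top: int
    right: int
    bottom: int


def sliding_windows(width: int, height: int,
                    size: int = 224, stride: int = 224) -> Iterator[Box]:
    """Yield content windows over a frame of the given size.

    Partial windows at the right and bottom edges are skipped
    (an assumption; padding the frame is another option).
    """
    for top in range(0, height - size + 1, stride):
        for left in range(0, width - size + 1, stride):
            yield Box(left, top, left + size, top + size)


def context_box(content: Box, width: int, height: int, scale: int = 3) -> Box:
    """Return a box `scale` times the content size, centered on the
    content window and clipped to the frame (hypothetical choice)."""
    size = content.right - content.left
    margin = (scale - 1) * size // 2
    return Box(max(0, content.left - margin),
               max(0, content.top - margin),
               min(width, content.right + margin),
               min(height, content.bottom + margin))


def window_pairs(width: int, height: int, size: int = 224,
                 stride: int = 224, scale: int = 3) -> List[Tuple[Box, Box]]:
    """Pair every content window with its surrounding context box."""
    return [(c, context_box(c, width, height, scale))
            for c in sliding_windows(width, height, size, stride)]


if __name__ == "__main__":
    # Example frame size chosen arbitrarily for illustration.
    for content, context in window_pairs(1000, 700)[:3]:
        print(content, context)
```

In a full pipeline, each content box would be cropped at native resolution and each context box would be cropped and resized to the same input size before both are passed to their respective streams.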

