Deep Learning Segmentation and 3D Reconstruction of Road Markings Using Multiview Aerial Imagery
<p>Appearance of road markings in aerial imagery (<b>a</b>–<b>c</b>) and corresponding parts of the digital surface model (DSM) (<b>d</b>–<b>f</b>).</p> "> Figure 2
<p>(<b>Top row</b>) processing steps performed in the image space; (<b>bottom row</b>) processing steps in the object space.</p> "> Figure 3
<p>Basic idea of line-based 3D refinement (<b>a</b>) before optimization and (<b>b</b>) after optimization and principle of a sliding window for the (<b>c</b>) first and (<b>d</b>) second node of a line.</p> "> Figure 4
<p>(<b>a</b>,<b>b</b>) Examples of segmented road markings in pink and center lines in yellow generated by the skeleton operator. (<b>c</b>) Approximation points in object space with first and last point of an iteration in yellow, as well as the target point in red. (<b>d</b>) Reprojected approximation points into image space with search space for the search of corresponding line points (blue box). (<b>e</b>) Selected points in red in one image.</p> "> Figure 5
<p>(<b>a</b>) DSM of a motorway surface. (<b>b</b>) Number of stereo pairs contributing to each DSM pixel. (<b>c</b>) Standard deviation of the height in meter for each DSM pixel.</p> "> Figure 6
<p>Results of the DL road marking segmentation (magenta) in two selected overlapping images (<b>a</b>,<b>d</b>,<b>g</b>,<b>j</b>) and (<b>b</b>,<b>e</b>,<b>h</b>,<b>k</b>) of an image sequence. (<b>c</b>,<b>f</b>,<b>i</b>,<b>l</b>) show the projected road markings in the object space with the number of contributing images (color coded). The darker, the more often the road marking was detected in the contributing images.</p> "> Figure 7
<p>Details of the DL road marking segmentation in overlapping image pairs. (<b>a</b>,<b>e</b>) shows detected road marking in a construction zone; (<b>b</b>,<b>f</b>) markings of parking spaces are partly occluded in one image by a vehicle, false positives on the right of the parking lot are visible; (<b>c</b>,<b>g</b>) the truck on a motorway occludes a road marking in one image; (<b>d</b>,<b>h</b>) false positives caused by vehicles are not detected in the other image.</p> "> Figure 8
<p>Results of the 3D refinement of road markings for four test sites. Left images (<b>a</b>,<b>c</b>,<b>e</b>,<b>g</b>) show the completeness of 3D refinement and right images (<b>b</b>,<b>d</b>,<b>f</b>,<b>h</b>) show a 3D view of the refined road marking points (blue dots) superimposed over the DSM. At green dots all requirements are fulfilled and the 3D refinement converges; the remaining road marking points are red.</p> "> Figure 9
<p>Details of the 3D refinement: Many points at parking spaces and complex urban crossings do not fulfill the requirements, resulting in low completeness (<b>a</b>,<b>b</b>), almost all road markings point converge in the 3D refinement even at double dashed lane markings or at occlusions from vehicles. (<b>c</b>) completeness on motorways.</p> "> Figure 10
<p>(<b>a</b>) 3D view of a planar road surface; refined 3D points are overlaid on the DSM. (<b>b</b>) Cross section of planar road surface, with six lanes showing refined 3D points in red and DSM surface in gray. (<b>c</b>) Root mean square (RMS) in height depending on the number of images used for the refinement.</p> ">
Abstract
:1. Introduction
2. Road Marking Properties
3. Methodology
3.1. DL (Deep Learning) Segmentation of Road Markings
3.2. Least-Squares Refinement of 3D Points
3.3. Generation of Approximations and Selection of Corresponding Line Points
4. Experimental Results and Evaluations
4.1. Input Data
4.2. DSM Generation
4.3. Results of the DL Road Marking Segmentation
4.4. Results of the 3D Refinement
5. Discussion
Author Contributions
Funding
Acknowledgments
Conflicts of Interest
Abbreviations
CNN | Convolutional neural network |
DL | Deep learning |
DLR | German Aerospace Center |
DSM | Digital surface model |
GSD | Ground sampling distance |
IoU | Intersection over union |
RMS | Root mean square |
SGM | Semiglobal matching |
SRTM | Shuttle radar topography mission |
References
- Sheu, C.Y.; Kurz, F.; Angelo, P. Automatic 3D lane marking reconstruction using multi-view aerial imagery. ISPRS Ann. Photogramm. Remote. Sens. Spat. Inf. Sci. 2018, IV-1, 147–154. [Google Scholar] [CrossRef]
- Schmid, C.; Zisserman, A. Automatic Line Matching across Views. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern, San Juan, Puerto Rico, USA, 17–19 June 1997; Volume 1, pp. 666–671. [Google Scholar] [CrossRef]
- Bay, H.; Ferrari, V.; Gool, L.V. Wide-baseline stereo matching with line segments. In Proceedings of the 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, San Diego, CA, USA, 20–25 June 2005; Volume 1, pp. 329–336. [Google Scholar] [CrossRef]
- Wang, Z.; Wu, F.; Hu, Z. MSLD: A robust descriptor for line matching. Pattern Recognit. 2009, 42, 941–953. [Google Scholar] [CrossRef]
- Azimi, S.M.; Fischer, P.; Körner, M.; Reinartz, P. Aerial LaneNet: Lane Marking Semantic Segmentation in Aerial Imagery using Wavelet-Enhanced Cost-sensitive Symmetric Fully Convolutional Neural Networks. arXiv, 2018; arXiv:1803.06904. [Google Scholar] [CrossRef]
- Kurz, F.; Türmer, S.; Meynberg, O.; Rosenbaum, D.; Runge, H.; Reinartz, P.; Leitloff, J. Low-cost Systems for real-time Mapping Applications. Photogramm. Fernerkund. Geoinf. 2012, 159–176. [Google Scholar] [CrossRef]
- Taylor, C.; Kriegman, D. Structure and motion from line segments in multiple images. IEEE Trans. Pattern Anal. Mach. Intell. 1995, 17, 1021–1032. [Google Scholar] [CrossRef] [Green Version]
- Kurz, F.; Rosenbaum, D.; Meynberg, O.; Mattyus, G.; Reinartz, P. Performance of a real-time sensor and processing system on a helicopter. ISPRS Int. Arch. Photogramm. Remote. Sens. Spat. Inf. Sci. 2014, XL-1, 189–193. [Google Scholar] [CrossRef]
- Fischer, P.; Plaß, B.; Kurz, F.; Krauss, T.; Runge, H. Validation of HD maps for autonomous driving. In Proceedings of the International Conference on Intelligent Transport Systems in Theory and Practice, Munich, Germany, 4–6 July 2017. [Google Scholar]
- d’Angelo, P.; Reinartz, P. Semiglobal Matching Results on the ISPRS Stereo Matching Benchmark. ISPRS Hann. Workshop 2011, XXXVIII-4/W19, 1–6. [Google Scholar] [CrossRef]
- Scharstein, D.; Hirschmüller, H.; Kitajima, Y.; Krathwohl, G.; Nešić, N.; Wang, X.; Westling, P. High-Resolution Stereo Datasets with Subpixel-Accurate Ground Truth. In German Conference on Pattern Recognition; Springer: Cham, Switzerland, 2014; Volume 8753, pp. 31–42. [Google Scholar] [CrossRef]
Motorways | Rural Roads | Other Roads | ||
---|---|---|---|---|
continuous line (narrow width) | [m] | [m] | [m] | |
continuous line (broad width) | [m] | [m] | [m] | |
dashed line (length/gap) | / [m] | / [m] | / [m] | |
double line (distance) | [m] | [m] | [m] |
(b) | ||
Canon EOS 1D-X | ||
Lenses Sensor/Pixel size Image size | Zeiss M. Planar f/2.0 50 mm Full frame CMOS/6.944 μm 5184 × 3456 pixel, ratio 3:2 (17.9 MPix) | |
(c) | ||
Flight Configuration | ||
Oblique angle FOV Coverage @500m Flight height GSD @500-700m Across overlap Along overlap | ±15° ±34° across strip, ±13° along strip 780 m × 230 m 500–700 m 7–9 cm (nadir) 50% 75% |
Test Site | Seg. [IoU] | Seg. Multiview [IoU] | 3D Refinement [% of pts] |
---|---|---|---|
A (urban roads) | |||
B (motorways) | |||
C (rural roads) | |||
D (parking lots) |
© 2019 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
Share and Cite
Kurz, F.; Azimi, S.M.; Sheu, C.-Y.; d’Angelo, P. Deep Learning Segmentation and 3D Reconstruction of Road Markings Using Multiview Aerial Imagery. ISPRS Int. J. Geo-Inf. 2019, 8, 47. https://doi.org/10.3390/ijgi8010047
Kurz F, Azimi SM, Sheu C-Y, d’Angelo P. Deep Learning Segmentation and 3D Reconstruction of Road Markings Using Multiview Aerial Imagery. ISPRS International Journal of Geo-Information. 2019; 8(1):47. https://doi.org/10.3390/ijgi8010047
Chicago/Turabian StyleKurz, Franz, Seyed Majid Azimi, Chun-Yu Sheu, and Pablo d’Angelo. 2019. "Deep Learning Segmentation and 3D Reconstruction of Road Markings Using Multiview Aerial Imagery" ISPRS International Journal of Geo-Information 8, no. 1: 47. https://doi.org/10.3390/ijgi8010047
APA StyleKurz, F., Azimi, S. M., Sheu, C. -Y., & d’Angelo, P. (2019). Deep Learning Segmentation and 3D Reconstruction of Road Markings Using Multiview Aerial Imagery. ISPRS International Journal of Geo-Information, 8(1), 47. https://doi.org/10.3390/ijgi8010047