10.14489/vkit.2019.02.pp.023-029

DOI: 10.14489/vkit.2019.02.pp.023-029

Горбацевич В. С., Князь В. А., Мельниченко М. А., Степаньянц Д. Г.
МЕТОДЫ ГЛУБОКОГО ОБУЧЕНИЯ ДЛЯ АВТОМАТИЧЕСКОГО ПОСТРОЕНИЯ ТРЕХМЕРНЫХ МОДЕЛЕЙ ПО АЭРОФОТОСНИМКАМ
(с. 23-29)

Аннотация. Рассмотрены вопросы применимости методов глубокого обучения в задачах построения трехмерных моделей по данным аэрофотосъемки. Представлены алгоритмы решения задачи и результаты их работы. Показаны преимущества и недостатки в сравнении с существующими подходами. Предложены способы фильтрации случайных выбросов и сглаживания модели на основе алгоритма FastDTW (Dynamic Time Warping), а также алгоритмы субпиксельного измерения координат соответствующих точек снимка. Приведено сравнение 3D-моделей, построенных при помощи нейронной сети и корреляционного алгоритма, с моделью, полученной при помощи 3D-сканера.

Ключевые слова: обучение сверточной нейронной сети; алгоритм FastDTW; стереоотождествление; 3D-модель местности.

Gorbatsevich V. S., Knyaz V. A., Melnichenko M. A., Stepanyants D. G.
DEEP LEARNING METHODS FOR AUTOMATIC BUILDING THREE-DIMENSIONAL MODELS FOR AERIAL IMAGES
(рр. 23-29)

Abstract. The questions of the applicability of deep learning methods for the problems of constructing three-dimensional models according to aerial images are considered. A convolutional neural network is considered that forms for each point of the image a feature vector characterizing this point. The neural network consist of seven convolutional layers and forms a feature vector with a size of 100 elements. For the analysis of the network working and its training were used images of terrain model. Training was conducted on single images and artificially generated textures. For filtering random outliers and smoothing the model, a method based on the FastDTW (Dynamic Time Warping) algorithm has been proposed, in which sequences of feature vectors of rows of left and right stereo pairs are matched. An algorithm for subpixel measurement of the coordinates of the corresponding points of the image using the quadratic approximation of the dependence of the correspondence coefficient on the coordinate of the point is given. For the FastDTW algorithm, the subpixel coordinate measurement can be implemented by stretching the original sequences of the feature vectors before applying the FastDTW-algorithm and then compressing this. A comparison of 3D-models built using a neural network with a model obtained using a 3D-scanner is presented. The best results were achieved using a convolutional network with a subpixel version of the FastDTW-algorithm.

Keywords: Convolutional neural network training; FastDTW-algorithm; Stereo matching; 3D-model terrain.

+ - Информация об авторах (About the Authors) Click to collapse

Рус

В. С. Горбацевич, В. А. Князь, М. А. Мельниченко, Д. Г. Степаньянц (ФГУП «Государственный научно-исследовательский институт авиационных систем» ГНЦ РФ, Москва, Россия) E-mail: Этот e-mail адрес защищен от спам-ботов, для его просмотра у Вас должен быть включен Javascript

Eng

V. S. Gorbatsevich, V. A. Knyaz, M. A. Melnichenko, D. G. Stepanyants (State Research Institute of Aviation Systems State Scientific Center of Russian Federation, Moscow, Russia) E-mail: Этот e-mail адрес защищен от спам-ботов, для его просмотра у Вас должен быть включен Javascript

+ - Библиографический список (References) Click to collapse

Рус

1. Luo W., Schwing A. G., Urtasun R. Efficient Deep Learning for Stereo Matching // IEEE Conference on Computer Vision and Pattern Recognition (CVPR). 2016. P. 5695 – 5703. doi: ieeecomputersociety.org/ 10.1109/CVPR.2016.614
2. End-to-End Training of Hybrid CNN-CRF Models for Stereo / P. Knobelreiter et al. arXiv:1611.10229v2[cs.CV] 03.05.2017. 16 р. URL: https://arxiv.org/pdf/1611.10229.pdf (дата обращения: 20.12.2018).
3. Salvador S., Chan P. FastDTW: Toward Accurate Dynamic Time Warping in Linear Time and Space // Proc. of KDD Workshop on Mining Temporal and Sequential Data. 2004. P. 70 – 80.

Eng

1. Luo W., Schwing A. G., Urtasun R. (2016). Effcient Deep Learning for Stereo Matching. IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 5695-5703. doi: ieeecomputersociety.org/ 10.1109/CVPR.2016.614
2. Knobelreiter P. et al. (2017). End-to-End Training of Hybrid CNN-CRF Models for Stereo. Knobelreiter arXiv:1611.10229v2[cs.CV] Available at: https://arxiv. org/pdf/1611.10229.pdf (Accessed: 20.12.2018).
3. Salvador S., Chan P. (2004). FastDTW: Toward Accurate Dynamic Time Warping in Linear Time and Space. Proceedings of KDD Workshop on Mining Temporal and Sequential Data, pp. 70-80.

+ - Заказать электронную версию статьи (Purchase digital version of a single article) Click to collapse

Рус

Статью можно приобрести в электронном виде (PDF формат).

Стоимость статьи 350 руб. (в том числе НДС 18%). После оформления заказа, в течение нескольких дней, на указанный вами e-mail придут счет и квитанция для оплаты в банке.

После поступления денег на счет издательства, вам будет выслан электронный вариант статьи.

Для заказа скопируйте doi статьи:

10.14489/vkit.2019.02.pp.023-029

и заполните форму

Отправляя форму вы даете согласие на обработку персональных данных.

Eng

This article is available in electronic format (PDF).

The cost of a single article is 350 rubles. (including VAT 18%). After you place an order within a few days, you will receive following documents to your specified e-mail: account on payment and receipt to pay in the bank.

After depositing your payment on our bank account we send you file of the article by e-mail.

To order articles please copy the article doi:

10.14489/vkit.2019.02.pp.023-029

and fill out the form