H.264 encoder based 3D model acquisition and compression for aerial video

Alexander Samochin, Evgeny Kaminsky, Hugo Guterman

Research output: Chapter in Book/Report/Conference proceeding; Conference contribution; peer-review

Abstract

In this paper, a method for automatic 3D model extraction from aerial images is proposed. The system is based on the H.264 video encoder, which is used for both 3D information extraction and compression. To obtain a disparity map from the sequence of aerial images, the motion vectors produced by the H.264 encoder's motion estimation algorithm are used. The main drawback of using motion estimation for disparity generation is computational noise. A new statistical approach is proposed to estimate the disparity of building surfaces from a noisy motion estimation map. A method for compressing the disparity for further transmission is also suggested. The proposed algorithm can be integrated into the H.264 encoder for fully automatic 3D urban scene extraction and compression. Tests are performed on real video sequences taken at several sites from various airborne camera heights.
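The abstract does not specify the statistical estimator, so the following is only an illustrative sketch of the general idea: treating the motion vectors that fall on one surface as noisy disparity samples and reducing them to a single robust value. The function name `robust_disparity`, the threshold `k`, and the median/MAD outlier rule are assumptions made for this example, not the authors' method.

```python
from statistics import median


def robust_disparity(vectors, k=3.0):
    """Estimate one disparity value from noisy (dx, dy) motion vectors.

    Hypothetical helper: uses the median and the median absolute
    deviation (MAD) to reject outliers. This is a generic robust
    estimator, not the estimator proposed in the paper.
    """
    # Use the motion-vector magnitude as the disparity sample.
    mags = [(dx * dx + dy * dy) ** 0.5 for dx, dy in vectors]
    if not mags:
        raise ValueError("no motion vectors supplied")

    med = median(mags)
    mad = median(abs(m - med) for m in mags)
    if mad == 0:
        # At least half of the samples agree exactly; keep the median.
        return med

    # Keep samples within k MADs of the median and average them.
    inliers = [m for m in mags if abs(m - med) <= k * mad]
    return sum(inliers) / len(inliers)


# Motion vectors for one assumed building surface, with one noisy block.
surface = [(4, 0), (4, 0), (5, 0), (3, 0), (40, 0)]
print(robust_disparity(surface))
```

In this sketch the outlying vector `(40, 0)` is discarded before averaging, which is the kind of noise suppression a per-surface estimate needs; a real pipeline would also have to segment surfaces and account for camera motion, which the abstract does not detail.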

Original language: English
Title of host publication: 2010 IEEE 26th Convention of Electrical and Electronics Engineers in Israel, IEEEI 2010
Pages: 637-640
Number of pages: 4
State: Published - 1 Dec 2010
Event: 2010 IEEE 26th Convention of Electrical and Electronics Engineers in Israel, IEEEI 2010 - Eilat, Israel
Duration: 17 Nov 2010 to 20 Nov 2010

Publication series

Name: 2010 IEEE 26th Convention of Electrical and Electronics Engineers in Israel, IEEEI 2010

Conference

Conference: 2010 IEEE 26th Convention of Electrical and Electronics Engineers in Israel, IEEEI 2010
Country/Territory: Israel
City: Eilat
Period: 17/11/10 to 20/11/10

Keywords

  • 3D reconstruction
  • Aerial 3D modeling
  • Aerial images
  • Disparity compression
  • H.264 encoding
  • Urban scene compression
  • Urban scene extraction

ASJC Scopus subject areas

  • Electrical and Electronic Engineering
