Augmented Reality Anywhere and Anytime |
|
Projects
Technology
Software Libraries
More
The Handheld Augmented Reality
Come to ISMAR 2011
|
Full 6-DOF Localization FrameworkProject DescriptionOur localization framework is based on sparse 3D point clouds and our latest achievements in natural feature tracking. The environment is captured using a high quality DSLR camera, and sparse 3D point clouds are created through a 3D reconstruction pipeline. The resulting reconstructions are investigated with respect to visibility conditions from realistic view points, and the datasets are organized in more compact blocks of data, which can be efficiently handled. Additionally, an efficient search structure for the individual feature blocks is built to allow for fast and efficient indexing. The entire dataset is stored on a remote server which is connected to a spatial database.
3D sparse reconstruction of a single room.
3D sparse reconstruction of the city center of Graz, Austria.
Given an approximate position of a mobile phone (by GPS coordinates or WIFI triangulation), the server sends the necessary data content to the client on request. The mobile phone application is provided with the necessary data for instant calculation of a full 6DOF pose given the actual input image from the mobile phone camera. Localization from ordinary images (ISMAR 2009)The features from the 2D input image are matched against the dataset of fully globally registered 3D points, and a robust 3-point pose algorithm is used to robustly find the correct pose.
Workflow during the offline and the online stage of our framework. In the offline stage sparse feature
Due to the limited field of view of the mobile phone camera, attaching an additional wide angle lens to the camera is advisable. Especially in indoor environments a large field of view is essential for a good performance of the approach.
Color-coded path through three reconstructed rooms in our department, starting in the right room (red)
Localization from panoramic images (ISMAR 2011)To overcome the narrow field of view of standard cameras, in earlier work we used a wide angle lens attached to the mobile phone. In this work we use a special algorithm for generating panoramic images in real-time (see Figure below). The details about the mapping and tracking algorithm can be found here.
A typical panoramic image shot from a mobile phone without any exposure change compensation.
While the panoramic image is created, the features are matched against the database in the background, followed by a robust pose estimation step. The flowchart of the system is depicted in the Figure below. A part of the reconstruction is retrieved from the server given an approximate GPS position. The feature extraction, feature matching and pose estimation steps are integrated as background tasks while new pixels are mapped into the panoramic image from the live video feed. As soon as the localization succeeds, a full 6-DOF pose is obtained.
Flowchart of the localization system.
The result of the localization procedure is shown in the Figure below. The inliers are spread around the entire 360-degree panorama. The rays from the camera center directly hit the corresponding 3D points in the environment, passing through their projection in the image (i.e. the textured cylinder).
Localization result for a full 360-degree panoramic image. The yellow lines indicate the inliers used for establishing the pose estimate.
A path through the city center of Graz, Austria is shown in the Figure below. Around 230 full panoramic images were acquired using an omnidirectional PointGrey Ladybug camera. A full noise-free 360-degree panorama is the ideal case for our localization procedure obviously. The localization also works with distorted and noisy images, however. We conclude that an increase of the field of view directly infers an increasing success rate and also an improved accuracy of the localization results.
Color-coded path through the city center of Graz, Austria. The path starts in the lower left area of the image and ends in the upper left area. Since parts of the reconstruction from the square in the right area is missing, the localization fails in this area. As soon as enough environmental area is available as reconstruction again, the localization approach succeeds again.
A video describing our approach in short can be found here. Current Project StatusThe current systems already contains all parts for reconstruction, registration and alignment. The algorithms can be used to create reconstructions for indoor or outdoor scenarios. At the moment we consider the generation of multiple reconstructions using an omnidirectional camera to allow for different investigations, mainly concerning feature stability under different weather and lighting conditions. Project Team Clemens Arth Alumni
PublicationsReal-Time Self-Localization from Panoramic Images on Mobile Devices Authors: Clemens Arth, Manfred Klopschitz, Gerhard Reitmayr, Dieter Schmalstieg Details: To Appear in: International Symposium on Mixed and Augmented Reality (ISMAR), 26-29 Oct. 2011 Self-localization in large environments is a vital task for accurately registered information visualization in outdoor Augmented Reality (AR) applications. In this work, we present a system for selflocalization on mobile phones using a GPS prior and an onlinegenerated panoramic view of the user’s environment. The approach is suitable for executing entirely on current generation mobile devices, such as smartphones. Parallel execution of online incremental panorama generation and accurate 6DOF pose estimation using 3D point reconstructions allows for real-time self-localization and registration in large-scale environments. The power of our approach is demonstrated in several experimental evaluations.
Towards Wide Area Localization on Mobile Phones Authors: Clemens Arth, Daniel Wagner, Manfred Klopschitz, Arnold Irschara, Dieter Schmalstieg Details: International Symposium on Mixed and Augmented Reality (ISMAR), 19-23 Oct. 2009 We present a fast and memory effcient method for localizing a mobile user’s 6DOF pose from a single camera image. Our approach registers a view with respect to a sparse 3D point reconstruction. The 3D point dataset is partioned into pieces based on visibility constraints and occlusion culling, making it scalable and efficient to handle. Starting with a coarse guess, our system only considers features that can be seen from the user’s position. Our method is resource efficient, usually requiring only a few megabytes of memory, thereby making it feasible to run on low-end devices such as mobile phones. At the same time it is fast enough to give instant results on this device class.
website maintained by
Tobias Langlotz |
copyright (c) 2011 Graz University of Technology |