Real-time Virtual Object Insertion

Paolo Favaro
Washington University, Electrical Engineering
Campus Box 1127, St. Louis, MO 63130
[email protected]

Hailin Jin
Washington University, Electrical Engineering
Campus Box 1127, St. Louis, MO 63130
[email protected]

Stefano Soatto
University of California, Los Angeles, Computer Science, Los Angeles, CA 90095
and Washington University, St. Louis
[email protected], [email protected]
We present a system that inserts virtual objects into real image sequences in real time. The system consists of off-the-shelf hardware (a camera connected to a Pentium PC) and software that (a) automatically selects and tracks region features despite changes in illumination, (b) estimates the three-dimensional position and orientation of surface patches relative to an inertial reference frame despite individual point features appearing and disappearing, and (c) inserts a texture-mapped virtual object into the scene so as to make it appear to be part of the scene and moving with it (see Figures 1 and 2). This is all done in real time.
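Schematically, steps (a)-(c) form a single per-frame loop, pictured in the C++ sketch below. Every name in it (grabFrame, trackRegions, filterUpdate, and so on) is a hypothetical placeholder rather than the actual interface of our system, and the real code is multi-threaded rather than sequential.

    // Sketch of the causal processing loop: frame t is processed as it
    // arrives, so the virtual object pose at time t depends only on
    // frames 1, ..., t. All names are illustrative placeholders.
    #include <vector>

    struct Image {};                       // grabbed camera frame
    struct Region { bool valid = true; };  // tracked planar patch
    struct Pose {};                        // rigid motion (R, T)

    Image grabFrame() { return {}; }                             // frame grabber
    void trackRegions(const Image&, std::vector<Region>&) {}     // region tracker [3]
    void rejectOutliers(std::vector<Region>&) {}                 // drop model violations
    Pose filterUpdate(const std::vector<Region>&) { return {}; } // EKF step [2]
    void renderObject(Image&, const Pose&) {}                    // texture-mapped overlay

    int main() {
        std::vector<Region> regions;        // features appear and disappear over time
        for (int t = 0; t < 1000; ++t) {    // one iteration per incoming frame
            Image frame = grabFrame();
            trackRegions(frame, regions);      // (a) illumination-robust tracking
            rejectOutliers(regions);           // e.g. T-junctions violate the warp model
            Pose pose = filterUpdate(regions); // (b) causal 3-D motion estimate
            renderObject(frame, pose);         // (c) insert the virtual object
        }
        return 0;
    }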
Figure 1. An object is placed on a turntable, which is rotated in front of the camera. Four snapshots are shown.
Figure 2. Once the camera motion and the scene structure are estimated, we insert a "virtual" vase into the scene. As can be observed, its position relative to the scene remains fixed. Four rendered views corresponding to the snapshots in Figure 1 are shown.

Additional graphic features, such as cast shadows, inter-reflections, and occlusions, can be handled off-line. While virtual object insertion has been demonstrated before, and is even part of commercial video-editing products nowadays [1], current systems require several batch steps, including feature tracking, outlier rejection, and epipolar bundle adjustment, some of which involve human intervention. What our demo shows is that results of only slightly inferior quality can be obtained in a completely automatic fashion (no human intervention whatsoever), while processing the sequence causally: the pose of the virtual object at time t is decided based on observations of the sequence up to time t.
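This causal integration is carried out by an extended Kalman filter, described in more detail below. As a generic reminder of the machinery involved (the specific motion and measurement models, and the handling of appearing and disappearing features, are those of [2]), the filter propagates a state estimate $\hat{x}$ with covariance $P$ through the standard prediction and update steps:

$$\hat{x}_{t+1|t} = f(\hat{x}_{t|t}), \qquad P_{t+1|t} = F_t P_{t|t} F_t^\top + Q,$$
$$K = P_{t+1|t} H^\top \big( H P_{t+1|t} H^\top + R \big)^{-1},$$
$$\hat{x}_{t+1|t+1} = \hat{x}_{t+1|t} + K \big( y_{t+1} - h(\hat{x}_{t+1|t}) \big), \qquad P_{t+1|t+1} = (I - K H) P_{t+1|t},$$

where $f$ is the state (motion) model with Jacobian $F_t$, $h$ the measurement (projection) model with Jacobian $H$, $y_{t+1}$ the measured feature positions at time $t+1$, and $Q$, $R$ the model and measurement noise covariances.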
This enables a whole new range of applications for real-time interaction with mixed reality (real as well as virtual objects), all on commercial off-the-shelf hardware, while the camera undergoes arbitrary motion, including hand-held. Our system uses a region-tracking algorithm [3] based on a normalized affine warp that accounts for the motion of smooth planar patches and compensates for changes in illumination. Outliers, such as T-junctions, are automatically detected as violations of the region deformation model and rejected; a sketch of this model is given below. An extended Kalman filter (analogous to [2]) interfaces with the low-level tracker in order to estimate aggregate rigid motion, while allowing individual point features to appear and disappear as the low-level tracking model is violated. The system runs in real time for 40 to 50 patches on a dual-processor 1 GHz Pentium III computer. The multi-threaded C++ code, which is readily interfaced with a frame grabber as well as with Matlab for development, will be made available to the public at the demonstration.
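To make the tracking and outlier-rejection criterion concrete, the following C++ fragment sketches the residual of an affine warp with a contrast-and-bias illumination correction, in the spirit of [3]. The exact parameterization (in particular the normalization of the warp) differs in our implementation, and all names here are illustrative.

    #include <cmath>
    #include <vector>

    // A small grayscale image stored row-major.
    struct Patch {
        int w, h;
        std::vector<double> pix;
        double at(int x, int y) const {               // clamped pixel lookup
            x = x < 0 ? 0 : (x >= w ? w - 1 : x);
            y = y < 0 ? 0 : (y >= h ? h - 1 : y);
            return pix[y * w + x];
        }
    };

    struct Warp  { double a11 = 1, a12 = 0, a21 = 0, a22 = 1, dx = 0, dy = 0; };
    struct Illum { double lambda = 1, delta = 0; };   // contrast and bias

    // Sum-of-squared-differences residual of the model
    //   lambda * I(A x + d) + delta  ~  T(x)
    // over the template region. A patch whose residual stays large after
    // the tracker converges violates the model and is declared an outlier.
    double residual(const Patch& I, const Patch& T, const Warp& p, const Illum& q) {
        double r = 0;
        for (int y = 0; y < T.h; ++y)
            for (int x = 0; x < T.w; ++x) {
                int u = (int)std::lround(p.a11 * x + p.a12 * y + p.dx);  // warped column
                int v = (int)std::lround(p.a21 * x + p.a22 * y + p.dy);  // warped row
                double e = q.lambda * I.at(u, v) + q.delta - T.at(x, y);
                r += e * e;
            }
        return r / (T.w * T.h);   // mean squared error, comparable across patch sizes
    }

    int main() {
        Patch I{4, 4, std::vector<double>(16, 0.5)};
        Patch T{4, 4, std::vector<double>(16, 0.5)};
        // Identity warp, no illumination change: the residual must vanish.
        return residual(I, T, Warp{}, Illum{}) < 1e-12 ? 0 : 1;
    }

A real tracker would interpolate rather than round the warped coordinates and would minimize this residual over (A, d, lambda, delta); the sketch only shows the quantity being minimized and thresholded.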
This research is supported in part by Intel grant 8029.

References

[1] http://www.2d3.com
[2] A. Chiuso, P. Favaro, H. Jin, and S. Soatto. MFm: 3-D motion from 2-D motion causally integrated over time. In Proc. ECCV, June 2000.
[3] H. Jin, P. Favaro, and S. Soatto. Real-time feature tracking and outlier rejection with changes in illumination. In Proc. ICCV, July 2001.