idea: take 2 or more images (bursts of images) and combine them
$\begin{bmatrix}wx'\\wy'\\w\end{bmatrix} = \begin{bmatrix}* & * & *\\* & * & *\\* & * & *\end{bmatrix} \begin{bmatrix}x\\y\\1\end{bmatrix}$
$p' = Hp$
$\begin{bmatrix}wx'\\wy'\\w\end{bmatrix} = \begin{bmatrix}a & b & c\\d & e & f\\g & h & i\end{bmatrix} \begin{bmatrix}x\\y\\1\end{bmatrix}$
9 unknowns and $w'$
$w'$ is easy: $w' = gy + hx + i$
Set up a system of linear equations:
$Ah = b$
where vector of unknowns $h = [a,b,c,d,e,f,g,h]^T$
Need at least 8 eqs
Solve for h. SVD for Eigen Value = 0
$min\|Ah-b\|^2$
Backward Mapping eliminate holes
Needs a invertible wrap function: Not always possible
Using Internet Billions of Images to...
A.I. for the postmodern world:
all questions have already been answered…many times, in many ways
Google is dumb, the “intelligence” is in the data
Text is simple:
clean, segmented, compact, 1D
Visual data is much harder:
Noisy, unsegmented, high entropy, 2D/3D
Texture synthesis
inpainting
looking for semantic information
Compute oriented edge response at multiple scales (5 spatial scales, 6 orientations)
Gist scene descriptor (Oliva and Torralba 2001)
Color descriptor – color of the query image downsampled to 4x4
Find 200 closest neighbors in database
Graph-cut
10 nearest neighbors, 20.000 image database
10 nearest neighbors, 2.3M image database
label an image as containing a person or not
colorize gray scale images
more later...
What can you say about where these photos were taken?
6.5 million images with both GPS coordinates and geographic keywords, removing images with keywords like birthday, concert, abstract, …
Test set – 400 randomly sampled images from this collection. Manually removed abstract photos and photos with recognizable people – 237 test photos.
across database size
across features
Where was the Painter Standing?
people follow photographic conventions
These slides have been prepared with materials, slides, and discussions from the authors of the papers.