Computer Vision: What are state of the art algorithms for efficient image retrieval under light distortion?
-
details: the target and the source image are the same image - subjected to quasi-affine transform (scale/rotate/light/crop) as an example consider matching the images resulting from capturing exactly the same scene taken by two photographers (different cameras, angles, directions, zoom, position). interesting algorithms perform efficiently with large databases.
-
Answer:
What you're describing fits the description of "near duplicate mage retrieval," which is a relatively mature sub-area of computer vision. A good approach to use for this is the Bigimbaz method of Douze et al. at INRIA [1]. You can read more about it in the IJCV article by the same authors [2]. The method is based on bag-of-features, binary signatures and Hamming distance. [1] http://bigimbaz.inrialpes.fr/ [2] http://lear.inrialpes.fr/pubs/2010/JDS10a/
Serge Belongie at Quora Visit the source
Other answers
In general, it is dependent on the type of object for which you are trying to detect. It is important to consider the object class, because if it is very general (bird, chair, etc.), then distortions may be the least of your worries. In this case, you will require many different views of many different examples (a classic example is the large variation in types of chairs). General frameworks are certainly improving, and are showing promise (such as Felzenszwalb et al.'s deformable parts model http://people.cs.uchicago.edu/~pff/latent/). SIFT (http://en.wikipedia.org/wiki/Scale-invariant_feature_transform) is still one of the best-in-class general purpose descriptors. If your problem is less abstract, and more in the lines of a specific product, template matching may work for you. Book covers are an example of this. Of course, this assumes there is only very slight distortion. Example, with code: http://nashruddin.com/template-matching-in-opencv-with-example.html Finally, if it is more shape-based (handwritten characters, etc.), shape contexts are a great solution. It requires low clutter, but scale, rotation and lighting are rarely a significant issue. Belongie et al. on Shape Contexts: http://www.eecs.berkeley.edu/Research/Projects/CS/vision/shape/belongie-pami02.pdf Note that there have been many variations and improvements for all of these algorithms, so it's worth seeing the new algorithms folks have built on top of these key works.
Dan Preston
The problem you described can be solved using image search algorithms based on local features. This class of algorithms can handle huge image databases and provide accurate results. Please check the following source for more detail: An introduction to the image search algorithms based on local features is available at my blog: http://blog.gregorybegelman.com
Gregory Begelman
Related Q & A:
- What do you think about Art?Best solution by Yahoo! Answers
- What is post modernism in art exactly?Best solution by Yahoo! Answers
- What are some careers in art?Best solution by Yahoo! Answers
- What does Kitsch mean in Art?Best solution by Yahoo! Answers
- What previous movements influenced the art of the Renaissance, and how?Best solution by Yahoo! Answers
Just Added Q & A:
- How many active mobile subscribers are there in China?Best solution by Quora
- How to find the right vacation?Best solution by bookit.com
- How To Make Your Own Primer?Best solution by thekrazycouponlady.com
- How do you get the domain & range?Best solution by ChaCha
- How do you open pop up blockers?Best solution by Yahoo! Answers
For every problem there is a solution! Proved by Solucija.
-
Got an issue and looking for advice?
-
Ask Solucija to search every corner of the Web for help.
-
Get workable solutions and helpful tips in a moment.
Just ask Solucija about an issue you face and immediately get a list of ready solutions, answers and tips from other Internet users. We always provide the most suitable and complete answer to your question at the top, along with a few good alternatives below.