I'm Drowning in Images
-
I have a couple hundred thousand digital photos. I have a bunch of duplicate images from overlapping backups. I have inconsistent naming and folder schemes. I have edited and cropped versions of the same image. I need some software to help me clean and sort my images. I need one or many pieces of software that can:De-duplicate Images -- I would prefer something simple that shows two images with dimensions & metadata then asks me if they are the same.Add tags to the images -- Geo, face, and notes about each of the images. I need the tags to be written back into the images, so that in a few years other software can read the tags.Image Versioning -- I want to be able to associate a edited version or a crop to the original image.Web Browsable -- I want to be able to share all of my images online and let people view and sort based on tags, date, etc.I will pay hundreds of dollars for this software, but most of the free-ish software that I've seen doesn't do this (Picasa, iPhoto, Flickr, etc). On the other end of the spectrum there are corporate grade digital asset management tools that run in the $10,000+ range that I can't afford. I know this is a pretty specific feature set, and I want to make sure it doesn't exist before I look into writing it myself.
-
Answer:
These utilities need all the files on the same drive. I've used both, recommended: To get rid of exact duplicate files, even if the filename has been changed, free http://www.bigbangenterprises.de/en/doublekiller/. Do this first. This program is not particularly image oriented, it checks all files by a computed checksum. To de-duplicate images which may be different sizes/resolutions or resaved JPGs, in other words the same image in different formats, free http://www.visipics.info. This can help somewhat with versioning too. This one can be very slow, and for what it does that is understandable. Tagging and web albums will need something else.
gregr at Ask.Metafilter.Com Visit the source
Other answers
I suspect that identifying which images are resized versions of each other or similar images is a fairly hard problem. http://stackoverflow.com/questions/4196453/simple-and-fast-method-to-compare-images-for-similarity has some suggestions; https://www.memonic.com/user/aengus/folder/coding/id/1qVeq appears to be another approach. The http://en.wikipedia.org/wiki/Scale-invariant_feature_transform may be another approach - it's implemented in the http://fiji.sc/wiki/index.php/Fiji distribution of ImageJ, among other places and I've used it a bit for image registration.
pombe
Easiest part - sort by md5sum, print duplicates. For similar images, maybe you sort by some function of the color space?
gregglind
Doing an MD5 of the files should be pretty easy & catch a lot of the duplicates. For similar images I've looked into some kind of histogram comparison, but that looks like it might get a little involved.
gregr
I spent several days writing custom AppleScripts for accomplishing this with Aperture. It's true âThere is no commercial software that delivers decent results.
mmdei
If you are using a Mac, have a look at http://www.hyperbolicsoftware.com/DupeZap.html for finding duplicates.
conrad53
I've done duplicate image detection using Fourier crosscorrelation. You'd want to do it on a small subsample of the images (like 128x128). It's simple enough that you can implement it in Python with the Numeric library in a couple of hours. It would still take quite a while to compare hundreds of thousands of images though, since each pairwise comparison would require a matrix multiplication.
miyabo
(That method would be great for detecting cropped and resized images, but it wouldn't handle rotations at all.)
miyabo
Related Q & A:
- Why i cannot open any images in the yahoo pages?Best solution by Yahoo! Answers
- Why do I pee a little when I sneeze, even though I'm not pregnant?Best solution by answers.yahoo.com
- I'm looking for a job but I'm only 16, where should I apply?Best solution by snagajob.com
- I want to go into school swimming carnivals, but I think I'm too old.Best solution by Yahoo! Answers
- Should I take the ACT test and the SAT II Subject test in June if I haven't prepared yet and I'm a Junior?Best solution by Yahoo! Answers
Just Added Q & A:
- How many active mobile subscribers are there in China?Best solution by Quora
- How to find the right vacation?Best solution by bookit.com
- How To Make Your Own Primer?Best solution by thekrazycouponlady.com
- How do you get the domain & range?Best solution by ChaCha
- How do you open pop up blockers?Best solution by Yahoo! Answers
For every problem there is a solution! Proved by Solucija.
-
Got an issue and looking for advice?
-
Ask Solucija to search every corner of the Web for help.
-
Get workable solutions and helpful tips in a moment.
Just ask Solucija about an issue you face and immediately get a list of ready solutions, answers and tips from other Internet users. We always provide the most suitable and complete answer to your question at the top, along with a few good alternatives below.