Finding Duplicates

Geeqie provides a utility to find images that have similar attributes or content.

To display a new Find Duplicates Window select File ▸ Find duplicates .

Adding files to be compared

Add files to be compared using drag and drop. Drop files or folders onto the Find Duplicates window to add them to the list of files to compare. When one or more folders are dropped onto the window a menu will appear allowing you to choose the desired action:

Add contents
The contents of dropped folders will added to the window.
Add contents recursive
The contents of dropped folders and all sub folders will be added to the window.
Skip folders
Ignore folders contained in the drop list.

When files are added to the window, the comparison is restarted to include the new files.

Symbolic links to folders will be followed, but symbolic links to files will be ignored. Only actual files will be tested.

Comparison methods

The attribute to use for two images to match can be selected with the Compare by: drop down menu. Each method is explained below:

Name

The file name.

Name case-insensitive

The file name but ignoring case.

Size

The file size.

Date

The file date.

Dimensions

The image dimensions.

Checksum

The MD5 file checksum.

Path

The complete path to file.

Similarity (high)

Very similar image content. Preset to 95%.

Similarity

Similar image content. Preset to 90%.

Similarity (low)

Slightly similar image content. Preset to 85%.

Similarity (custom)

The percentage value to used to consider two images a match is configured in the spin box at the bottom of the window. Setting a value to less than 85 is not likely to be useful.

Name ≠ Content

Show images with the same name but different content.

Because pairs of files are being compared, when there are multiple hits care should be taken when analyzing the results.

Name case-insensitive ≠ Content

Show images with the same name, ignoring case, but different content.

Because pairs of files are being compared, when there are multiple hits care should be taken when analyzing the results.

Show all

Do not compare. Show all images.

Results list

Files that match with the selected comparison method will appear in the list. Matching files are grouped in alternating color.

The order of the result list can be changed by clicking on the column header. This will re-order the images within each set. When comparing by image content similarity, the matching sets will be sorted by order of rank starting with the files that are most similar.

A context menu is available for the result list by right clicking the mouse or pressing the Menu key when a row has the focus.

Groups in the results list may be selected by using the keyboard. Refer to the Find Duplicates Window section of Keyboard Shortcuts .

The selection can be changed using the keyboard and mouse the same as in a file pane of the main window.

The image Dimensions column of the result list will only contain dimension information when comparing by dimensions, or when the data is easily available from memory or has been read from the cache.

Status bar

Along the bottom of the Find Duplicates window is an area that displays the count of files contained in the window, and the number of files in the result list.

The status bar will also display the status of an active compare operation using the progress bar. A compare operation involves 2 or 3 stages, depending on the type of comparison. These are the stages in order:

  1. If necessary, extra data is read into memory for the comparison stage and the progress bar will indicate this stage with text such as “Reading dimensions...”, “Reading checksums...”, or “Reading similarity data...”.

  2. The images are compared using the selected method, the progress bar will indicate this stage with the text “Comparing...”.
  3. The results are sorted for display, the progress bar will indicate this stage with the text “Sorting...”.

    Stage 1 is only used for the Dimensions, Checksum, and Similarity compare methods.

    If the time to complete a stage will be significant, an estimated time to completion will also be displayed in the progress bar. The estimated time only refers to the current stage, other stages are not included in the estimate. The time estimate is displayed using the format MINUTES:SECONDS.

Thumbnails

Thumbnails can be displayed beside each image in the result list by enabling the Thumbnails check box.

Ignore Orientation

When selected and a similarity compare is being used, the images are checked against 4 rotations: 0°, 90°, 180°, 270°, plus flip and mirror.

Sort

The normal sort order is for groups (in the case of Similarity checks) with the highest number of near-100% matches to be at the top of the list.

If this box is checked, groups with the lowest number of matches are placed at the top of the list.

Compare two file sets

Sometimes it is useful to compare one group of files to another, different group of files. Enable this check box to compare two groups of files. When enabled, a second list will appear and files can be added to this list using the same methods for the main list.

When comparing two file sets the results list will display matches between the two lists. For each match group, the first file is always from the main group, and the remaining files are always from the second group.

Drag and Drop

Drag and drop can be initiated with the primary or middle mouse buttons. Dragging a file that is selected will include all selected files in the drag. Dragging a file that is not selected will first change the selection to the dragged file, and clear the previous selection.

Image Data Window

Ctrl+Shift+Right Mouse click : Use this to display a dialog containing the data stored for the clicked image file. This is usually only useful for debugging purposes.