ciid - Chronological Image Identifier

ciid is a utility to derive a chronologically sortable, unique identifier for images.

Usually, digital cameras and phones assign file names to images with a sequence of only 4 digits (e.g. IMG_1234.dng). Those names will easily clash for any sufficiently large amount of images.

ciid tackles this problem by deriving a hash from the image buffer. Additionally to being able to derive an identifier that is very unlikely to clash, this hash can later be used to check the integrity of the image content.

Some image processing programs update metadata of files (e.g inline JPEG- previews, tags, modified date). The resulting ciid will be unaffected from those changes, since only the actual image buffer is hashed. This has the nice side-effect that proprietary camera RAW file formats and converted .dng files will yield the same identifier most of the time.

Here's how a resulting identifier looks like:

0A1B2C3D4E-5F6G7H8J9K0M1N2P3Q4R5S6T7V8W9X0Y1Z2A3B4C5D6E7F8G9H00
└───┬────┘ └────────────────────────┬─────────────────────────┘
 timestamp                hash of image buffer

The first part of the identifier encodes the creation date of the image (a 50-bit timestamp with millisecond precision), while the second part is a hash (SHA-256) based on the contents of the image buffer.

The encoding uses Douglas Crackford's base32 alphabet with the following characters:

0, 1, 2, 3, 4, 5, 6, 7, 8, 9, A, B, C, D, E, F, G, H, J, K, M, N, P, Q, R, S, T, V, W, X, Y, Z.

Following criteria were considered when choosing the character set:

encode information sufficiently compact (in this case 5 bits per character)
have sensible alphabetical ordering on file systems (timestamps with a higher value should appear strictly after lower ones)
no distinction between upper- und lowercase, avoiding issues on case insensitive file systems
be safe to use in URLs

Installation (via script)

Download and run the installation script:

$ curl -s https://raw.githubusercontent.com/pablosichert/ciid/master/bin/install.sh | bash

Installation (manually)

Prerequisites

For help with installing the dependencies, have a look at the install script.

Install the ciid binary onto your system via cargo:

$ cargo install ciid

Usage

$ ciid [FLAGS] <file path>...

Flags

Short	Long	Description
-h	--help	Prints help information
	--rename-file	Renames the file to the derived identifier. Preserves the file extension
-V	--version	Prints version information
	--verify-name	Verifies if the provided file name is equal to the derived identifier

Options

Short	Long	Description
	--print <template>	Prints provided template to stdout, substituting variables with file information. Available variables: ${file_path}, ${identifier}, ${date_time}, ${timestamp}

Arguments

Name	Description
<file path>...	Path to image file

FAQ

Why not use a more human-readable format for the timestamp?

Why do we encode the timestamp as 0A1B2C3D4E instead of e.g. 2319-11-21 14:22:59.726? The timestamp represents an unambiguous¹ single point in time, whereas the date string needs to be contextualized with a time zone. That means that you would either need to annotate the date string with a time zone or change the file name every time you are on a system which uses a different time zone.

Apart from that, the former encoding is significantly more compact.

While unfortunately it's not easy to derive the actual date from the encoded timestamp just by looking at it, you can compare two encoded timestamps chronologically by sorting them alphabetically.

¹ ignoring leap-seconds.

Prior Art

The timestamp used in ciid was inspired by ulid.

Name		Name	Last commit message	Last commit date
Latest commit History 60 Commits
.github/workflows		.github/workflows
bin		bin
src		src
tests		tests
.gitignore		.gitignore
Cargo.lock		Cargo.lock
Cargo.toml		Cargo.toml
LICENSE		LICENSE
README.md		README.md
build.rs		build.rs
rust-toolchain		rust-toolchain

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

ciid - Chronological Image Identifier

Installation (via script)

Installation (manually)

Prerequisites

Usage

Flags

Options

Arguments

FAQ

Why not use a more human-readable format for the timestamp?

Prior Art

About

Releases

Packages

Languages

License

pablosichert/ciid

Folders and files

Latest commit

History

Repository files navigation

ciid - Chronological Image Identifier

Installation (via script)

Installation (manually)

Prerequisites

Usage

Flags

Options

Arguments

FAQ

Why not use a more human-readable format for the timestamp?

Prior Art

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages