Question 1

When is MD5 acceptable to use?

Accepted Answer

Cache deduplication keys, ETag generation, legacy checksum verification, content-addressed storage where the producer is trusted, and forensic comparison against historical evidence. In all these contexts, an attacker cannot influence the inputs to produce a collision.

Question 2

When should I never use MD5?

Accepted Answer

Password storage, digital signatures, certificate fingerprints, code signing, JWT signing, HMAC keys, anywhere an attacker controls either input. The 2008 chosen-prefix attack lets attackers create two different inputs with the same MD5 in seconds — devastating for any security claim that depends on MD5 being a cryptographic hash.

Question 3

What is the output length?

Accepted Answer

16 bytes — 32 hex characters or 24 Base64 characters (with padding) / 22 Base64URL characters (without padding). Half the size of SHA-256.

Question 4

What is the maximum input size?

Accepted Answer

5 MiB for files in this browser tool. The MD5 algorithm itself has no upper bound; for larger inputs use `md5sum` on Linux, `md5` on macOS, `Get-FileHash -Algorithm MD5` on Windows.

Question 5

What does "collision resistance broken" mean?

Accepted Answer

It means an attacker can construct two different inputs that produce the same MD5 hash. The 2004 Wang attack and the 2008 chosen-prefix attack made this routine. Any security property that depends on "if hashes match, the inputs are equal" is invalid for MD5 — but properties that depend on "the same input produces the same hash" still hold.

Question 6

MD5 vs CRC32 for cache keys?

Accepted Answer

MD5 has 128 bits of output; CRC32 has 32 bits. CRC32 is faster but collides much more often: with ~65,000 entries you have a 50% chance of any collision. MD5 is fine for billions of keys without collision concerns. CRC32 is appropriate for small caches and short-lived deduplication; MD5 for anything larger.

Question 7

How do I replace MD5 in legacy code?

Accepted Answer

For security-relevant code: replace with SHA-256 (or migrate passwords to bcrypt / argon2). For non-adversarial code (cache keys, ETags): leave it alone — switching to SHA-256 just makes things slower without improving the property you actually need. The right replacement depends on what the MD5 was guarding.

Question 8

Where is MD5 specified?

Accepted Answer

RFC 1321 (1992, Ronald Rivest). The Wang collision attack was published at EUROCRYPT 2005 and the chosen-prefix attack by Stevens et al. at CRYPTO 2009. NIST formally deprecated MD5 for security applications in 2010.

MD5 Hash Generator

About the MD5 Generator

Related Tools

How to use the MD5 generator

Common use cases (non-adversarial only)

Privacy and security

Tips and pitfalls

Frequently Asked Questions