Charcuterie: A Visual Unicode Similarity Explorer for Finding Confusable Characters

Available in: 中文
2026-04-09T23:12:19.899Z·2 min read
Charcuterie is an interactive web tool that helps users explore visual similarity between Unicode characters — finding lookalikes that could be used for spoofing, homograph attacks, or creative typ...

Charcuterie: Visual Explorer for Unicode Character Similarity and Confusability

Charcuterie is an interactive web tool that helps users explore visual similarity between Unicode characters — finding lookalikes that could be used for spoofing, homograph attacks, or creative typography. The project has gained 70 points on Hacker News with 10 comments.

The Problem It Solves

Unicode contains over 149,000 characters, many of which look identical or nearly identical to each other:

These confusable characters are exploited in:

How Charcuterie Works

The tool provides:

Real-World Applications

  1. Security: Identify potential homograph attacks in domains and URLs
  2. Font design: Understand which characters need distinct glyphs
  3. Localization: Discover encoding issues caused by confusable characters
  4. Typography: Explore the richness of the Unicode character set
  5. Data cleaning: Find and fix character substitution errors in datasets

The Name

Charcuterie is a French term for cooked meats — a playful reference to the tool slicing and examining the character set.

Technical Background

Unicode confusability is formally defined in the Unicode Security Mechanism specification (UAX #39), which provides algorithms for identifying confusable characters. Charcuterie makes this data accessible through a visual interface rather than technical documentation.

Source: elastiq.ch / HN — 70 points, 10 comments

↗ Original source · 2026-04-09T10:00:00.000Z
← Previous: Reverse Engineering Gemini SynthID: How Google Watermarks AI-Generated TextNext: Microsoft Is Using Dark Patterns to Pressure Users Into Paying for Cloud Storage →
Comments0