My blog post about a different wordlist deduplication approach
-
Here, I try to give some context for why a continuous, monolithic "mash everything into one file" wordlist is an understandable approach, but one that can be significantly improved for some use cases by thinking about the problem differently. This is not news to many of you, but I wanted to provide a reference for newcomers.
Mastodon post:
https://infosec.exchange/@tychotithonus/114361777520791358
Direct link:
https://blog.techsolvency.com/2025/04/managing-unique-wordlists-password-cracking.html
Corrections to form or content welcome! The blog is on the ancient Blogger platform, whose WYSIWYG editor has some pretty ugly markup behaviors that make Word's back-end markup look like a cakewalk. So some of the formatting irregularities are harder to fix than they might appear.