Extract and decompose (fuzzy) URLs (including emails, which are conceptually a part of URLs) in texts with Area-Pattern-based modularity
Updated Jan 26, 2025 - TypeScript
Turn any messy URL into a clean canonical https:// string — or null if invalid. Handles bare domains, www/port stripping, query sorting, punycode, domain extraction, and display-friendly humanization. 980B brotli. TypeScript. ESM + CJS.
URL normalizer to canonicalize (standardize) the text representation of a URL to determine if differently-formatted URLs are identical
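To illustrate what canonicalizing a URL for comparison can look like, here is a minimal sketch built on the standard WHATWG `URL` class. The function name `normalizeUrl` and the specific rules (force `https`, strip a leading `www.`, drop the fragment, sort query parameters, trim trailing slashes) are assumptions made for this example and are not the API or behavior of any repository listed here.

```typescript
// Hypothetical helper for illustration only; not taken from any listed library.
function normalizeUrl(input: string): string | null {
  const trimmed = input.trim();
  if (trimmed === "") return null;

  // Assumption: a bare domain such as "example.com/path" is treated as https.
  const hasScheme = /^[a-z][a-z0-9+.-]*:\/\//i.test(trimmed);
  const candidate = hasScheme ? trimmed : `https://${trimmed}`;

  let url: URL;
  try {
    url = new URL(candidate);
  } catch {
    return null; // Not parseable as a URL.
  }
  if (url.protocol !== "http:" && url.protocol !== "https:") return null;

  // The URL parser already lowercases the host; strip a leading "www.".
  const host = url.hostname.replace(/^www\./, "");

  // url.port is empty for the scheme's default port; also drop 443,
  // because the output is always https.
  const port = url.port !== "" && url.port !== "443" ? `:${url.port}` : "";

  // Trim trailing slashes, but keep the root path as "/".
  const path = url.pathname.replace(/\/+$/, "") || "/";

  // Sort query parameters by name so ordering differences disappear.
  url.searchParams.sort();
  const query = url.searchParams.toString();

  // The fragment is dropped on purpose.
  return `https://${host}${port}${path}${query ? `?${query}` : ""}`;
}

// Two differently formatted inputs that normalize to the same string.
const a = normalizeUrl("WWW.Example.com/path/?b=2&a=1#frag");
const b = normalizeUrl("http://example.com:80/path?a=1&b=2");
console.log(a === b); // true, both are "https://example.com/path?a=1&b=2"
```

Real normalizers differ on which of these rules are safe to apply: stripping `www.` or forcing `https` can merge URLs that a server actually treats as distinct, so each rule is best kept optional.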
🔗 Pathor is a PHP library for normalizing, analyzing, and comparing URLs.