HTML Cleaner
Clean HTML by removing extra whitespace, empty lines, and optionally tags to extract text.
Clean messy HTML in your browser. Paste markup that has extra spaces, blank lines, or unwanted comments and get a tidier result back. Optionally strip every tag to extract the plain text from a page or email template.
Cleaning options
- Remove HTML comments – strips
<!-- ... -->blocks (including multi-line) so the output stays focused on visible markup. - Collapse repeated spaces – squashes runs of spaces or tabs to a single space without touching newlines.
- Collapse blank lines – keeps at most one empty line between blocks instead of three or four.
- Trim trailing whitespace – removes spaces and tabs at the end of every line.
- Remove all blank lines – flattens the result to a continuous block of non-empty lines.
- Strip HTML tags – removes every element to leave plain text. HTML entities such as
and&are decoded automatically.
When to use HTML Cleaner
- Pasting copy from CMS exports or rich-text editors that smuggle stray whitespace.
- Pulling plain text out of email templates to feed search indexes or analytics.
- Tidying scraped HTML before diffing or storing it in source control.
- Quickly checking what an article would look like with tags removed for word-count or readability tools.
How it differs from a minifier
An HTML minifier removes every avoidable byte (including whitespace inside tags and around text) so machines load it faster. The cleaner keeps the document human-readable: indentation between blocks remains, but repeated whitespace and noise go away. Pick the cleaner when you still want to read the result, and a minifier when you want to ship the smallest payload.
Privacy
All processing happens in your browser. No HTML is uploaded or stored. Closing the tab discards every value.
Frequently Asked Questions
Does it remove inline styles and scripts?
Only when the “Strip HTML tags” option is enabled. With that option off, tags and attributes are kept untouched and only whitespace and comments are normalized.
Are HTML entities decoded?
Entities like , &, and numeric escapes are decoded only when “Strip HTML tags” is on so the text extract reads naturally. With tags kept, entities stay as written.
Will this break my <pre> blocks?
The cleaner collapses spaces and trailing whitespace globally, so significant whitespace inside <pre> or <textarea> may be affected. Disable “Collapse repeated spaces” when preserving code blocks matters.
Can I extract just the visible text?
Yes. Enable “Strip HTML tags” and the output becomes plain text. The download button switches to cleaned.txt so you get a sensible file extension.
Is large HTML supported?
The cleaner runs as a single synchronous regex pass per option. Documents up to a few megabytes process instantly in modern browsers; for very large inputs split them into sections.
Related tools
Your recent visits