public class Cleaner extends Object
The HTML cleaner parses the input as HTML and then runs it through a safe-list, so the output HTML can only contain HTML that is allowed by the safelist.
It is assumed that the input HTML is a body fragment; the clean methods only pull from the source's body, and the canned safe-lists only allow body contained tags.
Rather than interacting directly with a Cleaner object, generally see the clean methods in Jsoup.
| Constructor and Description |
|---|
Cleaner(Safelist safelist)
Create a new cleaner, that sanitizes documents using the supplied safelist.
|
Cleaner(Whitelist whitelist)
Deprecated.
as of 1.14.1.
|
| Modifier and Type | Method and Description |
|---|---|
Document |
clean(Document dirtyDocument)
Creates a new, clean document, from the original dirty document, containing only elements allowed by the safelist.
|
boolean |
isValid(Document dirtyDocument)
Determines if the input document bodyis valid, against the safelist.
|
boolean |
isValidBodyHtml(String bodyHtml) |
public Cleaner(Safelist safelist)
safelist - safe-list to clean with@Deprecated public Cleaner(Whitelist whitelist)
Cleaner(Safelist) instead.public Document clean(Document dirtyDocument)
body are used. The
OutputSettings of the original document are cloned into the clean document.dirtyDocument - Untrusted base document to clean.public boolean isValid(Document dirtyDocument)
head.
This method can be used as a validator for user input. An invalid document will still be cleaned successfully
using the clean(Document) document. If using as a validator, it is recommended to still clean the document
to ensure enforced attributes are set correctly, and that the output is tidied.
dirtyDocument - document to testpublic boolean isValidBodyHtml(String bodyHtml)
Copyright © 2009–2025 Jonathan Hedley. All rights reserved.