league/commonmark
Highly extensible PHP Markdown parser supporting full CommonMark and GitHub-Flavored Markdown. Convert Markdown to HTML with simple converters, customize rendering via extensions, and run safely with options like stripping HTML and blocking unsafe links.
layout: default title: Security description: How to configure league/commonmark against possible security issues when handling untrusted user input redirect_from:
In order to be fully compliant with the CommonMark spec, certain security settings are disabled by default. You will want to configure these settings if untrusted users will be providing the Markdown content:
html_input: How to handle raw HTMLallow_unsafe_links: Whether unsafe links are permittedmax_nesting_level: Protect against long render times or segfaultsmax_delimiters_per_line: Protect against long parse times or rendering segfaultsFurther information about each option can be found below.
All HTML input is unescaped by default. This behavior ensures that league/commonmark is 100% compliant with the CommonMark spec.
If you're developing an application which renders user-provided Markdown from potentially untrusted users, you are strongly encouraged to set the html_input option in your configuration to either escape or strip:
use League\CommonMark\CommonMarkConverter;
$converter = new CommonMarkConverter(['html_input' => 'escape']);
echo $converter->convert('<script>alert("Hello XSS!");</script>');
// <script>alert("Hello XSS!");</script>
use League\CommonMark\CommonMarkConverter;
$converter = new CommonMarkConverter(['html_input' => 'strip']);
echo $converter->convert('<script>alert("Hello XSS!");</script>');
// (empty output)
Failing to set this option could make your site vulnerable to cross-site scripting (XSS) attacks!
See the configuration section for more information.
Unsafe links are also allowed by default due to CommonMark spec compliance. An unsafe link is one that uses any of these protocols:
javascript:vbscript:file:data: (except for data:image in png, gif, jpeg, or webp format)To prevent these from being parsed and rendered, you should set the allow_unsafe_links option to false.
No maximum nesting level is enforced by default. Markdown content which is too deeply-nested (like 10,000 nested blockquotes: '> > > > > ...') could result in long render times or segfaults.
If you need to parse untrusted input, consider setting a reasonable max_nesting_level (perhaps 10-50) depending on your needs. Once this nesting level is hit, any subsequent Markdown will be rendered as plain text.
use League\CommonMark\CommonMarkConverter;
$markdown = str_repeat('> ', 10000) . ' Foo';
$converter = new CommonMarkConverter(['max_nesting_level' => 5]);
echo $converter->convert($markdown);
// <blockquote>
// <blockquote>
// <blockquote>
// <blockquote>
// <blockquote>
// <p>> > > > > > > ... Foo</p></blockquote>
// </blockquote>
// </blockquote>
// </blockquote>
// </blockquote>
See the configuration section for more information.
Similarly to the maximum nesting level, no maximum number of delimiters per line is enforced by default. Delimiters can be nested (like *a **b** c*) or un-nested (like *a* *b* *c*) - in either case, having too many in a single line can result in long parse times. We therefore have a separate option to limit the number of delimiters per line.
If you need to parse untrusted input, consider setting a reasonable max_delimiters_per_line (perhaps 100-1000) depending on your needs. Once this level is hit, any subsequent delimiters on that line will be rendered as plain text.
use League\CommonMark\CommonMarkConverter;
$markdown = '*a* **b *c **d** c* b**'; // 8 delimiters (* and **)
$converter = new CommonMarkConverter(['max_delimiters_per_line' => 6]);
echo $converter->convert($markdown);
// <p><em>a</em> **b *c <strong>d</strong> c* b**</p>
Although this library does offer these security features out-of-the-box, some users may opt to also run the HTML output through additional filtering layers (like HTMLPurifier). If you do this, make sure you thoroughly test your additional post-processing steps and configure them to work properly with the types of HTML elements and attributes that converted Markdown might produce, otherwise, you may end up with weird behavior like missing images, broken links, mismatched HTML tags, etc.
How can I help you explore Laravel packages today?