How do I install Symfony DomCrawler in a Laravel project?

Use Composer: `composer require symfony/dom-crawler`. No additional Laravel-specific setup is needed. The package integrates seamlessly with Laravel’s HTTP client (e.g., `Http::get()`) or Guzzle for fetching content before parsing.

Can DomCrawler handle dynamic JavaScript-rendered content (SPAs)?

No, DomCrawler parses static HTML/XML only. For SPAs, pair it with a headless browser like Playwright or Puppeteer. DomCrawler excels at scraping server-rendered content or APIs returning HTML.

What’s the best way to extract data from a Laravel HTTP response using DomCrawler?

Fetch the response with Laravel’s `Http` facade, then pass the body to DomCrawler: `$crawler = new Crawler(Http::get('url')->body());`. Use CSS selectors (e.g., `$crawler->filter('div.product')->text()`) or XPath for precise targeting.

Does DomCrawler support Laravel’s queue system for batch scraping?

Yes. Dispatch a job (e.g., `ScrapeJob`) with DomCrawler logic, then process it via Laravel Queues. This is ideal for high-volume scraping tasks, like crawling 10,000+ pages daily without blocking requests.

How does DomCrawler compare to `simple_html_dom` for Laravel scraping?

DomCrawler is more maintainable and modern, with built-in CSS/XPath support and Symfony’s testing integration. `simple_html_dom` is deprecated and lacks PHP 8+ optimizations. DomCrawler also avoids jQuery-like syntax, reducing cognitive overhead.

Will DomCrawler work with Laravel’s BrowserKit for functional testing?

Absolutely. DomCrawler is the backbone of Symfony’s BrowserKit, which Laravel can use via `symfony/browser-kit`. Test responses by crawling `$client->get('url')->getCrawler()` for assertions (e.g., checking rendered HTML).

What PHP version is required for full HTML5 parsing support?

PHP 8.4+ is recommended for robust HTML5 parsing (e.g., handling malformed tags). Older versions (7.4+) may require `libxml_use_internal_errors()` for edge cases. Always pin to a stable minor version (e.g., `^8.1`).

How do I handle XML documents with DomCrawler in Laravel?

DomCrawler supports XML natively. Load it like HTML: `$crawler = new Crawler(file_get_contents('data.xml'), 'https://example.com')`. Use XPath (e.g., `$crawler->filterXPath('//book/title')`) for structured data extraction.

Are there performance concerns when scraping large-scale websites?

Test with your expected volume (e.g., 10K+ pages/day) to monitor memory/CPU usage. Cache parsed results in Redis or use Laravel Queues to distribute load. Avoid parsing the same content repeatedly in loops.

How do I mitigate XXE vulnerabilities when parsing untrusted HTML/XML?

Update to Symfony DomCrawler `v8.0.12+` for built-in protections. For extra safety, use `libxml_disable_entity_loader(true)` before parsing untrusted sources. Always validate inputs if scraping user-uploaded content.

Weave Code

Code Weaver

Helps Laravel developers discover, compare, and choose open-source packages. See popularity, security, maintainers, and scores at a glance to make better decisions.

Dom Crawler Laravel Package

symfony/dom-crawler

Symfony DomCrawler makes it easy to parse and navigate HTML/XML using a fluent, CSS/XPath-friendly API. Ideal for scraping, testing responses, and extracting data from markup, with smooth integration across the Symfony ecosystem.

View on GitHub

Deep Wiki

Context7

Eases DOM navigation for HTML and XML documents

Frequently asked questions about Dom Crawler

How do I install Symfony DomCrawler in a Laravel project?: Use Composer: `composer require symfony/dom-crawler`. No additional Laravel-specific setup is needed. The package integrates seamlessly with Laravel’s HTTP client (e.g., `Http::get()`) or Guzzle for fetching content before parsing.
Can DomCrawler handle dynamic JavaScript-rendered content (SPAs)?: No, DomCrawler parses static HTML/XML only. For SPAs, pair it with a headless browser like Playwright or Puppeteer. DomCrawler excels at scraping server-rendered content or APIs returning HTML.
What’s the best way to extract data from a Laravel HTTP response using DomCrawler?: Fetch the response with Laravel’s `Http` facade, then pass the body to DomCrawler: `$crawler = new Crawler(Http::get('url')->body());`. Use CSS selectors (e.g., `$crawler->filter('div.product')->text()`) or XPath for precise targeting.
Does DomCrawler support Laravel’s queue system for batch scraping?: Yes. Dispatch a job (e.g., `ScrapeJob`) with DomCrawler logic, then process it via Laravel Queues. This is ideal for high-volume scraping tasks, like crawling 10,000+ pages daily without blocking requests.
How does DomCrawler compare to `simple_html_dom` for Laravel scraping?: DomCrawler is more maintainable and modern, with built-in CSS/XPath support and Symfony’s testing integration. `simple_html_dom` is deprecated and lacks PHP 8+ optimizations. DomCrawler also avoids jQuery-like syntax, reducing cognitive overhead.
Will DomCrawler work with Laravel’s BrowserKit for functional testing?: Absolutely. DomCrawler is the backbone of Symfony’s BrowserKit, which Laravel can use via `symfony/browser-kit`. Test responses by crawling `$client->get('url')->getCrawler()` for assertions (e.g., checking rendered HTML).
What PHP version is required for full HTML5 parsing support?: PHP 8.4+ is recommended for robust HTML5 parsing (e.g., handling malformed tags). Older versions (7.4+) may require `libxml_use_internal_errors()` for edge cases. Always pin to a stable minor version (e.g., `^8.1`).
How do I handle XML documents with DomCrawler in Laravel?: DomCrawler supports XML natively. Load it like HTML: `$crawler = new Crawler(file_get_contents('data.xml'), 'https://example.com')`. Use XPath (e.g., `$crawler->filterXPath('//book/title')`) for structured data extraction.
Are there performance concerns when scraping large-scale websites?: Test with your expected volume (e.g., 10K+ pages/day) to monitor memory/CPU usage. Cache parsed results in Redis or use Laravel Queues to distribute load. Avoid parsing the same content repeatedly in loops.
How do I mitigate XXE vulnerabilities when parsing untrusted HTML/XML?: Update to Symfony DomCrawler `v8.0.12+` for built-in protections. For extra safety, use `libxml_disable_entity_loader(true)` before parsing untrusted sources. Always validate inputs if scraping user-uploaded content.

Popularity trends

Recorded values over time (once-a-day snapshots). Jun 22, 2026 – Jul 21, 2026

GitHub · stars

GitHub · forks

GitHub · watchers

Packagist · monthly downloads

View on GitHub

Stars

4,035

Favorites

4,118

Forks

124

Score

54.9

Score breakdown

Sum of components, capped 0–100. Halved if archived.

Stars

input: 4035

+20.2
Forks

input: 124

+3.7
Open issues + PRs

input: 0

+0.0
Releases

input: 40

+12.0
Recency

input: 24

+18.7
Issue opportunity

input: 0

+0.0
Laravel News mentions

input: 0

+0.0
Dependents

input: 0

+0.0

Total 54.6

Opportunity

51.9

Opportunity score breakdown

Hidden gem signal × 0.65 + contribution need × 0.35, scaled by health factor.

Hidden gem

log(monthly_downloads / (stars + 1)) × 25

80.5
Contribution need

open_issues + open_prs: 0

0.0
Health factor

archived + recency + open issues

×0.99

Total 51.8

License

MIT

Last release

Jun 27, 2026

Watchers

Downloads

7M/mo

Dependents

Open issues

Weaver

How can I help you explore Laravel packages today?

Conversation history is not saved when not logged in.

Add packages to context

No packages found.

codraw/framework-extra-bundle

codraw/messenger

codraw/security

codraw/mailer

codraw/contracts

codraw/profiling

codraw/dependency-injection

codraw/tester

codraw/core

nexmo/api-specification

capell-app/block-library

axium/identity

cetria/laravel-dummy-models

cetria/reflection-helper

agropredict/sso-auth-bundle

evolvestudio/spam-protection

datacore/hub-sdk

develia/commons

cuci/prototurk-sdk

cuci/prototurk-sdk-symfony