xparse / recursive-pagination
Recursive Pagination
Installs: 26 826
Dependents: 0
Suggesters: 0
Security: 0
Stars: 1
Watchers: 3
Forks: 3
Language:HTML
Requires
- php: ^7.1
- xparse/element-finder: ^0.5.0
- xparse/parser: ^0.4.0
Requires (Dev)
- phpunit/phpunit: ^7.5
This package is not auto-updated.
Last update: 2024-01-12 06:04:17 UTC
README
Recursive Pagination allows you to parse website pages recursively. You need to pass link from where to start and set next page expression (xPath, css, etc).
Installation
You can install the package via Composer
$ composer require xparse/recursive-pagination
Basic Usage
Try to find all links to the repositories on github. Our query will be xparse
.
With recursive pagination we can traverse all pagination links and process each resulting page to fetch repositories links.
use Xparse\Parser\Parser; use Xparse\RecursivePagination\RecursivePagination; # init Parser $parser = new Parser(); # set expression to pagination links $paginator = new RecursivePagination($parser, "//*[@class='pagination']//a/@href"); # set initial page url $paginator->addToQueue('https://github.com/search?q=xparse'); $allLinks = []; while (($page = $paginator->getNextPage())) { # set expression to fetch repository links $adsList = $page->value("//*[@class='repo-list-name']//a/@href")->getItems(); # merge and remove duplicates $allLinks = array_values(array_unique(array_merge($allLinks, $adsList))); } print_r($allLinks);
Testing
$ vendor/bin/phpunit
Credits
License
The MIT License (MIT). Please see License File for more information.