library crawler

hanson/crawler

  • Saturday, June 30, 2018
  • by hanson
  • Repository
  • 1 Watchers
  • 5 Stars
  • 8 Installations
  • PHP
  • 0 Dependents
  • 0 Suggesters
  • 1 Forks
  • 0 Open issues
  • 13 Versions
  • 0 % Grown

The README.md

crawler

An easy package for crawling a site's list and detail pages.

Installation

composer require hanccc/crawler

Usage

This package requires Goutte. The DOM is available through $this->crawler(); in both the list crawler and the detail crawler.
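To make the kind of DOM lookup the crawlers perform concrete, here is a minimal sketch that uses only PHP's built-in DOM extension as a stand-in for Goutte. The function name extractTitle and the sample HTML are assumptions made for illustration; they are not part of this package or of Goutte.

```php
<?php
// Hypothetical stand-in for a Goutte lookup such as filter('title')->text().
// Uses only ext-dom, so it runs without the package installed.
function extractTitle(string $html): ?string
{
    $doc = new DOMDocument();

    // Real-world HTML is often not well-formed; keep libxml warnings quiet.
    $previous = libxml_use_internal_errors(true);
    $doc->loadHTML($html);
    libxml_clear_errors();
    libxml_use_internal_errors($previous);

    $nodes = $doc->getElementsByTagName('title');
    if ($nodes->length === 0) {
        return null; // no <title> element in the document
    }

    return trim($nodes->item(0)->textContent);
}

// Sample markup, invented for this example.
$html = '<html><head><title> Example Domain </title></head><body></body></html>';
echo extractTitle($html), PHP_EOL;
```

Inside a real crawler class you would use the Goutte-backed DOM the package provides instead of parsing the HTML yourself.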

Example

        //or $listCrawler = new ExampleListCrawler(storage_path('logs'));
        $listCrawler = new ExampleListCrawler('http://example.com', storage_path('logs'));
        $listCrawler->setDetailCrawler(new ExampleDetailCrawler());
        $listCrawler->start();

ListCrawler

class ExampleListCrawler extends ListCrawler{
    public $url = 'http://example.com';

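    // return the list URL for a given page number
    public function getEachPageUrl($page)
    {
        return 'http://example.com/list&page=' . $page;
    }

    // set the maximum number of pages to crawl
    public function setMaxPage()
    {
        // Placeholder value: replace it with the real page count,
        // e.g. one parsed from the site's pagination.
        $num = 10;
        $this->maxPage = $num;
    }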
}
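As a rough illustration of how a per-page URL builder and a maximum page number fit together, the sketch below expands them into the full set of list URLs. The helper buildPageUrls is hypothetical, and it assumes pages are numbered from 1 up to the maximum; the package's own loop may work differently.

```php
<?php
// Hypothetical helper, not part of the package: expands a per-page URL
// builder into the list of page URLs, assuming pages run from 1 to $maxPage.
function buildPageUrls(callable $getEachPageUrl, int $maxPage): array
{
    $urls = [];
    for ($page = 1; $page <= $maxPage; $page++) {
        $urls[] = $getEachPageUrl($page);
    }

    return $urls;
}

// Same URL pattern as ExampleListCrawler::getEachPageUrl().
$urls = buildPageUrls(
    function (int $page): string {
        return 'http://example.com/list&page=' . $page;
    },
    3
);

print_r($urls);
```

A maximum of 0 yields an empty array, so nothing would be crawled until setMaxPage() supplies a positive number.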

DetailCrawler

class ExampleDetailCrawler extends DetailCrawler{

    // Returns true when $url points to a detail page
    public function isDetailUrl($url)
    {
        return preg_match('/example\.com\/id(\d+)/', $url) === 1;
    }

    // what to do with each detail page
    public function handle()
    {
        echo $this->crawler->filter('title')->text();
    }
}
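The URL check can be tried on its own, outside the class. The sketch below applies the same pattern to a few invented links and keeps only the ones that look like detail pages; the standalone isDetailUrl function and the sample links are assumptions made for illustration.

```php
<?php
// Standalone version of the detail-URL check, for experimenting outside the class.
// preg_match() returns 1 on a match, 0 on no match, and false on error,
// so comparing with 1 always yields a boolean.
function isDetailUrl(string $url): bool
{
    return preg_match('/example\.com\/id(\d+)/', $url) === 1;
}

// Sample links, invented for this example.
$links = [
    'http://example.com/id42',
    'http://example.com/about',
    'http://example.com/id7?ref=list',
];

// Keep only the links that look like detail pages.
$detailLinks = array_values(array_filter($links, 'isDetailUrl'));

print_r($detailLinks);
```

Escaping the dot in example\.com keeps the pattern from matching unrelated hosts where any other character sits in that position.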

License

Crawler is open-source software licensed under the MIT license.

The Versions

Every version is MIT-licensed and published by hanson. Dates are day/month year; the normalized version is shown in parentheses.

  • dev-master (9999999-dev), listed under 30/06 2018 and 12/07 2016
  • 0.1.x-dev (0.1.9999999.9999999-dev), 05/06 2016
  • 0.2.x-dev (0.2.9999999.9999999-dev), 05/06 2016
  • 0.2.5 (0.2.5.0), 05/06 2016
  • 0.2.51 (0.2.51.0), 05/06 2016
  • 0.2.4 (0.2.4.0), 04/06 2016
  • 0.2.3 (0.2.3.0), 27/05 2016
  • 0.2.2 (0.2.2.0), 27/05 2016
  • 0.2.1 (0.2.1.0), 21/05 2016
  • 0.2.0 (0.2.0.0), 21/05 2016
  • 0.1.1 (0.1.1.0), 20/05 2016
  • 0.1.0 (0.1.0.0), 19/05 2016