 

imarc/crawler

crawl a site and get all the links

  • Wednesday, November 1, 2017
  • by chantron
  • Repository
  • 4 Watchers
  • 1 Stars
  • 13 Installations
  • PHP
  • 0 Dependents
  • 0 Suggesters
  • 0 Forks
  • 0 Open issues
  • 3 Versions
  • 0% Grown

The README.md

Crawler

Crawls a website and does something with each URL it finds. By default it writes each URL on its own line in a text file.

This could fairly easily be extended to do many other things; you would just need to create a new Observer (see src/Observer), as in the sketch below.
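A minimal sketch of what such an Observer might look like, in plain PHP. The namespace and the handle()/finish() hook names are assumptions for illustration only; check the classes under src/Observer for the real contract.

<?php
// Hypothetical custom Observer that collects crawled URLs and writes
// them out as JSON instead of the default one-URL-per-line text file.
// The hook names below (handle, finish) are assumptions; adapt them
// to whatever the classes under src/Observer actually define.

namespace Imarc\Crawler\Observer;

class JsonObserver
{
    /** @var string[] URLs seen so far. */
    private $urls = [];

    /** @var string Path of the output file. */
    private $destination;

    public function __construct($destination)
    {
        $this->destination = $destination;
    }

    // Called once for every URL the crawler discovers (assumed hook).
    public function handle($url)
    {
        $this->urls[] = $url;
    }

    // Called when the crawl completes (assumed hook): flush to disk.
    public function finish()
    {
        file_put_contents(
            $this->destination,
            json_encode($this->urls, JSON_PRETTY_PRINT)
        );
    }
}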

install

composer require imarc/crawler

usage

From your project's directory: ./vendor/bin/crawler csv URL DESTINATION

From the repo: ./crawler.php csv URL DESTINATION
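For example, to crawl a site and write the links to links.csv (the URL and file name here are placeholders):

./vendor/bin/crawler csv https://example.com links.csv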

options

crawler --help

Usage:
  csv [options] [--] <url> <destination>

Arguments:
  url                    URL to crawl.
  destination            Write CSV to FILE

Options:
  -s, --show-progress    Show the crawl's progress
  -e, --crawl-external   Crawl external URLs
  -q, --quiet            Do not output any message
      --exclude=EXCLUDE  Exclude certain extensions [default: ["css","gif","ico","jpg","js","pdf","png","rss","txt"]] (multiple values allowed)
  -h, --help             Display this help message
  -V, --version          Display this application version
      --ansi             Force ANSI output
      --no-ansi          Disable ANSI output
  -n, --no-interaction   Do not ask any interactive question
  -v|vv|vvv, --verbose   Increase the verbosity of messages: 1 for normal output, 2 for more verbose output and 3 for debug
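For instance, to display progress and also follow links to external sites (again with a placeholder URL and destination):

./vendor/bin/crawler csv --show-progress --crawl-external https://example.com links.csv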

tests

codecept run

The Versions

  • 01/11 2017
  • 01/11 2017
  • 16/10 2017