2017 © Pedro Peláez
 

library php-boiler-pipe

PhpBoilerPipe. Boilerplate Removal and Fulltext Extraction from HTML pages

dotpack/php-boiler-pipe

  • Wednesday, September 21, 2016
  • by lolwat
  • Repository
  • 7 Watchers
  • 14 Stars
  • 1,147 Installations
  • PHP
  • 0 Dependents
  • 0 Suggesters
  • 10 Forks
  • 1 Open issue
  • 1 Version
  • 3 % Grown

The README.md

PhpBoilerPipe

Project Archived

This project is no longer maintained. Please refer to pforret/pf-article-extractor for further updates and continued development.

Thank you for your support!


Boilerplate Removal and Fulltext Extraction from HTML pages.

Partial implementation of https://github.com/kohlschutter/boilerpipe in PHP. Requires PHP >= 5.4.

Example

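```php
<?php
// Assumes the package was installed with Composer, so its autoloader is available.
require __DIR__ . '/vendor/autoload.php';

// html
$path = "http://example.com/some-article.html";
$data = file_get_contents($path);

// code
$ae = new DotPack\PhpBoilerPipe\ArticleExtractor();
echo $ae->getContent($data) . "\n";
```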

The Versions

21/09 2016

dev-master

9999999-dev

MIT

The Development Requires

by Christian Kohlschütter
by Ilia Ermolin

html extract