2017 © Pedro Peláez
 

library pdf-to-text

Extract text from a pdf file using pdf-to-text binary.

image

jbpapp/pdf-to-text

Extract text from a pdf file using pdf-to-text binary.

  • Monday, April 4, 2016
  • by jbpapp
  • Repository
  • 1 Watchers
  • 0 Stars
  • 281 Installations
  • PHP
  • 0 Dependents
  • 0 Suggesters
  • 1 Forks
  • 2 Open issues
  • 3 Versions
  • 9 % Grown

The README.md

Extract text from a pdf

Read PDF files with PHP 5.6 (based on spatie/pdf-to-text package), (*1)

This package is a PHP 5.6+ fork of Spatie PDF To Text package. If you use PHP7, please use the original package., (*2)

This package provides a class to extract text from a pdf., (*3)

 \JBPapp\PdfToText\Pdf::getText('book.pdf'); //returns the text from the pdf

Requirements

Behind the scenes this package leverages pdftotext. You can verify if the binary installed on your system by issueing this command:, (*4)

which pdftotext

If it is installed it will return the path to the binary., (*5)

To install the binary you can use this command on Ubuntu or Debian:, (*6)

apt-get install poppler-utils

If you're on RedHat or CentOS use this:, (*7)

yum install poppler-utils

Installation

You can install the package via composer:, (*8)

$ composer require spatie/pdf-to-text

Usage

Extracting text from a pdf is easy., (*9)

$text = (new Pdf())
    ->setPdf('book.pdf')
    ->text();

Or easier:, (*10)

 \JBPapp\PdfToText\Pdf::getText('book.pdf')

By default the package will assume that the pdftotext is located at /usr/bin/pdftotext. If you're using the a different location pass the path to the binary in constructor, (*11)

$text = (new Pdf('/custom/path/to/pdftotext'))
    ->setPdf('book.pdf')
    ->text();

or as the second parameter to the getText-function:, (*12)

 \JBPapp\PdfToText\Pdf::getText('book.pdf', '/custom/path/to/pdftotext')

Change log

Please see CHANGELOG for more information what has changed recently., (*13)

Testing

bash $ composer test, (*14)

Contributing

Please see CONTRIBUTING for details., (*15)

Security

If you discover any security related issues, please email freek@spatie.be instead of using the issue tracker., (*16)

Credits

License

The MIT License (MIT). Please see License File for more information., (*17)

The Versions

04/04 2016

dev-master

9999999-dev https://github.com/jbpapp/pdf-to-text

Extract text from a pdf file using pdf-to-text binary.

  Sources   Download

MIT

The Requires

 

The Development Requires

pdf pdf-to-text

04/04 2016

1.1

1.1.0.0 https://github.com/jbpapp/pdf-to-text

Extract text from a pdf file using pdf-to-text binary.

  Sources   Download

MIT

The Requires

 

The Development Requires

pdf pdf-to-text

01/03 2016

1.0

1.0.0.0 https://github.com/jbpapp/pdf-to-text

Extract text from a pdf file using pdf-to-text binary.

  Sources   Download

MIT

The Requires

 

The Development Requires

pdf pdf-to-text