Read PDF files with PHP 5.6 (based on spatie/pdf-to-text package), (*1)
This package is a PHP 5.6+ fork of Spatie PDF To Text package. If you use PHP7, please use the original package., (*2)
This package provides a class to extract text from a pdf., (*3)
\JBPapp\PdfToText\Pdf::getText('book.pdf'); //returns the text from the pdf
Requirements
Behind the scenes this package leverages pdftotext. You can verify if the binary installed on your system by issueing this command:, (*4)
which pdftotext
If it is installed it will return the path to the binary., (*5)
To install the binary you can use this command on Ubuntu or Debian:, (*6)
apt-get install poppler-utils
If you're on RedHat or CentOS use this:, (*7)
yum install poppler-utils
Installation
You can install the package via composer:, (*8)
$ composer require spatie/pdf-to-text
Usage
Extracting text from a pdf is easy., (*9)
$text = (new Pdf())
->setPdf('book.pdf')
->text();
Or easier:, (*10)
\JBPapp\PdfToText\Pdf::getText('book.pdf')
By default the package will assume that the pdftotext
is located at /usr/bin/pdftotext
.
If you're using the a different location pass the path to the binary in constructor, (*11)
$text = (new Pdf('/custom/path/to/pdftotext'))
->setPdf('book.pdf')
->text();
or as the second parameter to the getText
-function:, (*12)
\JBPapp\PdfToText\Pdf::getText('book.pdf', '/custom/path/to/pdftotext')
Change log
Please see CHANGELOG for more information what has changed recently., (*13)
Testing
bash
$ composer test
, (*14)
Contributing
Please see CONTRIBUTING for details., (*15)
Security
If you discover any security related issues, please email freek@spatie.be instead of using the issue tracker., (*16)
Credits
License
The MIT License (MIT). Please see License File for more information., (*17)