2017 © Pedro Peláez
 

library pdfparser

Pdf parser library. Can read and extract information from pdf file.

image

smalot/pdfparser

Pdf parser library. Can read and extract information from pdf file.

  • Saturday, June 23, 2018
  • by smalot
  • Repository
  • 66 Watchers
  • 867 Stars
  • 878,978 Installations
  • PHP
  • 43 Dependents
  • 3 Suggesters
  • 269 Forks
  • 106 Open issues
  • 34 Versions
  • 10 % Grown

The README.md

PDF parser

Version CI CS Scrutinizer Code Quality Downloads, (*1)

The smalot/pdfparser is a standalone PHP package that provides various tools to extract data from PDF files., (*2)

This library is under active maintenance. There is no active development by the author of this library (at the moment), but we welcome any pull request adding/extending functionality!, (*3)

Features

  • Load/parse objects and headers
  • Extract metadata (author, description, ...)
  • Extract text from ordered pages
  • Support of compressed PDFs
  • Support of MAC OS Roman charset encoding
  • Handling of hexa and octal encoding in text sections
  • Create custom configurations (see CustomConfig.md).

Currently, secured documents and extracting form data are not supported., (*4)

License

This library is under the LGPLv3 license., (*5)

Install

This library requires PHP 7.1+ since v1. You can install it via Composer:, (*6)

composer require smalot/pdfparser

In case you can't use Composer, you can include alt_autoload.php-dist. It will include all required files automatically., (*7)

Quick example

<?php

// Parse PDF file and build necessary objects.
$parser = new \Smalot\PdfParser\Parser();
$pdf = $parser->parseFile('/path/to/document.pdf');

$text = $pdf->getText();
echo $text;

Further usage information can be found here., (*8)

Documentation

Documentation can be found in the doc folder., (*9)

The Versions

23/06 2018

dev-master

9999999-dev http://www.pdfparser.org

Pdf parser library. Can read and extract information from pdf file.

  Sources   Download

LGPL-3.0 LGPLv3

The Requires

 

The Development Requires

parser text pdf extract parse

23/06 2018

v0.13.2

0.13.2.0 http://www.pdfparser.org

Pdf parser library. Can read and extract information from pdf file.

  Sources   Download

LGPL-3.0

The Requires

 

The Development Requires

parser text pdf extract parse

22/06 2018

v0.13.1

0.13.1.0 http://www.pdfparser.org

Pdf parser library. Can read and extract information from pdf file.

  Sources   Download

LGPL-3.0-or-later

The Requires

 

The Development Requires

parser text pdf extract parse

22/06 2018

v0.13.0

0.13.0.0 http://www.pdfparser.org

Pdf parser library. Can read and extract information from pdf file.

  Sources   Download

LGPLv3

The Requires

 

The Development Requires

parser text pdf extract parse

16/03 2018

v0.12.0

0.12.0.0 http://www.pdfparser.org

Pdf parser library. Can read and extract information from pdf file.

  Sources   Download

LGPLv3

The Requires

 

The Development Requires

parser text pdf extract parse

14/09 2017

v0.11

0.11.0.0 http://www.pdfparser.org

Pdf parser library. Can read and extract information from pdf file.

  Sources   Download

LGPLv3

The Requires

 

The Development Requires

parser text pdf extract parse

03/01 2017

v0.10.0

0.10.0.0 http://www.pdfparser.org

Pdf parser library. Can read and extract information from pdf file.

  Sources   Download

LGPLv3

The Requires

 

The Development Requires

parser text pdf extract parse

05/11 2016

v0.9.26

0.9.26.0 http://www.pdfparser.org

Pdf parser library. Can read and extract information from pdf file.

  Sources   Download

GPL-3.0

The Requires

 

The Development Requires

18/09 2015

v0.9.25

0.9.25.0 http://www.pdfparser.org

Pdf parser library. Can read and extract information from pdf file.

  Sources   Download

GPL-3.0

The Requires

 

The Development Requires

26/05 2015

v0.9.24

0.9.24.0 http://www.pdfparser.org

Pdf parser library. Can read and extract information from pdf file.

  Sources   Download

GPL-3.0

The Requires

 

The Development Requires

18/03 2015

v0.9.23

0.9.23.0 http://www.pdfparser.org

Pdf parser library. Can read and extract information from pdf file.

  Sources   Download

GPL-3.0

The Requires

 

The Development Requires

21/01 2015

v0.9.22

0.9.22.0 http://www.pdfparser.org

Pdf parser library. Can read and extract information from pdf file.

  Sources   Download

GPL-3.0

The Requires

 

The Development Requires

07/07 2014

v0.9.21

0.9.21.0 http://www.pdfparser.org

Pdf parser library. Can read and extract information from pdf file.

  Sources   Download

GPL-3.0

The Requires

 

The Development Requires

17/02 2014

v0.9.20

0.9.20.0 http://www.pdfparser.org

Pdf parser library. Can read and extract information from pdf file.

  Sources   Download

GPL-3.0

The Requires

 

The Development Requires

30/01 2014

v0.9.19

0.9.19.0 http://www.pdfparser.org

Pdf parser library. Can read and extract information from pdf file.

  Sources   Download

GPL-3.0

The Requires

 

The Development Requires

28/01 2014

v0.9.18

0.9.18.0 http://www.pdfparser.org

Pdf parser library. Can read and extract information from pdf file.

  Sources   Download

GPL-3.0

The Requires

 

The Development Requires

26/01 2014

v0.9.17

0.9.17.0 http://www.pdfparser.org

Pdf parser library. Can read and extract information from pdf file.

  Sources   Download

GPL-3.0

The Requires

 

The Development Requires

25/01 2014

v0.9.16

0.9.16.0 http://www.pdfparser.org

Pdf parser library. Can read and extract information from pdf file.

  Sources   Download

GPL-2.0

The Requires

 

The Development Requires

25/01 2014

v0.9.15

0.9.15.0 http://www.pdfparser.org

Pdf parser library. Can read and extract information from pdf file.

  Sources   Download

GPL-2.0

The Requires

 

The Development Requires

25/01 2014

v0.9.14

0.9.14.0 http://www.pdfparser.org

Pdf parser library. Can read and extract information from pdf file.

  Sources   Download

GPL-2.0

The Requires

 

The Development Requires

25/01 2014

v0.9.13

0.9.13.0 http://www.pdfparser.org

Pdf parser library. Can read and extract information from pdf file.

  Sources   Download

GPL-2.0

The Requires

 

The Development Requires

25/01 2014

v0.9.12

0.9.12.0 http://www.pdfparser.org

Pdf parser library. Can read and extract information from pdf file.

  Sources   Download

GPL-2.0

The Requires

 

The Development Requires

24/01 2014

v0.9.11

0.9.11.0 http://www.pdfparser.org

Pdf parser library. Can read and extract information from pdf file.

  Sources   Download

GPL-2.0

The Requires

 

The Development Requires

09/12 2013

v0.9.10

0.9.10.0 http://www.pdfparser.org

Pdf parser library. Can read and extract information from pdf file.

  Sources   Download

GPL-2.0

The Requires

 

The Development Requires

21/11 2013

v0.9.9

0.9.9.0 http://www.pdfparser.org

Pdf parser library. Can read and extract information from pdf file.

  Sources   Download

GPL-2.0

The Requires

 

The Development Requires

19/10 2013

v0.9.8

0.9.8.0 http://www.pdfparser.org

Pdf parser library. Can read and extract information from pdf file.

  Sources   Download

GPL-2.0

The Requires

 

The Development Requires

16/10 2013

v0.9.7

0.9.7.0 http://www.pdfparser.org

Pdf parser library. Can read and extract information from pdf file.

  Sources   Download

GPL-2.0

The Requires

 

The Development Requires

13/10 2013

v0.9.6

0.9.6.0 http://www.pdfparser.org

Pdf parser library. Can read and extract information from pdf file.

  Sources   Download

GPL-2.0

The Requires

 

The Development Requires

26/09 2013

v0.9.5

0.9.5.0 http://www.pdfparser.org

Pdf parser library. Can read and extract information from pdf file.

  Sources   Download

GPL-2.0

The Requires

 

The Development Requires

24/09 2013

v0.9.4

0.9.4.0 http://www.pdfparser.org

Pdf parser library. Can read and extract information from pdf file.

  Sources   Download

GPL-2.0

The Requires

 

The Development Requires

21/09 2013

v0.9.3

0.9.3.0 http://www.pdfparser.org

Pdf parser library. Can read and extract information from pdf file.

  Sources   Download

GPL-2.0

The Requires

 

The Development Requires

06/09 2013

v0.9.2

0.9.2.0

Pdf parser library. Can read and extract informations from pdf file.

  Sources   Download

GPL-2.0

The Development Requires

04/09 2013

v0.9.1

0.9.1.0

Pdf parser library. Can read and extract informations from pdf file.

  Sources   Download

GPL-2.0

The Development Requires

02/09 2013

v0.9.0

0.9.0.0

Pdf parser library. Can read and extract informations from pdf file.

  Sources   Download

GPL-2.0

The Development Requires