2017 © Pedro Peláez
 

library php-document-parser

A PHP parser for getting the text from a .doc, .docx, .rtf or .txt file

image

lukemadhanga/php-document-parser

A PHP parser for getting the text from a .doc, .docx, .rtf or .txt file

  • Sunday, November 19, 2017
  • by LukeMadhanga
  • Repository
  • 3 Watchers
  • 11 Stars
  • 916 Installations
  • PHP
  • 0 Dependents
  • 0 Suggesters
  • 7 Forks
  • 5 Open issues
  • 6 Versions
  • 53 % Grown

The README.md

PHP DocumentParser

A PHP parser for getting the text from a .doc, .docx, .rtf or .txt file, (*1)


Authors - @facuonline - Luke Madhanga @LukeMadhanga, (*2)


This library is perfect if you want users to be able to upload word documents to your content management system, instead of forcing them to copy and paste. Supported file types are .doc, .docx, .txt and .rtf., (*3)

composer require lukemadhanga/php-document-parser, (*4)

May require you to install PHP Zip, (*5)

sudo apt-get install php7.0-zip, (*6)

The above Ubuntu command will vary depending on your version of PHP and what OS is running on your server, (*7)


Methods

parseFromFile

Parse a document from a file, (*8)

Arguments, (*9)

string $filename The path to the file to parse, (*10)

string $mimetype The mimetype of the file. This will be used to determine which algorithm to use when decoding, (*11)

returns string The text from the file, (*12)


parseFromString

Parse a file from a string, (*13)

Arguments, (*14)

string $string The contents of the file to parse, (*15)

string $mimetype The mimetype of the file. This will be used to determine which algorithm to use when decoding, (*16)

returns string The text in the document, (*17)


Change log

September 21 2019 (0.1.4)

Better ODT Support Merged in PR#13 for better ODT support. Author: facuonline, (*18)

August 1 2019 (0.1.3)

PHP Unit Merged in PR#12 for PHP Unit testing. Author: facuonline, (*19)

March 21 2019 (0.1.2)

DOCX Handling Merged in PR#10 For better DOCX handling. Includes bug fixes for exception handling. Author: facuonline, (*20)

September 13th 2017

Added composer, (*21)

composer require lukemadhanga/php-document-parser, (*22)

April 29th 2016

Improved .doc process, (*23)

The script to parse .doc files is unreliable: it breaks on complicated documents. I would suggest installing the antiword command line utility as that works almost perfectly for the larger majority of documents., (*24)

The Versions

19/11 2017

dev-master

9999999-dev

A PHP parser for getting the text from a .doc, .docx, .rtf or .txt file

  Sources   Download

The Requires

  • php >=5.3.3

 

19/11 2017

0.1.1

0.1.1.0

A PHP parser for getting the text from a .doc, .docx, .rtf or .txt file

  Sources   Download

The Requires

  • php >=5.3.3

 

08/11 2017

0.1.0

0.1.0.0

A PHP parser for getting the text from a .doc, .docx, .rtf or .txt file

  Sources   Download

The Requires

  • php >=5.3.3

 

08/11 2017

0.0.13

0.0.13.0

A PHP parser for getting the text from a .doc, .docx, .rtf or .txt file

  Sources   Download

The Requires

  • php >=5.3.3

 

14/09 2017

0.0.12

0.0.12.0

A PHP parser for getting the text from a .doc, .docx, .rtf or .txt file

  Sources   Download

The Requires

  • php >=5.3.3

 

13/09 2017

0.0.1

0.0.1.0

A PHP parser for getting the text from a .doc, .docx, .rtf or .txt file

  Sources   Download

The Requires

  • php >=5.3.3