library tika-client
Apache Tika PHP Client
marcelomx/tika-client
- Tuesday, September 24, 2013
- by marcelomx
Usage:
$path = __DIR__ . '/../bin/tika-app-1.4.jar';
$tika = new TikaClient($path);
// Get text
$text = $tika->getText('file.doc');
// Get html text
$html = $tika->getHtml('file.doc');
// Get xhtml text
$xhtml = $tika->getXhtml('file.doc');
// Get language
$lang = $tika->getLanguage('file.doc');
// Get content type
$type = $tika->getContentType('file.doc');
// Extract all attachments from the doc file
$target = '/tmp/'; // target directory
$tika->extract('file.doc', $target);
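Since the client is constructed with the path to the Tika app jar, each call presumably ends up running that jar from the command line. The following is a minimal sketch of how such a command could be assembled, not the library's actual implementation. The function name `buildTikaCommand` is hypothetical, and the mapping of client methods to the Tika app's `--text`, `--html`, `--xml`, `--language` and `--detect` options is an assumption.

```php
<?php
// Hypothetical helper, not part of tika-client: builds the shell command
// for a single Tika app operation on one file.
function buildTikaCommand(string $jarPath, string $option, string $file): string
{
    // Long options accepted by the Tika app command line.
    $allowed = ['--text', '--html', '--xml', '--language', '--detect'];
    if (!in_array($option, $allowed, true)) {
        throw new InvalidArgumentException("Unsupported option: $option");
    }

    // Quote the paths so spaces and shell metacharacters are passed safely.
    return sprintf(
        'java -jar %s %s %s',
        escapeshellarg($jarPath),
        $option,
        escapeshellarg($file)
    );
}

$command = buildTikaCommand('bin/tika-app-1.4.jar', '--text', 'file.doc');
echo $command, PHP_EOL;

// To actually run it (requires Java and the jar to be present):
// $text = shell_exec($command);
```

Checking the option against a fixed list and quoting both paths keeps untrusted file names from being interpreted by the shell.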
If you prefer, use the TikaWrapper to encapsulate all operations on the same file, e.g.:
$wrapper = new TikaWrapper('file.doc', $tika);
// Get text
$text = $wrapper->getText();
// Get html text
$html = $wrapper->getHtml();
// Get xhtml text
$xhtml = $wrapper->getXhtml();
// Get language
$lang = $wrapper->getLanguage();
// Get content type
$type = $wrapper->getContentType();
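The wrapper idea, binding a file name once and forwarding every call to the client, can be illustrated with a small self-contained sketch. The classes `StubClient` and `FileBoundWrapper` below are hypothetical stand-ins written for this example; they are not the library's `TikaClient` or `TikaWrapper`, whose internals may differ.

```php
<?php
// Hypothetical stand-in for the client; the real client would run Tika.
class StubClient
{
    public function getText(string $file): string
    {
        return "text of $file";
    }
}

// Hypothetical wrapper: remembers one file and forwards calls to the client.
class FileBoundWrapper
{
    private $file;
    private $client;

    public function __construct(string $file, $client)
    {
        $this->file = $file;
        $this->client = $client;
    }

    // Forward any method call to the client, passing the bound file first.
    public function __call(string $name, array $arguments)
    {
        return $this->client->$name($this->file, ...$arguments);
    }
}

$wrapper = new FileBoundWrapper('file.doc', new StubClient());
$result = $wrapper->getText();
echo $result, PHP_EOL; // prints "text of file.doc"
```

Using `__call` keeps the sketch short; a wrapper with explicit methods would be easier to document and to analyse statically.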
dev-master
9999999-dev
GPL
by Marcelo Rodrigues