2017 © Pedro Peláez
 

library tika-client

Apache Tika PHP Client

image

marcelomx/tika-client

Apache Tika PHP Client

  • Tuesday, September 24, 2013
  • by marcelomx
  • Repository
  • 1 Watchers
  • 0 Stars
  • 226 Installations
  • PHP
  • 0 Dependents
  • 0 Suggesters
  • 3 Forks
  • 0 Open issues
  • 1 Versions
  • 0 % Grown

The README.md

Apache Tika PHP Client

Usage:, (*1)

$path   = __DIR__ . '/../bin/tika-app-1.4.jar'; 
$tika = new TikaClient($path);

// Get text
$text = $tika->getText('file.doc');

// Get html text
$html = $tika->getHtml('file.doc');

// Get xhtml text
$xhtml = $tika->getXhtml('file.doc');

// Get language
$lang = $tika->getLanguage('file.doc');

// Get content type
$type = $tika->getContentType('file.doc');

// Extract all attachments on doc file
$target = '/tmp/'; // target directory
$tika->extract('file.doc', $target);

If you prefer, use the TikaWrapper to encapsulate all operations to same file. Eg:, (*2)

$wrapper = new TikaWrapper('file.doc', $client);

// Get text
$text = $wrapper->getText();

// Get html text
$html = $wrapper->getHtml();

// Get xhtml text
$xhtml = $wrapper->getXhtml();

// Get language
$lang = $wrapper->getLanguage();

// Get content type
$type = $wrapper->getContentType('file.doc');

The Versions

24/09 2013

dev-master

9999999-dev

Apache Tika PHP Client

  Sources   Download

GPL

The Requires

 

The Development Requires

by Marcelo Rodrigues