2017 © Pedro Peláez
 

library indicsoundex

Indian language soundex package based on Santhosh Thottingal's algorithm.

image

startcodein/indicsoundex

Indian language soundex package based on Santhosh Thottingal's algorithm.

  • Saturday, October 17, 2015
  • by pshanoop
  • Repository
  • 1 Watchers
  • 2 Stars
  • 7 Installations
  • PHP
  • 0 Dependents
  • 0 Suggesters
  • 1 Forks
  • 0 Open issues
  • 2 Versions
  • 0 % Grown

The README.md

IndicSoundex

Indian laguage soundex package based on Santhosh Thottingal's algorithm. For more info on algorithm check here, (*1)

Soundex is phonetic algorithm for indexing names by sound as pronounced in English. This module implements Soundex algorithm for Engish as well as a modified version of soundex algorithm for Indian languages., (*2)

This include Indian major languages:, (*3)

  • Hindi (hi_IN)
  • Bengali (bn_IN)
  • Punjabi (pa_IN)
  • Gujarati (gu_IN)
  • Oriya (or_IN)
  • Tamil (ta_IN)
  • Telugu (te_IN)
  • Kannada (kn_IN)
  • Malayalam (ml_IN)
  • English (en_US)

This can be extended to any language by including soundex character map for it., (*4)

Quick start

Installing using git, (*5)

git clone https://github.com/startcodein/IndicSoundex.git

Installing using composer, (*6)

composer require startcodein/indicsoundex:@dev

Generating soundex

<?php 

   use Startcode\IndicSoundex\IndicSoundex as IndicSoundex;

   $sound = new IndicSoundex();

   echo $sound->soundex('ಬೆಂಗಳೂರು').PHP_EOL;
   echo $sound->soundex('आम्र् फल्').PHP_EOL;
   echo $sound->soundex('vasudev').PHP_EOL;
   echo $sound->soundex('Rupert्').PHP_EOL;

This will give output, (*7)

ಬDNFQCPC
आNPMQ000
v2310000
r1630000

Comparing string soundex

<?php 

   use Startcode\IndicSoundex\IndicSoundex as IndicSoundex;

   $sound = new IndicSoundex();

   echo $sound->compare('बॆंगळूरु','आम्र् फल्').PHP_EOL;
   echo $sound->compare('Bangalore','ಬೆಂಗಳೂರು').PHP_EOL;
   echo $sound->compare('बॆंगळूरु','बॆंगळूरु').PHP_EOL;
   echo $sound->compare('അമ്മ','അമ').PHP_EOL;

This will give output like this, (*8)

-1  //Not equal
-1  //Not equal
0   // Same word
1   // Similar
2   //Diff lang similar

License

Copyright(c) 2015 Sanoob Pattanath, (*9)

This program is free software; you can redistribute it and/or modify it under the terms of the GNU General Public License as published by the Free Software Foundation; either version 3 of the License, or (at your option) any later version. This program is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU Library General Public License for more details., (*10)

Contributions

Any kind of contributions are really appreciated. If you find any bugs or security issues please email hello[at]pattanath.com or raise an issue on github., (*11)

The Versions

17/10 2015

dev-master

9999999-dev http://startcode.in

Indian language soundex package based on Santhosh Thottingal's algorithm.

  Sources   Download

GPL-3.0

The Requires

  • php >=5.5.9
  • ext-mbstring *

 

The Development Requires

by Sanoob Pattanath

language unicode soundex india indicsoundex startcodein hi_in bn_in pa_in gu_in or_in ta_in te_in kn_in ml_in en_us

17/10 2015

0.1.0

0.1.0.0 http://startcode.in

Indian language soundex package based on Santhosh Thottingal's algorithm.

  Sources   Download

GPL-3.0

The Requires

  • php >=5.5.9
  • ext-mbstring *

 

The Development Requires

by Sanoob Pattanath

language unicode soundex india indicsoundex startcodein hi_in bn_in pa_in gu_in or_in ta_in te_in kn_in ml_in en_us