Contents of README:
Snowball stemming algorithms, for information retrieval

Stemming algorithms

PyStemmer provides access to efficient algorithms for calculating a
"stemmed" form of a word. This is a form with most of the common
morphological endings removed, ideally leaving a common
linguistic base form. This is most useful in building search
engines and information retrieval software; for example, a search
with stemming enabled should be able to find a document containing
"cycling" given the query "cycles".

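The search example above can be sketched in a few lines of Python. This
is only an illustration of why stemming helps retrieval: toy_stem and its
suffix list are hypothetical and much cruder than the Snowball rules that
PyStemmer wraps, and build_index and search are names made up for this
sketch.

```python
from collections import defaultdict

# Toy suffix list for illustration only; these are NOT the Snowball rules.
SUFFIXES = ("ing", "es", "s", "e")


def toy_stem(word):
    """Strip the first matching suffix, keeping at least three characters."""
    word = word.lower()
    for suffix in SUFFIXES:
        if word.endswith(suffix) and len(word) - len(suffix) >= 3:
            return word[: -len(suffix)]
    return word


def build_index(documents):
    """Map each stem to the set of document ids that contain it."""
    index = defaultdict(set)
    for doc_id, text in documents.items():
        for token in text.split():
            index[toy_stem(token)].add(doc_id)
    return index


def search(index, query):
    """Stem the query the same way as the documents, then look it up."""
    return index.get(toy_stem(query), set())


documents = {1: "cycling in the rain", 2: "a recipe for soup"}
index = build_index(documents)
print(search(index, "cycles"))  # document 1 matches, although it says "cycling"
```

The point of the design is that documents and queries pass through the
same stemming function, so different surface forms meet at one index key.
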
PyStemmer provides algorithms for several (mainly European) languages,
by wrapping the libstemmer library from the Snowball project in a
Python module.

It also provides access to the classic Porter stemming algorithm for
English: although this has been superseded by an improved algorithm,
the original algorithm may be of interest to information retrieval
researchers wishing to reproduce results of earlier experiments.
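
A minimal usage sketch follows, assuming the package installs a module
named Stemmer and that the algorithm names "english" and "porter" are
available in the installed build (Stemmer.algorithms() lists the names
actually present). The helper compare_stemmers is hypothetical and exists
only for this example; the import guard is there so the sketch still runs
where PyStemmer is not installed.

```python
try:
    import Stemmer  # module installed by the PyStemmer package
except ImportError:
    Stemmer = None  # lets the sketch degrade gracefully without PyStemmer


def compare_stemmers(words, algorithms=("english", "porter")):
    """Return {algorithm: stems}, or None if PyStemmer is missing."""
    if Stemmer is None:
        return None
    # One Stemmer object per algorithm; stemWords stems a whole list at once.
    return {name: Stemmer.Stemmer(name).stemWords(words) for name in algorithms}


result = compare_stemmers(["cycling", "cycles", "generously"])
if result is None:
    print("PyStemmer is not installed")
else:
    for name, stems in result.items():
        print(name, stems)
```

Running both algorithms on the same word list is one way to see where the
original Porter algorithm and its successor disagree.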

Name                  Last modified      Size
PyStemmer.SlackBuild  02-Dec-2023 01:59  3.1K
PyStemmer.info        02-Dec-2023 01:59  492
README                17-Mar-2022 18:33  928
slack-desc            11-Mar-2022 06:34  1.1K