Welcome to ftp.nluug.nl Current directory: /os/Linux/distr/salix/sbo/15.0/python/PyStemmer/ |
|
Contents of README:Snowball stemming algorithms, for information retrieval Stemming algorithms PyStemmer provides access to efficient algorithms for calculating a "stemmed" form of a word. This is a form with most of the common morphological endings removed; hopefully representing a common linguistic base form. This is most useful in building search engines and information retrieval software; for example, a search with stemming enabled should be able to find a document containing "cycling" given the query "cycles". PyStemmer provides algorithms for several (mainly european) languages, by wrapping the libstemmer library from the Snowball project in a Python module. It also provides access to the classic Porter stemming algorithm for english: although this has been superceded by an improved algorithm, the original algorithm may be of interest to information retrieval researchers wishing to reproduce results of earlier experiments. |
Name Last modified Size
Parent Directory - PyStemmer.SlackBuild 02-Dec-2023 01:59 3.1K PyStemmer.info 02-Dec-2023 01:59 492 README 17-Mar-2022 18:33 928 slack-desc 11-Mar-2022 06:34 1.1K
NLUUG - Open Systems. Open Standards
Become a member
and get discounts on conferences and more, see the NLUUG website!