Product detail

Text Preprocessing Tool

ŠABATKA, O. BARTÍK, V.

Product type

software

Abstract

The tool enables text preprocessing of documents for text mining. It offers several possibilities of document representation (words or N-grams as terms) and several weighting methods (binary, TF or TF-IDF). It also provides two standard pre-processing procedures of text - stopwords removal and stemming.

Keywords

text mining, preprocessing, document representation, N-grams,  TF-IDF

Create date

10. 11. 2010

Location

http://www.fit.vutbr.cz/~bartik/Arcbc/download.htm

Possibilities of use

Využití výsledku jiným subjektem je v některých případech možné bez nabytí licence

Licence fee

Poskytovatel licence na výsledek nepožaduje licenční poplatek

www