You will need at least a naive stemming algorithm to process the text first (try the Porter stemmer; there is free code available for it in most languages). Keep this processed text and the unprocessed text in two separate space-split arrays.
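As a rough sketch of that idea in Python: the `naive_stem` function below is a hypothetical stand-in that only strips a few common suffixes. It is not the Porter algorithm, and the names `naive_stem` and `split_and_stem` are made up for illustration. In practice you would probably swap in a real Porter implementation (for example `PorterStemmer().stem(word)` from NLTK, if you have it installed).

```python
# Naive suffix stripper: a stand-in for a real stemmer, NOT the Porter algorithm.
SUFFIXES = ("ing", "ed", "es", "s")


def naive_stem(word):
    """Lowercase the word and strip one common suffix, keeping at least 3 characters."""
    word = word.lower()
    for suffix in SUFFIXES:
        if word.endswith(suffix) and len(word) - len(suffix) >= 3:
            return word[: -len(suffix)]
    return word


def split_and_stem(text):
    """Return two parallel lists: the space-split original words and their stems."""
    original = text.split()  # split on whitespace
    stemmed = [naive_stem(w) for w in original]
    return original, stemmed


original, stemmed = split_and_stem("The cats were jumping over painted boxes")
print(original)
print(stemmed)
```

Because both arrays have the same length and order, index `i` in the stemmed array always points back to the original word at index `i`, so you can match on stems and still display the untouched text.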