Home > pysearchengine

pysearchengine

Pysearchengine is a project mainly written in PYTHON and PERL, it's free.

search engine based on tf - idf IR algorithms implemented in python

Search engine based on term frequency and inverse document frequency formulas and Information Retrival algorithms. Features:

  • web crawling for a given domain
  • tokenizing the corpus
  • stemming algorithms
  • eliminating stop words
  • indexing the corpus
  • ranking based on tf-idf formulas