
IR Search


Search

1000 valid crawled documents. See more stats

URL | Content Type | Content Size | Scan datetime
Load Crawled Docs
Word | TF | DF | D[] (document list)
Load TFiDF content

This only works once the TFiDF table has been populated.

Dashboard Stats

Metric                      Value   Description
# URLs in Queue             5034    # of documents in the crawler's queue
# URLs Crawled              1203    # of documents crawled
-----> Invalid              203     # of documents crawled that were invalid (either redirects or HTTP 404 errors)
-----> HTML                 915
-----> PDF                  85
-----> Text                 0
-----> None                 0
# Documents Preprocessed    1001

Welcome to the admin interface of this project. All functions below require proper privileges and will not work if you do not have sufficient permissions.

Actions here require admin privileges!

Start crawling:

Max Levels
  • 0 indicates no limit on recursion (use cautiously)
  • any value <= -1 indicates the crawler will not recurse (it scans only the page given)
  • any value > 0 indicates a specific recursion level, after which the crawler stops going any deeper
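
The Max Levels rules above can be sketched as a small depth check. This is only an illustration under assumptions, not the project's actual crawler: the names `should_follow_links`, `crawl`, and `get_links` are hypothetical, any negative value is treated as "do not recurse", and the start page is assumed to sit at depth 0.

```python
from collections import deque


def should_follow_links(depth, max_levels):
    """Decide whether links found on a page at `depth` should be queued.

    Assumed semantics (taken from the Max Levels description):
      max_levels == 0 -> no limit on recursion
      max_levels <  0 -> never recurse, scan only the start page
      max_levels >  0 -> stop once `depth` reaches max_levels
    """
    if max_levels == 0:
        return True
    if max_levels < 0:
        return False
    return depth < max_levels


def crawl(start, max_levels, get_links):
    """Breadth-first crawl; `get_links(url)` is a hypothetical link extractor."""
    seen = {start}
    queue = deque([(start, 0)])  # (url, depth) pairs; the start page is depth 0
    visited = []
    while queue:
        url, depth = queue.popleft()
        visited.append(url)
        if not should_follow_links(depth, max_levels):
            continue
        for link in get_links(url):
            if link not in seen:  # avoid revisiting pages and looping forever
                seen.add(link)
                queue.append((link, depth + 1))
    return visited


# A tiny in-memory link graph stands in for real pages.
graph = {"a": ["b"], "b": ["c"], "c": ["a"]}
links = lambda url: graph.get(url, [])
```

With this toy graph, a negative limit visits only the start page, a limit of 1 also visits the pages it links to, and 0 follows links until no unseen pages remain.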

Start Crawling

Start preprocessing:

Preprocessing processes the crawled documents and generates the term-frequency (TF) and document-frequency (DF) matrices.
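
As a rough illustration of what such a preprocessing step might compute, the sketch below builds per-document term frequencies, document frequencies, and a document list per word (matching the Word / TF / DF / D[] columns), then derives TF-IDF weights. The function names, the whitespace tokenizer, and the `tf * log(N / df)` weighting are assumptions chosen for the example; the project's own tokenization and weighting may differ.

```python
import math
from collections import Counter, defaultdict


def build_tf_df(docs):
    """Build TF, DF, and document lists from {doc_id: text}.

    Tokenization is a plain lowercase whitespace split (an assumption).
    """
    tf = {}                       # doc_id -> Counter of word counts
    doc_lists = defaultdict(list) # word -> list of doc_ids containing it
    for doc_id, text in docs.items():
        counts = Counter(text.lower().split())
        tf[doc_id] = counts
        for word in counts:
            doc_lists[word].append(doc_id)
    # DF is the number of documents each word appears in.
    df = {word: len(ids) for word, ids in doc_lists.items()}
    return tf, df, dict(doc_lists)


def tf_idf(tf, df, n_docs):
    """Weight each term as tf * log(n_docs / df), one common TF-IDF variant."""
    return {
        doc_id: {w: c * math.log(n_docs / df[w]) for w, c in counts.items()}
        for doc_id, counts in tf.items()
    }


docs = {1: "red fish blue fish", 2: "blue sky"}
tf, df, doc_lists = build_tf_df(docs)
weights = tf_idf(tf, df, len(docs))
```

A word that occurs in every document (here "blue") gets weight 0 under this variant, which is why the TFiDF table can only be filled after TF and DF exist.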

Start Preprocessing