Table of Contents:
Server platforms supported
Scalability:
Technical support:
To contribute: send your message to glimpse@cs.arizona.edu.
| Source code availability | Yes (with the package) |
| Main program modules | Glimpseindex: the indexer. Glimpse: the search engine. Glimpseserver: provides remote access to a Glimpse database. Webglimpse: the search interface. |
| File/document formats supported (HTML, ASCII, PDF, SQL, Spread sheets, WYSIWYG) | HTML, DOC, PDF and Text files |
Indexing level support:
Specification of document collections
| Standard formats recognised (MARC, BIB-REF, MEDLINE, etc.) | NO |
| Customisation of document formats | NO |
| Stemming | Yes. (Through Truncation) |
| Stop words | Yes (using the -Sk option). Instead of having a fixed stop-list, glimpseindex works out separately for each index which words are too common. |
| Field level indexing | No |
| Database updating (merging) | Yes (using the -a option with glimpseindex), which adds the given files and/or directories to an existing index. |
| Compression support | No. |
| WEBGLIMPSE |
| Additional features | Eg: glimpseindex -H "index files path" -a "index directory" |
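| Boolean | Yes. Glimpse supports an 'AND' operation denoted by the symbol `;', an 'OR' operation denoted by the symbol `,', a 'NOT' operation denoted by the symbol `~', or any combination. Eg: glimpse 'pizza;cheeseburger' |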
| Query term weighting | No |
| Relevancy ranking | No |
| Proximity/phrase searching | No |
| Approximate matching | Yes. Combination of exact and approximate matching.
Eg: competer |
| Truncation | The symbol `#' is used to denote a sequence of any number of characters.
Eg: comp# |
| Pattern Search | Yes. Glimpse supports a large variety of patterns,
including simple strings, strings with classes of characters,
sets of strings, wild cards, and regular expressions.
Eg: glimpse \^abc\ , glimpse [a-ho-z] |
| Search set manipulation | No |
| Duplicate detection | No |
| Field level searching | No |
| Thesaurus/concept searching | No |
| QBE (Query-By-Example)/ Relevance feedback searching | No |
| Customisation (thru CGI programming) | Yes (using Perl scripts) |
| Soundex search | Yes |
| Additional features | Searches can be done at command line using different options of glimpse.
Eg: glimpse -H "index files path" -n information |
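The Boolean and command-line rows above can be combined into a single query. The following is a minimal sketch, not part of the Glimpse distribution: `join_terms` and `glimpse_cmd` are hypothetical helper names, and `/opt/glimpse/index` is an assumed index location. The script only prints the command it would run, so it can be tried without Glimpse installed.

```shell
#!/bin/sh
# Hypothetical helper: join search terms with a Glimpse Boolean operator.
# In Glimpse patterns ';' means AND and ',' means OR.
join_terms() {
    op=$1
    shift
    out=""
    for term in "$@"; do
        if [ -z "$out" ]; then
            out=$term
        else
            out="$out$op$term"
        fi
    done
    printf '%s\n' "$out"
}

# Hypothetical helper: print (do not execute) a glimpse command line.
# -H names the directory holding the index files; -n prefixes each
# matching line with its line number.
glimpse_cmd() {
    index_dir=$1
    pattern=$2
    printf "glimpse -H '%s' -n '%s'\n" "$index_dir" "$pattern"
}

# Assumed index directory, for illustration only.
glimpse_cmd /opt/glimpse/index "$(join_terms ';' pizza cheeseburger)"
```

Quoting the pattern matters here, because an unquoted `;` would be read by the shell as a command separator.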
GLIMPSE: Results Display
| Formats supported (Native format, ASCII, HTML) | The output of a query is a set of records, one for each matching
file. WebGlimpse formats the results in four ways. |
| Relevancy ranking | No |
| Option for viewing document summary | Yes |
| Keyword-in-context | No |
| Customisation of results display | No |
5. Costs, license, registration:
GLIMPSE: Costs, license, registration, etc.
| Completely free | Yes |
| Maintenance fee, license fee and any other contractual requirements | Any commercial use of this software will require a license.
e-mail: ott@u.arizona.edu http://vpr2.admin.arizona.edu/ott/Webnot96.htm |
| Registration for download and use | No |
Application 1:
| Input collection size | 707 K | 2 MB | 5 MB |
| Index time | 10 sec | 2 min | 6 min |
| No. of records | 34 docs | 336 docs | 1064 docs |
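To compare the three runs in the table above, the figures can be reduced to an indexing rate in KB per second. This is a rough sketch: `throughput` is a hypothetical helper name, and the conversion 1 MB = 1024 KB is an assumption, since the table does not say which convention was used.

```shell
#!/bin/sh
# Hypothetical helper: indexing rate in KB per second (size_kb / seconds).
throughput() {
    awk -v kb="$1" -v sec="$2" 'BEGIN { printf "%.1f\n", kb / sec }'
}

throughput 707 10      # 707 K in 10 sec
throughput 2048 120    # 2 MB in 2 min (assuming 1 MB = 1024 KB)
throughput 5120 360    # 5 MB in 6 min (assuming 1 MB = 1024 KB)
```

Under these assumptions the rates come out at roughly 70.7, 17.1 and 14.2 KB per second, so the measured rate falls as the collection grows. With only three data points this should be read as an indication rather than a scaling law.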
Installation steps (vis-a-vis what is on the CD):
GLIMPSE Installation.
$ mkdir /opt/glimpse
$ gunzip glimpse-4.1.src.tar.gz
$ tar -xvf glimpse-4.1.src.tar
$ cd glimpse-4.1
Note: Make the configure file executable.
$ chmod 755 configure
$ ./configure
$ make
$ make install
$ cd webglimpse
$ gunzip webglimpse-1.6edu.tar.gz
$ tar -xvf webglimpse-1.6edu.tar
$ ./wginstall your_os
where your_os is one of the following: linux, sunos, or solaris
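The wginstall argument can be derived from `uname` instead of being typed by hand. The sketch below is illustrative only: `wg_os_name` is a hypothetical helper, and the mapping of SunOS release 5.x to `solaris` and earlier releases to `sunos` is an assumption about what wginstall expects, not something stated in the installation notes.

```shell
#!/bin/sh
# Hypothetical helper: map `uname -s` and `uname -r` output to one of the
# platform names listed for wginstall (linux, sunos, solaris).
wg_os_name() {
    sysname=$1
    release=$2
    case "$sysname" in
        Linux)
            echo linux
            ;;
        SunOS)
            # Assumption: SunOS 5.x is Solaris; older releases are SunOS.
            case "$release" in
                5.*) echo solaris ;;
                *)   echo sunos ;;
            esac
            ;;
        *)
            echo "unsupported platform: $sysname" >&2
            return 1
            ;;
    esac
}

# Example with fixed inputs; on a real host pass "$(uname -s)" "$(uname -r)".
wg_os_name SunOS 5.8
```

On a supported host the result could then be passed on as `./wginstall "$(wg_os_name "$(uname -s)" "$(uname -r)")"`.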