dtSearch Product Overview

The dtSearch product line instantly searches gigabytes of text across a desktop, network, internet or company-wide intranet site.

dtSearch products also serve as tools for publishing, with instant searching, large document collections to Web sites or CD/DVDs.


Basic Search Types

  • Phrase searching finds phrases like: due process of law

  • Boolean operators like and/or/not can join words and phrases: due process of law and not (equal protection or civil rights)

  • Proximity searching finds a word or phrase within 'n' words or another word or phrase: apple pie w/38 peach cobbler

  • Phonic searching finds words that sound alike, like Smythe in a search for Smith.

  • Stemming finds variations on endings, like applies, applied, applying in a search for apply.

  • Numeric range searching finds any number between two numbers such as between 6 and 36.

  • Macro capabilities make it easy to include frequently used items in a search request.

  • Wildcard support allows ? to hold a single letter place, and * to avoid multiple letter places: apple* and not appl?sauce.


Fuzzy Searching

Fuzzy searching uses a proprietary algorithm to find search terms even if they are misspelled.

Search fuzziness adjusts from 0 to 10 so you can fine-tune fuzziness to the level of OCR or typographical errors in your files.

A search for alphabet with a fuzziness of 1 would find alphaqet; with a fuzziness of 3, it would find both alphaqet and alpkaqet.

Fuzziness is not built into the index, so you can vary fuzziness at the time of each search.


Concept / Synonym / Thesaurus Searching

Concept searching lets you look for fast and find quick, speedy, etc.

dtSearch offers variable levels of automatic synonym expansion based on a comprehensive semantic network of the English language.

You can also add your own thesaurus terms.


Unicode Support

Unicode support allows for indexing and searching of non-English text, including every character set supported by the Unicode standard.

In addition to Unicode support, dtSearch offers extensive alphabet customization options.


Relevancy Ranking

dtSearch can sort and instantly re-sort searches by relevancy with respect to number of hits, file name, file date, etc.

Natural language algorithms provide automatic term weighting, following a "plain English" or unstructured indexed search request.

  • Automatic term weighting is based on the frequency and density of hits in your files.

  • For example, in the search request get me Sam's memo on the 1999 CorpX takeover, if 1999 appeared in 3,000 files, and Sam appeared in only two files, then Sam would get a much higher relevancy rating, taking you straight to the most "relevant" files.

dtSearch also includes variable term weighting options for all indexed searches:

  • Positive term weighting can place extra emphasis on one or more words: soup:8 or recipe:3

  • Negative term weighting can assign negative emphasis to one or more words: red or green or yellow:-7


Field Searching

dtSearch automatically detects fields (including document summary information fields) in XML, HTML, PDF, WordPerfect, MS Word, Access, Excel, PowerPoint, CSV, RTF and ANSI files.

In addition to full-text searching, dtSearch can also search by field name: (Author contains John Smith) and (Subject contains turbine generators).

dtSearch supports hierarchical field structures in XML data, including both fields and attributes, enabling highly refined nested field queries.

The dtSearch Engine includes a sample application for indexing SQL and other COM data sources.

 

SNMP is an official distributor for dtSearch in Singapore

 
 

CopyrightŠ 2001-2002 SNMP Pte. Ltd. All Rights Reserved.