dtSearch Product Overview
The dtSearch product line instantly searches gigabytes of text
across a desktop, network, internet or company-wide intranet site.
dtSearch products also serve as tools for publishing, with instant
searching, large document collections to Web sites or CD/DVDs.
Basic Search Types
- Phrase searching finds phrases like: due process of law
- Boolean operators like and/or/not can join words and phrases:
  due process of law and not (equal protection or civil rights)
- Proximity searching finds a word or phrase within 'n' words of
  another word or phrase: apple pie w/38 peach cobbler
- Phonic searching finds words that sound alike, like Smythe
  in a search for Smith.
- Stemming finds variations on endings, like applies,
  applied, applying in a search for apply.
- Numeric range searching finds any number between two numbers,
  such as between 6 and 36.
- Macro capabilities make it easy to include frequently used items
  in a search request.
- Wildcard support allows ? to hold a single letter place, and *
  to hold multiple letter places: apple* and not appl?sauce.
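The proximity and wildcard operators in the list above can be illustrated with a short Python sketch. This is not dtSearch code and does not parse dtSearch query syntax. The helpers `phrase_starts`, `within_n_words` and `wildcard_match` are hypothetical names, and both whitespace tokenization and measuring distance between phrase start positions are assumptions made for illustration.

```python
from fnmatch import fnmatchcase


def phrase_starts(words, phrase):
    """Return every index at which the word list `phrase` begins in `words`."""
    n = len(phrase)
    return [i for i in range(len(words) - n + 1) if words[i:i + n] == phrase]


def within_n_words(text, left, right, n):
    """True if `left` starts within n words of where `right` starts.

    Assumption: distance is counted between phrase start positions.
    """
    words = text.lower().split()
    left_hits = phrase_starts(words, left.lower().split())
    right_hits = phrase_starts(words, right.lower().split())
    return any(abs(i - j) <= n for i in left_hits for j in right_hits)


def wildcard_match(word, pattern):
    """? stands for exactly one character, * for any run of characters."""
    return fnmatchcase(word.lower(), pattern.lower())


text = "She served apple pie before the peach cobbler"
print(within_n_words(text, "apple pie", "peach cobbler", 38))

# Equivalent of the request: apple* and not appl?sauce
for word in ("apples", "applesauce"):
    keep = wildcard_match(word, "apple*") and not wildcard_match(word, "appl?sauce")
    print(word, keep)
```

Note that `fnmatch` lets `*` match an empty run as well, which may or may not agree with how a given search engine treats the operator.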
Fuzzy Searching
Fuzzy searching uses a proprietary algorithm to find search terms
even if they are misspelled.
Search fuzziness adjusts from 0 to 10 so you can fine-tune fuzziness
to the level of OCR or typographical errors in your files.
A search for alphabet with a fuzziness of 1 would find alphaqet;
with a fuzziness of 3, it would find both alphaqet and alpkaqet.
Fuzziness is not built into the index, so you can vary fuzziness at
the time of each search.
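Because the dtSearch algorithm is proprietary, the following Python sketch substitutes a well-known stand-in, Levenshtein edit distance, to show how a search-time fuzziness setting can widen the set of matches without touching the index. Treating the fuzziness level as a maximum edit distance is an assumption, and `edit_distance` and `fuzzy_find` are hypothetical helper names.

```python
def edit_distance(a, b):
    """Levenshtein distance: fewest single-character edits turning a into b."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,          # delete ca
                            curr[j - 1] + 1,      # insert cb
                            prev[j - 1] + cost))  # substitute or keep
        prev = curr
    return prev[-1]


def fuzzy_find(term, indexed_words, fuzziness):
    """Return indexed words within `fuzziness` edits of the search term.

    Assumption: fuzziness is used directly as the edit-distance limit.
    """
    return [w for w in indexed_words if edit_distance(term, w) <= fuzziness]


indexed_words = ["alphabet", "alphaqet", "alpkaqet", "turbine"]
print(fuzzy_find("alphabet", indexed_words, 1))
print(fuzzy_find("alphabet", indexed_words, 3))
```

Since the threshold is applied when the query runs, the same word list serves every fuzziness level, which mirrors the point that fuzziness is not built into the index.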
Concept / Synonym / Thesaurus Searching
Concept searching lets you look for fast and find quick, speedy,
etc.
dtSearch offers variable levels of automatic synonym expansion based
on a comprehensive semantic network of the English language.
You can also add your own thesaurus terms.
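A minimal Python sketch of synonym expansion with user-defined thesaurus terms might look like the following. The `THESAURUS` dictionary, the file names and the helpers `expand_term` and `concept_search` are all hypothetical; the semantic network that dtSearch uses is not reproduced here.

```python
# Hypothetical user-defined thesaurus: each term maps to its synonyms.
THESAURUS = {"fast": {"quick", "speedy"}}


def expand_term(term, thesaurus):
    """Return the term together with any synonyms listed for it."""
    return {term} | thesaurus.get(term, set())


def concept_search(term, documents, thesaurus):
    """Return names of documents containing the term or one of its synonyms."""
    variants = expand_term(term.lower(), thesaurus)
    return [name for name, text in documents.items()
            if variants & set(text.lower().split())]


documents = {
    "a.txt": "a quick reply",
    "b.txt": "slow response",
    "c.txt": "speedy delivery",
}
print(concept_search("fast", documents, THESAURUS))
```

Expanding the query rather than the documents keeps the stored text unchanged, so thesaurus entries can be added or removed between searches.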
Unicode Support
Unicode support allows for indexing and searching of non-English
text, including every character set supported by the Unicode
standard.
In addition to Unicode support, dtSearch offers extensive alphabet
customization options.
Relevancy Ranking
dtSearch can sort and instantly re-sort search results by relevancy
with respect to number of hits, file name, file date, etc.
Natural language algorithms apply automatic term weighting to a
"plain English" or unstructured indexed search request.
- Automatic term weighting is based on the frequency and density of
  hits in your files.
- For example, in the search request get me Sam's memo on the 1999
  CorpX takeover, if 1999 appeared in 3,000 files, and Sam appeared
  in only two files, then Sam would get a much higher relevancy
  rating, taking you straight to the most "relevant" files.
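The Sam/1999 example can be made concrete with inverse document frequency, a standard weighting scheme used here as a stand-in; the actual dtSearch formula is not given in this overview. The total of 10,000 files is an assumed collection size, `idf_weight` is a hypothetical helper, and the sketch covers only the frequency part of the weighting, not hit density.

```python
import math


def idf_weight(files_with_term, total_files):
    """Rarer terms get larger weights: log(total files / files with term)."""
    return math.log(total_files / files_with_term)


total_files = 10_000                 # assumed collection size, not from the source
doc_freq = {"1999": 3000, "sam": 2}  # file counts from the example above

weights = {term: idf_weight(df, total_files) for term, df in doc_freq.items()}
for term, weight in sorted(weights.items(), key=lambda kv: kv[1], reverse=True):
    print(f"{term}: {weight:.2f}")
```

Under this scheme a term that appears in every file gets a weight of zero, so common words contribute little to the ranking.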
dtSearch also includes variable term weighting options for all
indexed searches.
Field Searching
dtSearch automatically detects fields (including document summary
information fields) in XML, HTML, PDF, WordPerfect, MS Word, Access,
Excel, PowerPoint, CSV, RTF and ANSI files.
In addition to full-text searching, dtSearch can also search by
field name: (Author contains John Smith) and (Subject contains
turbine generators).
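The field query above can be sketched in Python over documents represented as dictionaries of field names and values. This is an illustration only: the sample documents and the `field_contains` helper are hypothetical, and case-insensitive substring matching is an assumption that a word-based index would not necessarily share.

```python
def field_contains(doc, field, phrase):
    """True if the named field exists and contains the phrase (case-insensitive)."""
    return phrase.lower() in doc.get(field, "").lower()


# Hypothetical documents with already-extracted fields.
docs = [
    {"Author": "John Smith", "Subject": "Turbine generators in hydro plants"},
    {"Author": "John Smith", "Subject": "Budget review"},
    {"Author": "Jane Doe", "Subject": "Turbine generators"},
]

# (Author contains John Smith) and (Subject contains turbine generators)
hits = [d for d in docs
        if field_contains(d, "Author", "John Smith")
        and field_contains(d, "Subject", "turbine generators")]
print(hits)
```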
dtSearch supports hierarchical field structures in XML data,
including both fields and attributes, enabling highly refined nested
field queries.
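One way to picture hierarchical fields is to flatten an XML document into path-like field names, with attributes included alongside elements. The Python sketch below uses the standard `xml.etree.ElementTree` module; the sample XML, the `flatten_fields` helper and the `/` and `@` path notation are assumptions for illustration, not dtSearch's own field syntax.

```python
import xml.etree.ElementTree as ET

# Hypothetical sample document.
XML = (
    '<report>'
    '<author role="lead"><name>John Smith</name></author>'
    '<subject>turbine generators</subject>'
    '</report>'
)


def flatten_fields(elem, path=""):
    """Yield (field_path, value) pairs for nested elements and attributes."""
    here = f"{path}/{elem.tag}" if path else elem.tag
    for attr, value in elem.attrib.items():
        yield f"{here}/@{attr}", value
    text = (elem.text or "").strip()
    if text:
        yield here, text
    for child in elem:
        yield from flatten_fields(child, here)


fields = dict(flatten_fields(ET.fromstring(XML)))
for field_path, value in fields.items():
    print(field_path, "=", value)
```

With fields named by their full path, a nested query reduces to a lookup such as checking whether `report/author/name` contains a given phrase.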
The dtSearch Engine includes a sample application for indexing SQL
and other COM data sources.