  
	
  
  
	
	  htfuzzy
	
	
	  ht://Dig Copyright © 1995-2004 
THANKS.htmlThe ht://Dig Group 	  Please see the file 
COPYINGCOPYING  for
	  license information.
	
	
	
	  
		
		  Synopsis
		
	  
	  
		htfuzzy [-c 
configfile][-v] algorithm ...
	  
	
	
	  
		
		  Description
		
	  
	  
		Htfuzzy creates indexes for different "fuzzy" search
		algorithms. These indexes can then be used by the
		
htsearch.htmlhtsearch  program.
	  
	
	
	  
		
		  Options
		
	  
	  
		
		  
			-c 
configfile		  
		  
			Use the specified configuration file instead of the
			default.
		  
		  
			-v
		  
		  
			Verbose mode. Used once will provide progress feedback,
			used more than once will overflow even the biggest
			buffers. :-)
		  
		
	  
	
	
	  
		
		  Algorithms
		
	  
	  
		Indexes for the following search algorithms can currently
		be created: 
		
		  
			
soundex		  
		  
			Creates a slightly modified  
http://www.sog.org.uk/cig/vol6/605tdrake.pdfsoundex  key database.
			A soundex key encodes letters as digits, with similar
			sounding letters (c, k, q) given the same digit.  Vowels
			are not coded.
			Differences with the standard soundex algorithm are: 
			
			  
				Keys are 6 digits.
			  
			  
				The first letter is also encoded.
			  
			
		  
		  
			
metaphone		  
		  
			Creates a metaphone key database. This algorithm is
			more specific to English, but will get fewer "weird"
			matches than the soundex algorithm.
		  
		  
			
accents		  
		  
			Creates an accents key database. This algorithm will
			map all accented letters to their unaccented
			counterparts, so that a search for the unaccented
			word will yield all variations of this word with
			accents.
		  
		  
			
endings		  
		  
			Creates two databases which can be used to match common
			word endings. The creation of these databases requires
			a list of affix rules and a dictionary which uses those
			affix rules. The format of the affix rules and
			dictionary files are the ones used by the
			
http://fmg-www.cs.ucla.edu/fmg-members/geoff/ispell.html			ispell
 program. Included with the distribution are
			the affix rules for English and a fairly small English
			dictionary. Other languages can be supported by getting
			the appropriate affix rules and dictionaries. These are
			available for many languages; check the ispell
			distribution for more details.
		  
		  
			
synonyms		  
		  
			Creates a database of synonyms for words. It reads a
			text database of synonyms and creates a database that
			htsearch can then use. Each line of the text database
			consists of words where the first word will have the
			other words on that line as synonyms.
		  
		
	  
	
	
	  
		
		  Files
		
	  
	  
		
		  
			
attrs.html#config_dirCONFIG_DIR /htdig.conf
		  
		  
			The default configuration file.
		  
		
		
		  
			
attrs.html#database_dirDATABASE_DIR /db.accents.db
		  
		  
			(Output) Maps between characters with and without
			accents for accents fuzzy rule
		  
		
		
		  
			
attrs.html#database_dirDATABASE_DIR /db.metaphone.db
		  
		  
			(Output) Database of similar-sounding words for
			metaphone fuzzy rule
		  
		
		
		  
			
attrs.html#database_dirDATABASE_DIR /db.soundex.db
		  
		  
			(Output) Database of similar-sounding words for soundex
			fuzzy rule
		  
		
		
		  
			
attrs.html#common_dirCOMMON_DIR /english.0, attrs.html#common_dirCOMMON_DIR /english.aff
		  
		  
			(Input) List of words and affix rules used to generate
			endings
		  
		
		
		  
			
attrs.html#common_dirCOMMON_DIR /root2word.db, attrs.html#common_dirCOMMON_DIR /word2rood.db
		  
		  
			(Output) Database used for endings fuzzy rule
		  
		
		
		  
			
attrs.html#common_dirCOMMON_DIR /synonyms
		  
		  
			(Input) List of groups of words considered synonymous
		  
		
		
		  
			
attrs.html#common_dirCOMMON_DIR /synonyms.db
		  
		  
			(Output) Database used for synonyms fuzzy rule
		  
		
	  
	
	
	  
		
		  See Also
		
	  
	  
		
htdig.htmlhtdig ,
		
htmerge.htmlhtmerge ,
		
htsearch.htmlhtsearch ,
		
attrs.htmlConfiguration file format , and
		
http://fmg-www.cs.ucla.edu/fmg-members/geoff/ispell.html		ispell
.
	  
	
	
	Last modified: $Date: 2004/06/12 13:39:13 $
  
