dtSearch PDF search highlighter

#dtSearch PDF search highlighter how to

- Added automatic detection of GB2312 and JIS encoding.
- Improved formatting of documents converted from Ami Pro and Quattro Pro to HTML.
- Fixed two search report bugs causing incorrect hit highlighting.
- Fixed index merge bug causing "Inconsistent doc ids from target index" error during merge.
- Fixed error updating index when the directory specified for temporary files is inaccessible.
- Fixed incorrect hit highlighting when Unicode Filtering options at search time differ from the options used to index a file. To ensure consistent options, Unicode Filtering options are stored in the index when the index is created, in the index_a.ix file.
- Fixed incorrect display of CreationDate and ModDate properties in PDF files.
- Added dtsoFfSkipDataSourceFields flag for Options.FieldFlags to prevent DocFields values from appearing in FileConverter output.
- Added dtsListIndexSkipNoiseWords flag for ListIndexJob to list words in an index without including any noise words.
- Added to COM interface: WordListBuilder.ListFieldValues, WordListBuilder.SetFilter, and IndexJob.EnumerableFields.
- Added more structural information to the output generated by conversion to the it_ContentAsXml file format.

- Added dtssGetWordBreaker API function to provide direct access to the dtSearch Engine's internal word breaker using the language analyzer API. For sample code demonstrating how to use this API, see the WordBreak example in examples\vc8\WordBreak.
- Added dtsSearchLanguageAnalyzerSynonyms flag to enable using a language analyzer to generate morphological variations on a search term at search time. When this flag is set, the language analyzer is called for each word or phrase in the search request. The flag dtsLaInputIsSearchTerm is passed to the language analyzer in dtsLaJob.flags, so the language analyzer knows why it is being called.
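
The interaction between search-time expansion and the "input is a search term" flag can be pictured with a small, self-contained sketch. This is not the dtSearch API: `LaJob`, `LA_INPUT_IS_SEARCH_TERM`, `toy_analyzer`, and `expand_request` are hypothetical stand-ins for the roles played by dtsLaJob, dtsLaInputIsSearchTerm, a language analyzer, and the search engine, and the flag value and the naive suffix rules are assumptions made purely for illustration. The sketch also handles single words only, not phrases.

```python
from dataclasses import dataclass, field

# Stand-in bit value (assumption). The real dtsLaInputIsSearchTerm constant
# is defined in the dtSearch headers and is not given in these notes.
LA_INPUT_IS_SEARCH_TERM = 0x1


@dataclass
class LaJob:
    """Hypothetical stand-in for dtsLaJob: input text, flags, output words."""
    text: str
    flags: int = 0
    output: list = field(default_factory=list)


def toy_analyzer(job: LaJob) -> None:
    """Toy analyzer: expands search terms, leaves document text unexpanded."""
    word = job.text.lower()
    if job.flags & LA_INPUT_IS_SEARCH_TERM:
        # Called on behalf of a search request: emit naive variants.
        # Real morphological analysis would be language-specific.
        job.output = [word, word + "s", word + "ing"]
    else:
        # Called while indexing document text: emit the word as-is.
        job.output = [word]


def expand_request(request: str) -> dict:
    """Call the analyzer once per word in the request, with the flag set."""
    expanded = {}
    for term in request.split():
        job = LaJob(text=term, flags=LA_INPUT_IS_SEARCH_TERM)
        toy_analyzer(job)
        expanded[term] = job.output
    return expanded
```

Passing the flag in the job lets a single analyzer entry point serve both indexing and searching, which is the reason the notes give for the flag: the analyzer knows why it is being called.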