Welcome to

phraSEarch$

 

A Utility for Helping You Search and Retrieve Information

Not just Links-to-Information

 

Home

User Guide

Installation

Feedback

Download

Website

Tips for Improving Your Search Productivity

 

 

phraSEarch$ was used in intensive off-line searching for many hundreds of hours, by several researchers, prior to its becoming the premier commercial product that is now available to you. As needs were encountered and ideas expressed, many features and operational techniques were incorporated which culminated in the present release. Exploiting the productivity benefits of some of those design features are explained here. This file is kept up-to-date at the phraSEarch$ website. Visit the website regularly and save the updated files to your \phraSEarch directory (folder). The date of this file is 5/16/2005.

$  The first time you use phraSEarch$     Tip: Use the pull-down menu >Help>Tutorial for a brief introduction to the search utility. The Tutorial demonstrates the use of all of the major input fields and buttons.

$  The most recent directory structure is remembered, which can reduce overall searching time. Establishing the directory structure can be a time-consuming procedure, when the structure contains thousands of folders. And, frequently, users wish to search on many different words and phrases in that same directory structure. Eliminating the redundant building of directory structures can greatly improve the performance of subsequent searches. Tip: Try to accomplish as many searches as can be foreseen, after having established large directory/sub-directory structures. PhraSEarch will use your forethought to your advantage.  Tip: Knowing that the directory structure will be saved, you may increase the context size for your first pass through your data files, decide what is a useful context size to have, and make subsequent passes using your most useful context size.

$  All search parameter sets are saved for re-use.   Your productivity can be improved by re-using search parameter sets from your previous sessions. Tip: Use the >File>Reload Search pull-down menu to locate and select previously used search parameter sets.

$  You may customize the search parameter set that is loaded at startup.   Tip: Use the >File>Save Search in INI File pull-down menu to save any current search parameter set in the phraSEarch.INI file. On future startups, that set of search parameters will be loaded and ready for use. Use the >File>Save Search in INI File pull-down menu any time you want to save a different search startup parameter set.

$  Use with E-Books: Many E-Books are available on the Internet for personal browsing. The Project Gutenberg website, the Internet's oldest producer of FREE electronic versions of books (e-texts), has over 15,000 E-Books available for free downloading. E-Books that are only available in binary form, and that require license activation (e.g. Adobe e-book and Microsoft e-book) can not supported by any third party software, including phraSEarch. Users of such products are required to purchase premium e-Book titles. Check with the www.phrasearch.com often, as new sources of e-books are listed as they are discovered. There are over 92 million web pages that contain both the words “ebook” and “download.”  Include the word “Free,” and there are still over 41 million hits.

$  If the highlighting appears to be working improperly: Consider cutting and pasting the words or phrases from the result file to the “More Phrases to Highlight” edit control. If it does not highlight a phrase, that includes what appear to be spaces, then the problem is most likely with alternate forms of the space character (e.g. non-breaking space, Em space, En space, ¼ Em space,…). The same issue sometimes exists with other alternate character representations in the source document (non-breaking dash, optional dash, Em dash, En dash, etc.). Tip: Entering the phrase as single words will usually solve it.

         Please recognize that sometimes, the author of a document (usually .doc or .html) will select part of a word or phrase, change its font size (or make it bold), then select the remaining part, and change its font size (or make it bold) to match the first part. In such cases, the resulting document can have font or other control characters imbedded within the word or phrase, invisible to the reader. Alas, the phraSEarch program cannot find such occurrences of those words or phrases.        

$  Use of the “All” check box: This feature is most useful when you want to include additional file types to your search. For example, as new web page formats come into existence, you may include them in your search. One should be careful selecting “All” File Types. This search parameter could cause the phraSEarch program to waste time by searching binary and image files, unsuccessfully. This depends on how the “Types_to_ignore=” parameter is set in the phraSEarch.INI file.  The default file types to ignore are: .bmp  .exe  .gif  .inf  .jpe  .jpg  .pdf  .png  .wmz.  Note that un-checking other parameters does not remove them from the file types to be scanned. Such action can only be accomplished by using the “Types_to_ignore=” parameter in the phraSEarch.INI file.  

        (Advanced:) If you have systems that use text files that have atypical file types (e.g. .csv, .dat, or .xls), using the “ALL” parameter will cause those files to be searched. If you know of large files on your system, which you would not want scanned, you should consider adding them to the “Types_to_ignore=” parameter in the phraSEarch.INI file. Be careful to separate file types with slashes (“/”). It is essential that the first character after the “Types_to_ignore=” be a slash (“/”), since that first character is used as a separator character.

$  On “strange formatting” in the output results file: Since you are in control of the search parameters which define the number of characters to select for the output document, before and after, there always exists a chance that you will select partial formatting control characters from searched .htm, .doc, and .rtf documents. These selected strings, including their (partial) formatting control characters, will then be placed into the resulting HTM file, and may inadvertently become part of the formatting that you view when the results are displayed by your browser. When this happens, you may get strange results. If you want to remove all HTML formatting strings from the documents that are searched, then un-check the Keep HTML Formatting: Check Box. This will not preempt the standard formatting (red highlighting, file names, etc…) in the normal output file. You may also increase or decrease the Characters Before: and Characters After: values in order to avoid or work around control characters from the original documents.

$ Browsers: The phraSEarch system has not been tested with browsers other than the Windows Internet Explorer, which is usually at:

C:/Program Files/Internet Explorer/IEXPLORE.exe

If you wish to specify a different path to your browser, you can change it in the [call] section of the file

C:/ Phrasearch/Phrasearch.INI,

and restart the program. Of course, we would be interested in your comments related to your experiences with other browsers.

$  Windows API Workaround:  phraSEarch$ can search any storage medium that has an address of the form x:\yyy. One user has encountered a situation where the “Browse for directory” button encountered an “Access Denied” message while attempting to access a search directory on a CD-Rom, caused by a limitation of the Windows API. The situation was easily remedied by entering the search path manually. We apologize for any inconvenience this deficiency may cause you. We are researching this incident and will provide a free update to registered users. Please visit our website weekly and watch for a notice that this has been resolved. Thank you.

 

If you wish to provide a Tip, to recommend changes, or to report errors, we welcome your emailing them to us at:

 

                                             

                  Thank you for using phraSEarch$

 

Copyright © 2004, 2005 APL Consultants of Houston. All Rights Reserved.