## 4.7 - 22.05.2005 - Bugfixes and improved docs (by Hans Fr. Nordhaug) - Fixed several bugs related to local file search (on hard drive or CD-ROM) reported in the "Open Discussion" forum. - Fixed stupid design bug in test for HttpURLConnection. - Added rudimentary support for iframes. - Added info about link handling; qoutes and types. - Made the applet supports links where the URL is enclosed by single quotes. (Before it only followed URLs inside double quotes.) ## 4.6 - 18.09.2004 - Charset improvements and standalone app (by Hans Fr. Nordhaug) - HouseSpider can now be run as an standalone application avoiding the troublesome security settings of the appletviewer when indexing. This is a major improvement. (New classes HouseSpiderStub and BrowserControl doesn't add much to the size of the jar file.) - Added checks for missing trailing slash, default index pages and permanent redirects in start URL (URLstart parameter). - Using HttpURLConnection if possible to detect redirects (missing trailing slash in particular) and dead (404) URLs. - Fixed a bug in the Spider causing long files to load slowly. Changed from (misuse of) String to StringBuffer when building body text. - Translations now in UTF-8 to support languages that needs more characters than ISO-8859-1 can offer. The log file is (of course) also in UTF-8. - Added Charset parameter to select charset used for reading webpages. (ISO-8859-1 is the default charset if none is given.) The same charset is (of course) used for the index file. Solving feature request https://sourceforge.net/tracker/?func=detail&aid=832325&group_id=42811&atid=434247 NB! "†" is still not supported because of a bug in the Java implementation - need to rewrite everything using Swing ... ## 4.5 - 21.05.2004 - Major improvements (by Hans Fr. Nordhaug) - Added explicit charset set to ISO-8859-1. This fixes most problems with letters not correctly shown and such. (The goal is UTF.) - Added IndexPath applet tag to set (relative) path the index file. - Added IgnoreWords applet tag to set words that should be excluded while indexing/searching. - Added StatusLabel applet tag determining whether to show the status label or not. - Added StatusLine applet tag determining whether to show the status line or not. - Added Action applet tag determining whether to index or not as default. You may then use either "" or "" to switch between index or not indexing modus. (Useful if you don't want to index a menu that is common on all pages). - Added font name and size applet tags. - Added ShowURL applet tag determining whether to show the URL after the title in the result list. - Made the applet name (lower left hand corner) a clickable link. - Added applet tags to set the color of the clickable link. - Added internationalization support (i18n). Can also be used to change *all* messages from HouseSpider since the text isn't hard-coded but read from file. (Template available.) Use Lang applet tag to select language. - Meta keywords are now searched/indexed. - Meta description may be used in the result list (in addition to the title). The description is not included in the search. NB! As a result the format of the index-file has changed to: page contents title meta keywords##meta description URL where "##" is a separator (and must be included even though meta description is empty). - Added timing of search. - Added 'no results found'-message. - Added 'wait:'-keyword to InitRun parameter. - Minor improvements for URLs where filename has no extension. - Default index pages as "index.html", "index.htm", "index,php" are stripped of the URLs (to avoid duplications). ## 4.4 - 14.01.2003 - General improvements (by Hans Fr. Nordhaug) - Added parameters TargetHelp and TargetSearch for better control when opening pages. (Target still works and sets both to the same.) - Added parameter InitInput used for initial commands/search. (Useful for indexing via batch file.) - Added FileExclude parameter (accepting comma seperated list of files). Extended URLExclude to accept comma seperated list of URLs. - Added MaxSearch parameter (setting maximum number of hits for search). - Added check for empty name attribute for URLExclude and FileExclude parameter. (This is a bug fix - see http://sourceforge.net/forum/forum.php?thread_id=794725&forum_id=139768) - Added version info to applet. Improved gridbaglayout code. ## 4.3 - 20.09.2002 - General improvements (by Hans Fr. Nordhaug) - Much improved treatment of pages using framesets. - Much improved search functionality: phrases, exclude words and logical expression. - Changed searching of index file. The applet now searches the joint string of line 1 and 2, i.e., body text and title. - Changed use of index file. Line 1 now contains only the body text; everything inbetween and is ignored especially the title which was (and is) repeated in line 2. This decreases the size of the index file. - Added status line (for index file) to applet. Added parameter to turn the status line off. Added search/index progress to the status line. - Added debug parameter to applet. (0 - no debugging, 3 - full debugging.) - Added documentation URL parameter to applet. - Added Make.bat. Makefile for Windows. - Added applet parameter bgtextcolour and texcolour ## 4.2 - 01.04.2002 - General improvements (by Hans Fr. Nordhaug) - Support for a compressed index-file added. HouseSpider can now create a zipped index-file (using the kewword 'houseindex-zip' in stead of 'houseindex') and read a zipped index-file if it exists. - Improvement of crawling. It now handles href-tags with spaces/line shitfs, i.e '', and URLs with missing slash, i.e '...'. - Improvement of crawling. It now handles pages with frames. ## 4.1 - 22.01.2002 - Major bug fixes (done mostly in 2001 by Hans Fr. Nordhaug) - Bug in the Spider class that made spider search fail on Netscape 6.0 and Opera 6.0 is removed. - Major improvement of the spidering (crawling) procedure. - Added the possibility of excluding a directory from the search/indexing. ## 4.0 - 11.01.2002 - Major rewrite (done in 2000 by Keith L. Jackson). - Spider searching is added. - The functionality of the HouseIndexer class is put into HouseSpider, and hence, HouseIndexer and related files are removed. - Keywords are no longer used in the indexing or in the search applet. - Icons replace text buttons. No more internationalization. ## 3.1 - 06.01.2002 - Initial release on SourceForge