Freeware: ajSitemap - Google Sitemap Generator / Web Spider ( Java )

Version .04

This freeware Java code will spider your website and produce the Google Sitemap file / index file. You can also pipe the output for a text file of all the links on your site.

NOTE: This software is presented as is and is technically in a beta state. If you have issues, requests or comments, you can add them to the ajSitemap post on JATFQ.com.
Changelog

V0.05
Added -d {path} command line option to dump links to a text file at runtime. This lets you save the current links in case of a crash. You can also use -i {path or stdin} to build a sitemap from text files.

Huge internal speed increase from moving the storage code from Vectors to TreeMaps.
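
To illustrate why this kind of change helps, here is a minimal sketch (not the actual ajSitemap code). It assumes the spider needs to check whether a URL has already been seen: a Vector has to be scanned linearly for that, while a TreeMap finds the key in O(log n) and keeps the links sorted as a side effect. The class name LinkStoreSketch, its methods, and the depth value are hypothetical, and the syntax is for a modern JDK rather than the Java 1.4 minimum.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.TreeMap;

public class LinkStoreSketch {
    // Hypothetical store: URL -> crawl depth. TreeMap lookups are O(log n),
    // whereas Vector.contains() walks the whole list on every check.
    private final TreeMap<String, Integer> seen = new TreeMap<>();

    /** Records the URL and returns true only if it was not seen before. */
    public boolean addIfNew(String url, int depth) {
        if (seen.containsKey(url)) {
            return false;
        }
        seen.put(url, depth);
        return true;
    }

    /** Returns the stored URLs in sorted order (TreeMap keeps keys sorted). */
    public List<String> sortedUrls() {
        return new ArrayList<>(seen.keySet());
    }

    public int size() {
        return seen.size();
    }
}
```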

Got rid of memory leak for large sitemaps.

V0.04
Added FTP support. Set username, password, host and path in the properties file. Make sure you extract the "LIB" directory beneath where the application is installed, or add it to your classpath.

V0.03
Added the ability to ping Google to let it know that the sitemap has been updated. This is a little redundant at the moment, as you really need to copy the file to your server first, but as FTP support is coming in the next version I thought I'd add it anyway. If you are using this in a batch file, you could consider just calling wget with the Google ping URL after your FTP command.

V0.02
Added multithreaded support. Specify the number of threads to use in the properties file. Defaults to a single thread.
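
As a rough sketch of what spidering with a configurable number of threads can look like (an illustration only, not how ajSitemap is actually written), the example below hands each URL to a fixed-size thread pool and collects the results in a thread-safe set. The class SpiderPoolSketch and the method visitAll are hypothetical, the page fetch is left out, and the code targets a modern JDK rather than Java 1.4.

```java
import java.util.List;
import java.util.Set;
import java.util.concurrent.ConcurrentSkipListSet;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.TimeUnit;

public class SpiderPoolSketch {

    /** Visits every URL using the given number of worker threads (at least 1). */
    public static Set<String> visitAll(List<String> urls, int threads) {
        // Thread-safe, sorted set, so duplicate URLs collapse to one entry.
        Set<String> visited = new ConcurrentSkipListSet<>();
        ExecutorService pool = Executors.newFixedThreadPool(Math.max(1, threads));
        for (String url : urls) {
            // A real spider would fetch and parse the page here;
            // this sketch only records that the URL was handled.
            pool.execute(() -> visited.add(url));
        }
        pool.shutdown();
        try {
            // Wait for the queued work to finish before returning.
            pool.awaitTermination(1, TimeUnit.MINUTES);
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
        }
        return visited;
    }
}
```

Passing 1 as the thread count gives the single-threaded default described above.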

Added support for GZIP encoding of the URL stream to reduce file transfer time. Turned on by default.
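
For readers curious how GZIP transfer works from the client side, here is a minimal sketch under the assumption that the spider fetches pages with the standard HttpURLConnection (the source does not say which HTTP client ajSitemap uses). The client sends an Accept-Encoding: gzip header and unwraps the body only if the server reports that it compressed it. The class GzipFetchSketch and its helper methods are hypothetical; gzip and gunzip are included only so the decompression step can be tried without a network connection.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.InputStream;
import java.io.UncheckedIOException;
import java.net.HttpURLConnection;
import java.net.URI;
import java.nio.charset.StandardCharsets;
import java.util.zip.GZIPInputStream;
import java.util.zip.GZIPOutputStream;

public class GzipFetchSketch {

    /** Opens a page, asking the server for a gzip-compressed body. */
    public static InputStream open(String pageUrl) throws IOException {
        HttpURLConnection conn =
                (HttpURLConnection) URI.create(pageUrl).toURL().openConnection();
        conn.setRequestProperty("Accept-Encoding", "gzip");
        InputStream body = conn.getInputStream();
        // Servers may ignore the header, so unwrap only when they say they compressed.
        return "gzip".equalsIgnoreCase(conn.getContentEncoding())
                ? new GZIPInputStream(body)
                : body;
    }

    /** Compresses text in memory; stands in for a server's gzip response. */
    public static byte[] gzip(String text) {
        try {
            ByteArrayOutputStream buffer = new ByteArrayOutputStream();
            try (GZIPOutputStream out = new GZIPOutputStream(buffer)) {
                out.write(text.getBytes(StandardCharsets.UTF_8));
            }
            return buffer.toByteArray();
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    /** Decompresses gzip data back into text. */
    public static String gunzip(byte[] data) {
        try (InputStream in = new GZIPInputStream(new ByteArrayInputStream(data))) {
            ByteArrayOutputStream buffer = new ByteArrayOutputStream();
            byte[] chunk = new byte[4096];
            int n;
            while ((n = in.read(chunk)) != -1) {
                buffer.write(chunk, 0, n);
            }
            return new String(buffer.toByteArray(), StandardCharsets.UTF_8);
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }
}
```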

 

Installation

Download the ZIP file and extract into a folder.

 

Running

java -jar "ajSitemap.jar" sitemapbuilder.properties http://your.site.com

or you can use the template batch file for Windows

ajSitemap http://your.site.com

( this will use the default properties file )

The sitemapbuilder.properties file contains a default configuration. You can change this file or create a separate one for each domain. If you rename the file, just change the start up parameters accordingly. The parameters are annotated in the default file.
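
As an illustration of how a per-domain properties file can be read, here is a minimal sketch using the standard java.util.Properties class. The key name "threads" is an assumption made for the example; check the annotated default file for the real parameter names. The class ConfigSketch and its methods are hypothetical and not part of ajSitemap.

```java
import java.io.FileReader;
import java.io.IOException;
import java.io.Reader;
import java.io.StringReader;
import java.io.UncheckedIOException;
import java.util.Properties;

public class ConfigSketch {

    /** Loads a properties file such as sitemapbuilder.properties from disk. */
    public static Properties fromFile(String path) throws IOException {
        try (Reader in = new FileReader(path)) {
            Properties props = new Properties();
            props.load(in);
            return props;
        }
    }

    /** Parses properties from a string; handy for trying out settings. */
    public static Properties fromText(String text) {
        Properties props = new Properties();
        try {
            props.load(new StringReader(text));
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
        return props;
    }

    /** Reads the hypothetical "threads" key, falling back to a single thread. */
    public static int threadCount(Properties props) {
        try {
            int value = Integer.parseInt(props.getProperty("threads", "1").trim());
            return Math.max(1, value);
        } catch (NumberFormatException e) {
            return 1; // unreadable value: use the single-thread default
        }
    }
}
```

Keeping one file per domain then only means passing a different path to fromFile, in the same way the file name is passed as the first start up parameter.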

The second parameter is the site to spider. Remember to include the "http://" at the front.

The code writes the spider information to stdout. To save it to a file, simply redirect the output, for example:

java -jar "ajSitemap.jar" sitemapbuilder.properties http://your.site.com > dump.txt

The user agent for the spider will appear as "ajSitemap ( www.web-max.ca/freeware/ajSitemap )"

Requirements
Java 1.4 or greater
 
Download
  Download ZIP file ( 14k )