wwwstat -- Summarize NCSA httpd access logs

Copyright (c) 1994 Regents of the University of California. All rights reserved.

The wwwstat program processes an NCSA httpd_1.0 access_log file and outputs a log summary in HTML format suitable for publishing on your web site. The program is written in Perl and, once customized for your site, should work on any UNIX-based system with Perl 4.019 or later.

As an example of what wwwstat can do, see the server statistics for UC Irvine's Department of Information & Computer Science.

For easy distribution, the wwwstat program is available as a gzip'd tar file or as a compress'd tar file. The distribution consists of the following files:

The intention is that wwwstat be run by a wrapper program as a crontab entry just before midnight, with its output redirected to a temporary file that can then be moved to the site's summary file. The temporary file is necessary because the old summary file will be fstat'd for its size during wwwstat's processing (resulting in a size of 0 if output is redirected to the summary file during processing).
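
The wrapper described above could be sketched as a small shell function. This is an illustration under stated assumptions, not necessarily the wrapper used at any site: the function name publish_summary, the temporary-file naming, and the paths in the usage comment are hypothetical, and it assumes the generator writes its summary to standard output.

```shell
#!/bin/sh
# Hypothetical wrapper sketch: run a summary generator with its output
# redirected to a temporary file, then move that file into place.
# Usage: publish_summary SUMMARY_FILE COMMAND [ARGS...]
publish_summary() {
    summary=$1
    shift
    # Keep the temporary file next to the summary so mv is a plain rename.
    tmp="$summary.tmp.$$"
    # The existing summary is not opened for writing here, so it keeps
    # its size while the generator runs.
    if "$@" > "$tmp"; then
        mv -f "$tmp" "$summary"
    else
        status=$?
        # On failure, discard the partial output and keep the old summary.
        rm -f "$tmp"
        return "$status"
    fi
}

# Example call (both paths are placeholders, not real locations):
#   publish_summary /path/to/summary.html /path/to/wwwstat
```

A script containing this function and one such call could then be scheduled from crontab shortly before midnight. Because the old summary is replaced only after the generator succeeds, a failed run leaves the previous summary in place.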

This program could easily be modified to run as a CGI script, but that is not recommended for slow processors or heavily utilized servers.

Obviously, versions of this program would also be nice for the Plexus and CERN servers. However, I found that much of the logic for finding file names was just too specific to the NCSA server to justify all the other work of making this general. Feel free to do so yourself.

Version History

Known problems [Due to limits of info in access_log]
  Assumes that a file's size has not changed since the access.
  Cannot estimate the transmission of bytes from scripts.
  Ignores the fact that some accesses may have been denied.
  Ignores any Redirection directives.

Version 0.2 --- January 21, 1994
  Added support for the /~username form of files.
  Added general support for Alias and ScriptAlias configurations.
  Now reads the server config file to get site configuration.
  Sped up the process by caching file sizes (fewer file stats).
  Added options to display full IP addresses in subdomain listing.
  Expanded some form field sizes.
  Now sorts archive section by name.

Version 0.1 --- January 14, 1994
  Added support for HTML output.
  Added reversed subdomain statistics.
  Added the logic for grouping files in archive sections.
  Rewrote the whole damn thing.

Version 0.0
  Originally from fwgstat 0.35 (jem@sunsite.unc.edu) with all the extra options stripped out and many bugs fixed. In turn, fwgstat was heavily based on xferstats, which is packaged with the Wuarchive FTP daemon. Fwgstat is good for multi-server stats.

If you have any suggestions, bug reports, fixes, or enhancements, send them to the author, Roy Fielding (fielding@ics.uci.edu).

This software has been developed by Roy Fielding as part of the Arcadia project at the University of California, Irvine. Wwwstat was originally based on a multi-server statistics program called fwgstat-0.035 by Jonathan Magid (jem@sunsite.unc.edu) which, in turn, was heavily based on xferstats (packaged with version 17 of the Wuarchive FTP daemon) by Chris Myers (chris@wugate.wustl.edu). As such, this software and all derivations will always remain in the public domain.

Redistribution and use in source and binary forms are permitted, subject to the restriction noted below, provided that the above copyright notice and this paragraph and the following paragraphs are duplicated in all such forms and that any documentation, advertising materials, and other materials related to such distribution and use acknowledge that the software was developed in part by the University of California, Irvine. The name of the University may not be used to endorse or promote products derived from this software without specific prior written permission. THIS SOFTWARE IS PROVIDED ``AS IS'' AND WITHOUT ANY EXPRESS OR IMPLIED WARRANTIES, INCLUDING, WITHOUT LIMITATION, THE IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS FOR A PARTICULAR PURPOSE.

Use of this software in any way or in any form, source or binary, is not allowed in any country which prohibits disclaimers of any implied warranties of merchantability or fitness for a particular purpose or any disclaimers of a similar nature.

IN NO EVENT SHALL THE UNIVERSITY OF CALIFORNIA BE LIABLE TO ANY PARTY FOR DIRECT, INDIRECT, SPECIAL, INCIDENTAL, OR CONSEQUENTIAL DAMAGES ARISING OUT OF THE USE OF THIS SOFTWARE AND ITS DOCUMENTATION (INCLUDING, BUT NOT LIMITED TO, LOST PROFITS) EVEN IF THE UNIVERSITY OF CALIFORNIA HAS BEEN ADVISED OF THE POSSIBILITY OF SUCH DAMAGE.

This work was sponsored in part by the Defense Advanced Research Projects Agency under Grant Number MDA972-91-J-1010. This software does not necessarily reflect the position or policy of the U.S. Government and no official endorsement should be inferred. Their support is appreciated.

See the Artistic License for more information.


Department of Information and Computer Science,
University of California, Irvine, CA 92717-3425