Path: senator-bedfellow.mit.edu!bloom-beacon.mit.edu!newsfeed.stanford.edu!cyclone.bc.net!torn!newsfeed.nyu.edu!arlnews!oanews!info.usuhs.mil!cadig2.usna.navy.mil!crab.cs.usna.edu!not-for-mail
From: crabbe@usna.edu (Ric Crabbe and Amit Dubey)
Newsgroups: comp.ai,news.answers,comp.answers
Subject: Artificial Intelligence FAQ:5/6 AI Web Directories & Online Papers [Monthly posting]
Followup-To: comp.ai
Date: Wed, 2 Jun 2004 04:12:01 +0000 (UTC)
Organization: United States Naval Academy
Lines: 1153
Sender: crabbe@crab.cs.usna.edu
Approved: news-answers-request@MIT.EDU
Expires: 10 JUN 2004 21:54:49 GMT
Message-ID: <c9jk2h$5qv$1@crab.cs.usna.edu>
Reply-To: crabbe@usna.edu
NNTP-Posting-Host: crab.cs.usna.edu
X-Trace: cadig2.usna.navy.mil 1086149419 15001 131.122.89.100 (2 Jun 2004 04:10:19 GMT)
X-Complaints-To: news@usna.navy.mil
NNTP-Posting-Date: Wed, 2 Jun 2004 04:10:19 +0000 (UTC)
Summary: FTP Resources for AI
Xref: senator-bedfellow.mit.edu comp.ai:69217 news.answers:271532 comp.answers:57180

Archive-name: ai-faq/general/part5
Posting-Frequency: monthly
Version: 2.1
Maintainer: Ric Crabbe <crabbe@usna.edu> and Amit Dubey <adubey@coli.uni-sb.de>
URL: http://www.faqs.org/faqs/ai-faq/general/part5/preamble.html
Size: 46277 bytes, 1114 lines

;;; ****************************************************************
;;; Answers to Questions about Artificial Intelligence *************
;;; ****************************************************************
;;; Written by Amit Dubey, Ric Crabbe, and Mark Kantrowitz
;;; ai_5.faq 

If you think of questions that are appropriate for this FAQ, or would
like to improve an answer, please send email to the maintainers.

Parts 5 and 6 of the FAQ are now under heavy construction.  The FTP & WWW
resources have been combined, since both are browser accessible these
days.  We're also pruning the entries to sites that include
information other than whatever project is being done at that
University, etc.

Part 5 (WWW & FTP Resources):
  [5-0] Research Index (nee Citeseer)
  [5-1] Weblogs, repositories, web directories and communities not
        aimed primarily at researchers
  [5-2] Repositories and web directories aimed primarily at researchers
  [5-3] Web books (textbooks and otherwise)
  [5-4] AI Bibliographies available by FTP and WWW
  [5-5] Technical Reports available by FTP and WWW
  [5-6] Technical Reports for/by undergraduate students
  [5-7] Where can I get a machine readable dictionary, thesaurus, and
        other text corpora?
  [5-8] Where can I get training sets for machine learning algorithms?
  [5-9] What on-line journals are there?

Search for [#] to get to question number # quickly.

----------------------------------------------------------------
Subject: [5-0] Research Index (nee Citeceer)

http://citeseer.nj.nec.com/cs

"ResearchIndex is a scientific literature digital library that aims to
improve the dissemination and feedback of scientific literature, and
to provide improvements in functionality, usability, availability,
cost, comprehensiveness, efficiency, and timeliness.

Rather than creating just another digital library, ResearchIndex
provides algorithms, techniques, and software that can be used in
other digital libraries. ResearchIndex indexes Postscript and PDF
research articles on the Web..."

http://citeseer.nj.nec.com/cs

Research index (though most people still call it citeseer) is the
currently best way to find published papers AND TRACK CITATIONS.  It
is so cool, it's in the FAQ twice.

UPDATE 4/1/04:  ResearchIndex has been sick, and updates stopped.  It
has been moved to: http://citeseer.ist.psu.edu/cis where hopefully it
will continue.

----------------------------------------------------------------
Subject: [5-1] Repositories and directories not aimed primarily at researchers


AI Toolkit:

   The AI Toolkit is an educational software package developed to
   train future and current scientists and engineers on Artificial
   Intelligence methods, with a specific focus on soft computing
   techniques for robotic applications. The software is designed to be
   completely user-friendly, and provide hands-on interactive lessons
   to enhance understanding and knowledge of these powerful
   methods. In addition, the multicultural interfaces promote a
   universal atmosphere to enhance learning.

   The AI Toolkit is now freely available for download at:
   http://www.openchannelsoftware.com/projects/AI_Toolkit

   For more information, visit:  http://aitoolkit.jpl.nasa.gov


AI Topics: 
   http://www.aaai.org/Pathfinder/html/welcome.html
   Presented by AAAI, AI Topics is a "...web site provided ... for
   students, teachers, journalists, and everyone who would like to
   learn more about what artificial intelligence is, and what AI
   scientists do.

   [Their] goal is to offer a limited number of authoritative,
   non-technical resources that [they] have organized and annotated
   to provide you with meaningful access to basic information about
   the AI universe. Each of the AI Topics (see the navigation buttons
   to the left) will lead you to online and in-print sources of
   information.

   There has been an explosion in the number of Websites that catalog
   locations of AI information in a Yahoo-style directory.  Although they
   often duplicate functionality, in the interest of fairness, I will list all
   the ones I know about here.

AboutAI.com:
   http://www.aboutai.net/
   It is a second generation portal and a successor to the former
   ai.about.com.  aboutAI.Net contains dozens of AI-related essays and
   thousands of Web links. It will be focused on the technical side,
   describing the implementation of a wide range of AI
   techniques. Site actively uses data mining techniques for
   collaboration filtering, information retrieval, automatic
   classification of retrieved information, etc.

Kurzweilai.Net:
   http://www.kurzweilai.net
   An entertaining mix of flashy demos and interesting essays on AI,
   many by important poeple in the field.  Much philosophical
   discussion, less hard-core technical discussion.

MAKHFI

   A site dedicated to Neural Networks, including some open source
   tools that they developed.

	http://www.makhfi.com

   The site is still not complete, but we are fully committed to make
   it "the" resource site for ANN.

Neuron AI Directory
   http://www.neuron.co.uk 

Neural Network Information in Polish:
   http://www.neuron.of.pl

Generation 5:
   http://www.generation5.org

   "Generation5 is aimed at presenting a website that will educate the
   viewer on Artificial Intelligence -- whatever the level of
   expertise. We have essays on the applications and history of AI for
   those unfamiliar, to essays on programming and philosophy, all the way
   to full blown mathematically-orientated essays on genetic algorithms
   and neural networks.  Generation5 prides itself also in its interviews
   sections with exclusive interviews from top AI scientists like Marvin
   Minsky, Craig Reynolds, Roger Schank, Andre LaMothe and many
   others. Generation5 also has a comprehensive collection of original
   programs, all with source included. Demonstration programs like image
   recognizors, number recognizors, cellular automata creators, NLP
   demonstrator and more. All programs have an accompanying essay
   describing the workings of the programs.

   The aim of Generation5 is not only to educate the viewer, but to allow
   the viewer to contribute to further other people's knowledge. They can
   do this through the discussion boards, voting systems, and soon
   through AI Solutions (a scheme to submit code - with a accompanying
   monthly competition)."


Yahoo Clubs:
   Yahoo maintains a number of AI clubs.  There is the general
   AI Group, an online community that discusses AI (a resource
   for beginners).  Their website is:

      http://clubs.yahoo.com/clubs/artificialintelligencegroup

   There is also a resource for amateur robot enthusiasts at:

      http://clubs.yahoo.com/clubs/theroboticsclub

Pentomino Site:
   http://home.planetinternet.be/~odettedm
   Student run site at T.I.D. Ronse Belgium on searching Pentomino
   spaces. 

----------------------------------------------------------------
Subject: [5-2] Repositories aimed primarily for researchers


CMU AI Repository:

   The CMU Artificial Intelligence Repository was established by
   Carnegie Mellon University to contain public domain and freely
   distributable software, publications, and other materials of
   interest to AI researchers, educators, students, and practitioners. 
   The AI Repository currently contains more than a gigabyte of
   material and is growing steadily. 

   The AI Repository is accessible from:

      http://www.cs.cmu.edu/Web/Groups/AI/html/repository.html

ACT-R Faq:
   
   http://acs.ist.psu.edu/act-r-faq/
   "ACT-R is a proposed unified theory of cognition realised as a
   production system. It is a unified theory of cognition, in the
   spirit proposed by Newell (1990, Unified theories of cognition,
   Harvard, Cambridge, MA), in that it is designed to predict human
   behavior by processing information and generating intelligent
   behavior itself."



Soar Faq:

   http://acs.ist.psu.edu/soar-faq/soar-faq.html
   "Soar is used by AI researchers to construct integrated intelligent
   agents and by cognitive scientist for cognitive modeling. It can
   basically be considered in three different ways:
   1. A theory of cognition. As such it provides the principles behind
   the implemented Soar system. 
   2. A set of principles and constraints on (cognitive)
   processing. Thus, it provides a (cognitive) architectural
   framework, within which you can construct cognitive models. In this
   view it can be considered as an integrated architecture for
   knowledge-based problem solving, learning and interacting with
   external environments. 
   3. An AI programming language."



Fast Replanning Methods:
   Techniques for replaning under plan failure without starting from
   scratch:
   http://www.cc.gatech.edu/fac/Sven.Koenig/fastreplanning.html

Artificial Life Online:

   Sponsored by MIT Press and the Santa Fe Institute, Artificial Life
   Online is intended to be a central information collection and
   distribution site on the Internet for any and all aspects of the
   Artificial Life endeavor.

   A special feature of the BBS is a collection of 40 or so local
   newsgroups dedicated to a wide variety of topics in Artificial Life.

   Artificial Life Online is accessible by World-Wide Web from

      http://alife.santafe.edu/

Case based reasoning:
   http://www.ai-cbr.org/
   ai-cbr aims to provide a comprehensive information base to
   Case-Based Reasoning academics and commercial developers. Through
   the dissemination of information it is hoped a stronger world-wide
   community of people interested in Case-Based Reasoning will be
   fostered and the commercial use of Case-Based Reasoning will
   increase. (added to the FAQ 2/2/00)

   http://www.aic.nrl.navy.mil/~aha/research/case-based-reasoning.html
   A very complete list of resources including tutorials 

Consortium for Lexical Research:

   clr.nmsu.edu:/CLR/ [128.123.1.12]

   Archive containing a variety of programs and data files related to
   natural language processing research, with a particular focus on
   lexical research. The file 00README.clr.site is a good place to start.
   See the file catalog or catalog.ps for a listing of the contents of
   the archive. Long descriptions are in the info/ subdirectory.
   Materials for paid-up members of the Consortium are in the
   members-only/ subdirectory. Public materials include the Alvey Natural
   Language Tools, Sowa's Conceptual Graph parser implemented in YACC by
   Maurice Pagnucco, a morphological parsing lexicon of English, a
   phonological rule compiler for PC-KIMMO, C source code for the NIST
   SGML parser, PC-KIMMO sources, the 1911 Roget Thesaurus, and a variety
   of word lists (including English, Dutch, and male/female/last names).
   Comments and questions may be directed to lexical@nmsu.edu.

   There are also some materials in clr.nmsu.edu:/pub/ unrelated to
   the archive.   

Fuzzy Logic Repositories:

   ntia.its.bldrdoc.gov:/pub/fuzzy/ [132.163.64.201] contains information
   concerning fuzzy logic, including bibliographies (bib/), product 
   descriptions and demo versions (com/), machine readable published
   papers (lit/), miscellaneous information, documents and reports (txt/), 
   and programs, code and compilers (prog/). You may download new items 
   into the new/ subdirectory. If you deposit anything in new/, please 
   inform fuzzy@its.bldrdoc.gov. The repository is maintained by 
   Timothy Butler, tim@its.bldrdoc.gov. 

Genetic Algorithms:

   The Genetic Algorithms Repository is accessible is also a WWW version at

      http://www.aic.nrl.navy.mil/galist/

   The information files includes Nici Schraudolph's survey of free and
   commercial GA software (send email to <schraudo@cs.ucsd.edu> to add to
   the list).

   The software includes GAC (a simple GA written in C), GAL (a simple GA
   written in Common Lisp), GAucsd, GECO (a Common Lisp toolbox for
   constructing genetic algorithms), GENESIS, GENOCOP, Paragenesis (a
   parallel version of GENESIS that runs on the CM-200), SGA-C (a C
   implementation/extension of Goldberg's SGA system).

Intelliwise:

   Sergio Navega maintains a large collection of AI links:

      http://www.intelliwise.com/links.htm

Funic Neural Nets Archive Site:

   The Finnish University maintains an archive site containing a large
   collection of neural network papers and public domain software.
   The files are available through the web interface at
   http://www.nic.funet.fi/pub/sci/neural or through FTP from
   ftp://funic.funet.fi:/pub/sci/neural.  FTP users: see the file 01README
   for details.   There's also a directory for non-neural net AI stuff
   in the directory /pub/sci/ai.  (Web service is still experimental as
   of 05/29/99).

   There is a list of mirrored ftp sites is in 04Neural_FTP_Sites.  For
   further information, contact neural-adm@funic.funet.fi or Marko
   Gronroos <magi@funic.funet.fi> (or <magi@utu.fi>).

OSU Neuroprose:

     archive.cis.ohio-state.edu:/pub/neuroprose/   [128.146.8.52]

   This directory contains technical reports, mostly from the early 90's, as a
   public service to the connectionist and neural network scientific community
   which has an organized mailing list (for info:
   connectionists-request@cs.cmu.edu)

NL Software Registry:
[maintainer's note: links upto this point haven't been checked]

   The Natural Language Software Registry is a catalogue of software
   implementing core natural language processing techniques, whether
   available on a commercial or noncommercial basis. Some of the topics
   listed include speech signal processing, morphological analysis,
   parsers, natural language generation systems, and knowledge
   representation systems. The second edition of the catalog contains
   more than 100 descriptions of natural language processing software.
   The catalogue is available from the German Research Institute for
   Artificial Intelligence (DFKI) in Saarbruecken (Germany) at the URL

      http://www.dfki.de/lt/registry

   The email contact for the site is lt-www@dfki.de

Essex ROBOTS Archive:
   Contains robotics related information, hasn't been updated since 1995 or so:
   ftp.essex.ac.uk:/pub/robots/

AI IN DESIGN WEBLIOGRAPHY 
   http://www.cs.wpi.edu/Research/aidg/AIinD-hotlist.html
   These web pages contain links to pretty much everything 
   concerned with the application of AI to Design.

Miscellaneous AI:

   Some miscellaneous AI programs may be found on ftp.uu.net:/pub/ai/
   Most are mirrors of programs available at other sites.

   AI_ATTIC is an anonymous ftp collection of classic AI programs and
   other information maintained by the University of Texas at Austin.  It
   includes Parry, Adventure, Shrdlu, Doctor, Eliza, Animals, Trek, Zork,
   Babbler, Jive, and some AI-related programming languages.     This
   archive is available by anonymous ftp from ftp.cc.utexas.edu
   in the directory /pub/AI_ATTIC.  For more information, contact
   atticmaster@bongo.cc.utexas.edu.

   The QWERTZ toolbox, a library of Standard ML modules with an emphasis
   on symbolic Artificial Intelligence programming, (including
   implementations of heuristic search and an ATMS reason maintenance
   system) may be obtained by anonymous ftp from

      ftp.gmd.de:/gmd/ai-research/Software/qwertz.tar.gz

   For more information, write to Tom Gordon <thomas.gordon@gmd.de>.

----------------------------------------------------------------
Subject: [5-3] Web books (textbooks and otherwise)
[This is a new topic, and we're just building the list.  Please submit
any web-based books you know about.]

Practical Artificial Intelligence Programming in Java, by Mark Watson.
"...covers AI programming techniques using Java."
http://www.markwatson.com/

Sutton, R.S. and Barto, A.G. (1998) Reinforcement Learning: An
introduction. MIT Press, Cambridge Mass, is online at:
http://www-anw.cs.umass.edu/~rich/book/the-book.html

   
----------------------------------------------------------------
Subject: [5-4] AI Bibliographies available by FTP

General:

   There are many recent papers at:

      http://www.cora.whizbang.com

   You can both browse and search; the searching ranks papers based on
   how often they have been referenced.
   
   [I think whizbang.com went away in the .com bust.  -ric]
 
Fuzzy Logic:
         
   A BibTeX database of references addressing neuro-fuzzy issues can be
   obtained by anonymous ftp from 

      ftp.tu-bs.de:/local/papers/ [134.169.34.15]

   as the (ascii) file fuzzy-nn.bib.

Genetic Algorithms:

      http://www.cogs.susx.ac.uk/users/ezequiel/alife-page/alife.html

Logic Programming, Constraints:

   A BibTeX bibliography for Constraint Logic Programming is available
   by anonymous ftp from 

      archive.cis.ohio-state.edu:/pub/clp/

   in the bib/ and papers/ subdirectories.

NLP/CL:

   For information on a fairly complete bibliography of computational
   linguistics and natural language processing work from the 1980s, send
   mail to clbib@csli.stanford.edu with the subject HELP. 

   The CSLI linguistics bibliography contains 3,300 entries in
   bib/tib/refer format. The bibliography is heavily slanted towards
   phonetics and phonology but also includes a fair amount of
   computational morphology, syntax, semantics, and psycholinguistics.
   The bibliography can be used with James Alexander's tib
   bibliography system, which is available from minos.inria.fr
   [128.93.39.5] among other places. The bibliography itself is available
   by anonymous ftp from 

      csli.stanford.edu:/pub/bibliography/

   Contributions are welcome, but should be in tib format.
   For more information, contact Andras Kornai <kornai@csli.stanford.edu>

NLG:

   Robert Dale's Natural Language Generation (NLG) bibliography is
   available by anonymous ftp from 

      scott.cogsci.ed.ac.uk:/pub/nlg/ [129.215.144.3] 

   Note that it is formatted for A4 paper. Stick in a line 
      .94 .94 scale
   after the %! line to print on 8.5 x 11 paper. For further information,
   write to Robert Dale, University of Edinburgh, Centre for Cognitive
   Science, 2 Buccleuch Place, Edinburgh EH8 9LW Scotland, or
   <R.Dale@edinburgh.ac.uk> or <rdale@microsoft.com>.

   Mark Kantrowitz's Natural Language Generation (NLG) bibliography is
   available by anonymous ftp from 

      ftp.cs.cmu.edu:/user/ai/areas/nlp/nlg/bib/mk/ [128.2.206.173] 

   In addition to the tech report, the BibTeX file containing the
   bibliography is also available.  The bibliography contains more than
   1,200 entries. A searchable index to the bibliography is
   available via the URL

      http://liinwww.ira.uka.de/bibliography/Ai/nlg.html

   Additions and corrections should be sent to mkant@cs.cmu.edu. 

Neural Nets, Learning:

   A bibliography of over 1000 entries about Self-Organizing Map
   (SOM) and Learning vector Quantization (LVQ) studies is
   available by anonymous ftp from

      cochlea.hut.fi:/pub/ref/

   as the files references.bib.Z (BibTeX file) and references.ps.Z
   (PostScript file). Please send additions and corrections to 
   biblio@cochlea.hut.fi.

   An extensive collection of references on Principal Component Analysis
   (PCA) neural networks and learning algorithms is available by
   anonymous ftp from dendrite.hut.fi:/pub/ref/ in LaTeX and PostScript
   formats. The list was compiled by Liu-Yue Wang, a graduate student of
   Erkki Oja, and updated by Juha Karhunen, all from Helsinki University
   of Technology, Finland.  For more information, contact Erkki Oja
   <oja@dendrite.hut.fi>.

   A bibliography of PCA algorithms is available by anonymous ftp from
   ftp.ai.mit.edu:/pub/sanger-papers/ as pca.bib. For more information,
   contact Terry Sanger <tds@ai.mit.edu>.

   A 36-page bibliography of connectionist models with symbolic
   processing is available by anonymous ftp from Neuroprose

      archive.cis.ohio-state.edu:/pub/neuroprose/ [128.146.8.52]

   as the file sun.nn-sp-bib.ps.Z. For more information, contact
   Ron Sun <rsun@athos.cs.ua.edu>.

Nonmonotonic Logic, Belief Revision:

   A bibliography on belief revision and nonmonotonic logics with
   about 2,000 items is available by anonymous ftp from

      tarski.phil.indiana.edu:/pub/morado/  [129.79.134.34]

   as nonmono.bib or nonmono.bib.Z. The file is also available by WAIS as

      wais://tarski.phil.indiana.edu/nonmono.bib?

   and by gopher/WWW. Please send additions and corrections to Raymundo
   Morado <morado@phil.indiana.edu>. 

Speech: 

   A bibliography of papers on Silicon Auditory Models (VLSI
   implementations of auditory representations) is available by anonymous
   ftp from

      hobiecat.pcmp.caltech.edu:/pub/anaprose/lazzaro/sa-biblio.ps.Z

   For more information, write to John Lazzaro <lazzaro@boom.cs.berkeley.edu>

Multi-agent Systems

   http://dis.cs.umass.edu/research/agents-learn.html

----------------------------------------------------------------
Subject: [5-5] Technical Reports available by WWW/FTP

This section lists the anonymous ftp sites for technical reports from
several universities and other organizations. Some of the sites
provide only an online catalog of technical reports, while the rest
make the actual reports available online. The email address listed is
that of the appropriate person to contact with questions about
ordering technical reports. 

The main source of tech reports is now from Networked Computer Science
Technical Reference Library  or NCSTRL (pronounced "ancestral").
It's home page is: http://www.ncstrl.org/  
If that is a problem, you can go directly to:
http://cs-tr.cs.cornell.edu/


Other general locations for technical reports from several
universities include:

   wuarchive.wustl.edu:/doc/techreports/ [128.252.135.4]
   cs-archive.uwaterloo.ca:/cs-archive/  (see Index for an index)
      AKA watdragon.uwaterloo.ca [129.97.140.24]

The uwaterloo archive includes tech reports from the Logic Programming
and Artificial Intelligence Group (LPAIG) of the University of Waterloo.

There is also a WAIS server containing tech report abstracts that can be
searched. To use, create the file ~/wais-sources/cs-techreport-abstracts.src 
containing
   (:source
      :version  3
      :ip-address "130.194.74.201"
      :ip-name "daneel.rdt.monash.edu.au"
      :tcp-port 210
      :database-name "cs-techreport-abstracts"
      :cost 0.00
      :cost-unit :free
      :maintainer "wais@daneel.rdt.monash.edu.au")
and invoke your local wais client. To add to it, email abstracts of
your papers to wais@rdt.monash.edu.au in the following format:
   %TI Title
   %AU Author (use multiple %AU lines for multiple authors)
   %PU Published In (citation information)
   %AV Availability (e.g., ftp reports.adm.cs.cmu.edu:/1992/CMU-CS-92-101.ps)
   %OR Organization (see cs-techreport-archives.src for institution codes)
   %LT Local title (e.g., tech report number)
   %DA Date (and, if you want, %MN Month, %YR Year)
   %AB Abstract
If your papers are not available by FTP, you can use a %AV line such as:
   %AV mail harry.bovik@cs.cmu.edu
Further instructions are available from
   daneel.rdt.monash.edu.au:/pub/techreports/reports/README  
[Based on a post by Ashwin Ram.]

Also see the Unified Computer Science Technical Report Index
    http://cs.indiana.edu/cstr/search
[this archive appears to be out of date -ed]

A list of FTP sites for technical reports and papers can be found in
    http://www.rdt.monash.edu.au/tr/siteslist.html

A list of more than 230 sites publishing CS tech reports may be
obtained by anonymous ftp from

    ftp.rdt.monash.edu.au:/pub/techreports/sites/sites-list-data

To receive notification of new tech report sites, send mail to
compdoc-techreports-request@ftp.cse.ucsc.edu to join the mailing list.

An archive of linguistics papers and preprints is available from
linguistics.archive.umich.edu:/linguistics/papers/. Contact John Lawler
(jlawler@umich.edu) or linguistics-archivist@umich.edu for more
information.

The Concurrent Engineering Research Center (CERC) at West Virginia
University has placed ASCII versions of the concurrent
engineering-related abstracts (over 500) that were on CERCnet, ASCII
back issues of the Concurrent Engineering Research in Review journal
(now discontinued), and Postscript copies of CERC technical reports in
the gopher server gopher.cerc.wvu.edu.  In addition, many of the CERC
technical reports, including journal articles, symposium papers,
theses, dissertations, and issues of the Concurrent Engineering
Research in Review journal, are available as Postscript versions via
anonymous ftp from 

   babcock.cerc.wvu.edu:/pub/techReports/ [157.182.44.36]

An index to all the reports, including some that are
available only in hardcopy, is contained in the file "CERC-TR-INDEX".
If you need additional information, contact Mary Carriger, CERC Office
of Information Services, at carriger@cerc.wvu.edu.

The newsgroup comp.doc.techreports is devoted to distributing lists of
tech reports and their abstracts.

   MIT Artificial Intelligence Laboratory:
      ftp    -- publications.ai.mit.edu:/ai-publications/
      email  -- publications@ai.mit.edu
      www    -- http://www.ai.mit.edu/research/publications/publications.shtml

      A full catalog of MIT AI Lab technical reports (and a listing of recent
      updates) may be obtained from the above location, by writing to
      Publications, Room NE43-818, M.I.T. Artificial Intelligence Laboratory,
      545 Technology Square, Cambridge, MA 02139, USA, or by calling
      1-617-253-6773. The catalog lists the technical reports ("AI Memos") 
      with a short abstract and their current prices. There is also a charge
      for shipping. Some recent tech reports (since 1991) are available in the
      ai-publications/ subdirectory; older technical reports are NOT
      available by ftp. A bibliography is in the bibliography/ directory.

   CMU School of Computer Science:
      ftp   -- reports.adm.cs.cmu.edu
      email -- Technical.Reports@cs.cmu.edu
      www   -- reports-archive.adm.cs.cmu.edu/cs.html

   CMU Software Engineering Institute:
      ftp   --  ftp.sei.cmu.edu:/pub/documents
      email --  bjz@sei.cmu.edu
      www   --  www.sei.cmu.edu/publications/publication.html

   Yale: 
      ftp   --  dept.cs.yale.edu:/pub/TR/

   University of Washington CSE Tech Reports:
      ftp   --  june.cs.washington.edu:/tr  
      email --  tr-request@cs.washington.edu

   ================

   AT&T Bell Laboratories:
      ftp   -- netlib.att.com:/netlib/research/cstr/
      bib.Z contains short bibliography, including all the technical
      reports contained in this directory.

      ftp   -- research.att.com:/dist/ai

      [Maintainer's note: I assume these have been moved over to Lucent's
       domain?]

   Argonne National Laboratory:
      ftp   -- anagram.mcs.anl.gov:/pub/tech_reports
      email -- wright@mcs.anl.gov

      Contains MCS Division preprints and technical memoranda,
      available as either .dvi or .ps files. For descriptions of the
      contents, see the subdirectory pub/tech_reports/abstracts; for
      the files themselves see the subdirectory pub/tech_reports/reports.

   Boston University:
      ftp   --  cs.bu.edu:/techreports/
      email --  techreports@cs.bu.edu

   Brown University:
      ftp   --  wilma.cs.brown.edu:/techreports/
      email --  techreports@cs.brown.edu

   Cambridge University: Speech, Vision & Robotics Group
      ftp   --  svr-ftp.eng.cam.ac.uk:/reports/

   Columbia University:
      ftp   --  cs.columbia.edu:/pub/reports
      email --  tech-reports@cs.columbia.edu

   DEC Cambridge Research Lab:
      ftp   --  crl.dec.com:/pub/DEC/CRL/abstracts/
		crl.dec.com:/pub/DEC/CRL/tech-reports/

   DEC Paris Research Lab:
      email --  doc-server@prl.dec.com
                Put commands in Subject: line of the message.
                To get a list of articles, use
                        send index articles
                To get a list of tech reports, use
                        send index reports

   DEC WRL:
      email -- wrl-techreports@decwrl.dec.com
               To get a helpfile, send a message with 
                  help
	       in the subject line.

   DFKI:
      ftp   -- duck.dfki.uni-sb.de:/pub/papers
      email -- Martin Henz (henz@dfki.uni-sb.de)

   Duke University:
      ftp   --  cs.duke.edu:/dist/papers/
		cs.duke.edu:/dist/theses/
      email --  techreport@cs.duke.edu  [unknown user, 7/7/93]

   Edinburgh:
      A list of available reports can be sent via email. Send requests
      for information about reports from the Center for Cognitive Science 
      to cogsci%ed.ac.uk@nsfnet-relay.ac.uk, and from the Human Communication 
      Research Center to HCRC%ed.ac.uk@nsfnet-relay.ac.uk. 

   Electrotechnical Laboratory, Japan:
      Reports from the Cooperative Architecture project (half AI, half
      software engineering).
      ftp   -- etlport.etl.go.jp:/pub/kyocho/Papers [192.31.197.99]
               See file Index.English.
      email -- Hideyuki Nakashima <nakashim@etl.go.jp>.

   Georgia Tech College of Computing, AI Group:
      ftp   -- ftp.cc.gatech.edu:/pub/ai (130.207.3.245) 
      email -- Professor Ashwin Ram <ashwin@cc.gatech.edu>

   HCRC (Human Communication Research Centre):
      ftp   -- scott.cogsci.ed.ac.uk:/pub/HCRC-papers/
      mail  -- Fiona-Anne Malcolm
               Human Communication Research Centre
               2 Buccleuch Place, Edinburgh, UK

   Illinois:
      email -- Erna Amerman <erna@uiuc.edu>

   Illinois Genetic Algorithms Laboratory (IlliGAL):
      email -- Eric Thompson <library@gal1.ge.uiuc.edu>
      phone -- 217-333-2346 (9AM to 5PM CT, M-F)
      mail  -- Illinois Genetic Algorithms Laboratory
               Department of General Engineering
               117 Transportation Building
               104 South Mathews Avenue
               Urbana, IL 61801-2996
      ftp   -- gal4.ge.uiuc.edu:/pub/papers/IlliGALs/
               Includes the GA bibliography and the Messy GA code in C
               (in /pub/src/) and preprints (in /pub/papers/Publications)
      www   -- http://gal4.ge.uiuc.edu/illigal.home.html

   Indiana:
      ftp   -- cogsci.indiana.edu:/pub         [129.79.238.12]
      ftp   -- ftp.cs.indiana.edu:/pub/techreports [129.79.254.191]

   INRIA, France:
      ftp   -- ftp.inria.fr:/INRIA/publication/

   Institute for Learning Sciences at Northwestern University:
      ftp   --  aristotle.ils.nwu.edu:/pub/papers/
      phone --  708-491-3500

   Mechanized Reasoning Group (MRG):
      ftp   --  ftp.mrg.dist.unige.it:/pub/mrg-ftp
      email --  Fausto Giunchiglia <fausto@irst.it>
                Mechanized Reasoning Group, IRST
                38050 Povo Trento, Italy
                Tel: +39 461-314444 (secr.)
                     +39 461-314436 (office)
                Fax: +39 461-302040 / 314591

   National University of Singapore:
      ftp   -- ftp.nus.sg:/pub/NUS/ISCS/techreports

   New York University (NYU):
      ftp   --  cs.nyu.edu:/pub/tech-reports

   OGI:
      ftp   --  cse.ogi.edu:/pub/tech-reports
      email --  csedept@cse.ogi.edu

   Ohio State University, Laboratory for AI Research
      ftp   -- nervous.cis.ohio-state.edu:/pub/papers
      email -- lair-librarian@cis.ohio-state.edu

   OSU Neuroprose:
      ftp   -- archive.cis.ohio-state.edu:/pub/neuroprose  (128.146.8.52)

      This directory contains technical reports as a public service to the
      connectionist and neural network scientific community which has an
      organized mailing list (for info: connectionists-request@cs.cmu.edu)
      Includes several bibliographies.

   Stanford:
      ftp   -- elib.stanford.edu:/cs
   
      Very spotty collection.

   SRI:
      email -- Donna O'Neal, donna@ai.sri.com

   SUNY Buffalo:
      ftp   -- ftp.cs.buffalo.edu:/pub/tech-reports/

   SUNY at Stony Brook:
      ftp   -- sbcs.sunysb.edu:/pub/TechReports
      email -- rick@cs.sunysb.edu or stark@cs.sunysb.edu

      The /pub/sunysb directory contains the SB-Prolog implementation
      of the Prolog language. Contact warren@sbcs.sunysb.edu for more
      information.

   TCGA (The Clearinghouse for Genetic Algorithms):
      email -- Robert Elliott Smith <rob@comec4.mh.ua.edu>
               Department of Engineering of Mechanics
               Room 210 Hardaway Hall
               The University of Alabama
               PO Box 870278
               Tuscaloosa, AL 35487
               205-348-1618, fax 205-348-6419

   Thinking Machines:
      ftp   -- ftp.think.com:/think/techreport.list

      This file contains a list of Thinking Machines technical reports.
      Orders may be placed by email (limit 5) to t-rex@think.com, or by US
      Mail to Thinking Machines Corporation, Attn: Technical reports, 245
      First Street, Cambridge, MA 01241. In addition, the directories
      cm/starlisp and cm/starlogo contain code for the *Lisp and *Logo
      simulators. 

   Tulane University:
      ftp   -- rex.cs.tulane.edu:/pub/tech/  [129.81.132.1]

   University of Alabama:
      ftp   -- aramis.cs.ua.edu:/pub/tech-reports/

   University of Arizona:
      ftp   -- cs.arizona.edu:/reports/
      email -- tr_libr@cs.arizona.edu

      The directory /japan/kahaner.reports contains reports on AI in
      Japan, among other things, written by Dr. David Kahaner, a
      numerical analyst on sabbatical to the Office of Naval
      Research-Asia (ONR Asia) in Tokyo from NIST. The reports are not
      written in any sort of official capacity, but are quite interesting. 

   University of California/Los Angeles:
      ftp   -- ftp.cs.ucla.edu:/tech-report/

   University of California/Santa Cruz:
      ftp   -- ftp.cse.ucsc.edu:/pub/bib/
	       ftp.cse.ucsc.edu:/pub/tr/
      email -- jean@cs.ucsc.edu

   University of Cambridge Computer Lab:
      email -- tech-reports@cl.cam.ac.uk

   University of Colorado:
      ftp   --  ftp.cs.colorado.edu:/pub/cs/techreports

   University of Florida:
      ftp   -- bikini.cis.ufl.edu:/cis/tech-reports

   University of Genoa, Mechanized Reasoning Group:
      ftp   -- ftp.mrg.dist.unige.it:/pub/mrg-ftp/
      email -- Fausto Giunchiglia <fausto@irst.it>

   University of Georgia:
      ftp   -- ai.uga.edu:/pub/ai.reports/

   University of Illinois at Urbana:
      ftp   -- a.cs.uiuc.edu:/pub/dcs
      email -- e-amerman@a.cs.uiuc.edu 

   University of Indiana, Center for Research on Concepts and Cognition:
      ftp   --  cogsci.indiana.edu:/pub/
      email --  helga@cogsci.indiana.edu

   University of Kaiserslautern, Germany:
      ftp   --  ftp.uni-kl.de:/reports_uni-kl/computer_science/

   University of Kentucky:
      ftp   --  ftp.ms.uky.edu:/pub/tech-reports/UK/cs/

   University of Massachusetts at Amherst:
      email --  techrept@cs.umass.edu

   University of Melbourne, Australia,
   Computer Vision and Pattern Recognition Laboratory (CVPRL):
      ftp   -- krang.vis.mu.oz.au:/pub/articles

   University of Michigan:
      ftp   -- ftp.eecs.umich.edu:/techreports
        
   University of North Carolina:
      ftp   -- ftp.cs.unc.edu:/pub/technical-reports/

   University of Pennsylvania:
      ftp   -- ftp.cis.upenn.edu:/pub/papers/
      email -- publications@upenn.edu [email bounced 7/7/93]

   USC/Information Sciences Institute:
      email -- Sheila Coyazo <scoyazo@isi.edu> is the contact. [email
	       bounced 7/7/93]

   University of Toronto:
      ftp   -- ftp.cs.toronto.edu:/pub/cogrob/   (Cognitive Robotics)
	       ftp.cs.toronto.edu:/pub/reports/
      email -- tech-reports@cs.toronto.edu

   University of Virginia:
      ftp   -- uvacs.cs.virginia.edu:/pub/techreports/cs

   University of Western Australia:
      ftp   -- ciips.ee.uwa.edu.au
      Centre for Intelligent Information Processing Systems (CIIPS)
      EE Engineering Department

   University of Wisconsin:
      ftp   -- ftp.cs.wisc.edu:/tech-reports
	       ftp.cs.wisc.edu:/machine-learning
	       ftp.cs.wisc.edu:/computer-vision
      email -- tech-reports-archive@cs.wisc.edu


Some AI authors have set up repositories of their own papers:

   Matthew Ginsberg: t.stanford.edu:/u/ftp/papers

----------------------------------------------------------------
Subject: [5-6] Technical resources for/by undergraduate students

Brainsciences http://www.brainsciences.com

   A group of students at Brown University have created a web site to
   "provide a forum for undergraduates to publish their work. We feature
   reports of original research, book reviews, term papers, and other work
   in a similar vein."


----------------------------------------------------------------
Subject: [5-7] Where can I get a machine readable dictionary, thesaurus, and
               other text corpora?

Linguistic Data Consortium:

   The Linguistic Data Consortium was established to broaden the
   collection and distribution of speech and natural language data
   bases for the purposes of research and technology development in
   automatic speech recognition, natural language processing, and
   other areas where large amounts of linguistic data are needed.  LDC
   corpora are the most commonly used in published research.
   Information about the LDC is at http://www.ldc.upenn.edu/

Free:

   On unix systems, /usr/dict/words is a fine word list.

   ===========
   The Moby Thesaurus (25,000 roots/1.2 million synonyms), Moby Words
   (560,000 entries), Moby Hyphenator (155,000 entries), and the Moby
   Part-of-Speech (214,000 entries), Moby Pronunciator (167,000
   entries with IPA encoding, syllabification, and primary, secondary,
   and tertiary stress marks) and Moby Language (100,000 word word
   lists in five major world languages) lexical databases are
   available at:
   
   http://www.dcs.shef.ac.uk/research/ilash/Moby/

   This was once commercial but is now in the public domain.
   [thanks to Robert Bechtel]
   ===========

   Roget's 1911 Thesaurus is available by anonymous FTP from the
   Consortium for Lexical Research 

      clr.nmsu.edu:/CLR/lexica/roget-1911 [128.123.1.12]

   It is also available from

      src.doc.ic.ac.uk:/literary/collections/project_gutenberg/roget11.txt.Z

   An old Webster's dictionary is in /text/dict/{DICT.Z,DICT.INDEX.Z}.
   Project Gutenberg also has Roget's 1911 Thesaurus. The Project
   Gutenberg archive is at mrcnext.cso.uiuc.edu:/pub/etext/. The
   Project Gutenberg archive collects public domain electronic books. For more
   information, write to Michael S. Hart, Professor of Electronic Text,
   Executive Director of Project Gutenberg Etext, Illinois Benedictine
   College, 5700 College Road, Lisle, IL 60532 or send email to
   hart@vmd.cso.uiuc.edu. 

   The Online Book Initiative maintains a text repository at
   http://obi.std.com:/obi/ 

   The CHILDES project at Carnegie Mellon University has a lot of data of
   children speaking to adults, as well as the adult written and adult
   spoken corpora from the CORNELL project.  Contact Brian MacWhinney
   <brian@andrew.cmu.edu> for more information.

   The Association for Computational Linguistics (ACL) has a Data
   Collection Initiative. For more information, contact Donald Walker at
   Bellcore, walker@flash.bellcore.com.

   Two lists of common female first names (4967 names) and male first
   names (2924 names) are available for anonymous ftp from 

      ftp.cs.cmu.edu:/user/ai/areas/nlp/corpora/names/

   Read the file README first. Send mail to mkant@cs.cmu.edu for more
   information. 

   A list of 110,000 English words (one per line, in ASCII) is
   available in the PD1:<MSDOS.LINGUISTICS> directory on SIMTEL20 as the
   files WORDS1.ZIP, WORDS2.ZIP, WORDS3.ZIP, and WORDS4.ZIP. Although the
   list is in MS-DOS files, it can easily be used on other machines (but
   first you'll have to unzip the files on a DOS machine). The list
   includes inflected forms of the words, such as plural nouns and the
   -s, -ed, and -ing forms of verbs; thus the number of lexical stems in
   the list is considerably smaller than the total number of word forms.
   These files are available via FTP from WSMR-SIMTEL20.ARMY.MIL
   [192.88.110.20].  SIMTEL20 files are mirrored on wuarchive.wustl.edu.

   The Collins English Dictionary encoded as a Prolog fact base is
   available from the Oxford Text Archive by anonymous ftp from

      ota.ox.ac.uk:/pub/ota/dicts/1192/  [129.67.1.165]

   The Oxford Text Archive includes many other texts, dictionaries,
   thesauri, word lists, and so on, most of which are available for
   scholarly use and research only. See the files

      ota.ox.ac.uk:/pub/ota/textarchive.form
      ota.ox.ac.uk:/pub/ota/textarchive.info
      ota.ox.ac.uk:/pub/ota/textarchive.list
      ota.ox.ac.uk:/pub/ota/textarchive.sgml

   for more information, or write to archive@ox.ac.uk, Oxford Text Archive,
   Oxford University Computing Services, 13 Banbury Road, Oxford OX2
   6NN, UK, call 44-865-273238 or fax 44-865-273275.

   Chuck Wooters <wooters@icsi.berkeley.edu> has extracted the most
   likely pronunciation for each of about 6100 words in the hand-labeled
   TIMIT database, and made them available by anonymous ftp from
   ftp.icsi.berkeley.edu:/pub/speech/TIMIT.mostlikely.Z.

   A list of homophones from general American English is available by
   anonymous ftp from svr-ftp.eng.cam.ac.uk:/comp.speech/data/ as the file
   homophones-1.01.txt. To receive the list by email, send mail to
   Evan.Antworth@sil.org. The list was compiled by Tony Robinson.

   Sigurd P. Crossland <sig@seuss.vantage.gte.com> has been compiling 
   a dictionary of English words, including most common American words,
   abbreviations, hyphenations, and even incorrect spellings. The most
   recent version is available by anonymous ftp from

      wocket.vantage.gte.com:/pub/standard_dictionary/dic-0394.tar.gz

   The tar file includes 31 text files, one for each word-length from 2
   to 32. The compressed tar file takes up just over 4mb of space, and
   includes approximately 870,000 words.

   WordNet is an English lexical reference system based on current
   psycholinguistic theories of human lexical memory. It organizes nouns,
   verbs and adjectives into synonym sets corresponding to lexical
   concepts. The sets are linked by a variety of relations. Besides being
   of scientific interest, 
   it makes a handy thesaurus. WordNet is available by anonymous ftp from

      clarity.princeton.edu:/pub/

   If you retrieve a copy of wordnet by ftp, please send mail to
   wordnet@princeton.edu. 

Commercial:

   The Oxford Text Archive has hundreds of online texts in a wide variety
   of languages, including a few dictionaries (the OED, Collins, etc.).
   The Lancaster-Oslo-Bergen (LOB), Brown, and London-Lund corpii are also
   available from them.  For more information, write to Oxford Electronic
   Publishing, Oxford University Press, 200 Madison Avenue, New York, NY
   10016, call 212-889-0206, or send mail to archive@vax.oxford.ac.uk.
   (Their contact information in England is Oxford Text Archive, Oxford
   University Computing Service, 13 Banbury Road, Oxford OX2 6NN, UK, +44
   (865) 273238.)

Mailing Lists:

   CORPORA is a mailing list for Text Corpora. It welcomes information
   and questions about text corpora such as availability, aspects of
   compiling and using corpora, software, tagging, parsing, and
   bibliography. To be added to the list, send a message to
   corpora-request@x400.hd.uib.no. Contributions should be sent to 
   corpora@x400.hd.uib.no.


----------------------------------------------------------------
Subject: [5-8] Where can I get training sets for machine learning algorithms?

UC/Irvine (UCI) AI/Machine Learning Repository:

   ftp.ics.uci.edu has a variety of AI-related materials, with a special
   focus on machine learning. For example,

      ftp.ics.uci.edu:/pub/machine-learning-databases/

   contains over 80 benchmark data sets for classifier systems (30mb).

MLnet Machine Learning Archive
MLnet Online Information Service

   In 1988 the Special Interest Group on Machine Learning of the German
   Society for Computer Science (GI e.V.) decided to establish a library
   of PROLOG implementations of Machine Learning algorithms.  By 1994 the
   library had a sizable collection of GLPed PROLOG software.  The site
   has grown, and now, according to the webpage it "offers a growing
   collection of ML information, datasets, software and pointers to other
   ML resources."  The homepage is at:

	http://www.mlnet.org

   Send your contributions to Mathias Kirsten (info@mlnet.org) at the GMD -
   German National Research Center, or use the contribution facilities within
   the MLnet OiS.

----------------------------------------------------------------
Subject: [5-9] What on-line Journals are there?

[this question is still in progress]

Journal of Artificial Intelligence Research.  See [3-2a].
Journal of Machine learning Reasearch. See [3-2n].

---
[ comp.ai is moderated.  To submit, just post and be patient, or if ]
[ that fails mail your article to <comp-ai@moderators.isc.org>, and ]
[ ask youa news administrator to fix the problems with your system. ]
