Finding Useful Information on the Internet

Lots of library-type information now available

The Libraries of Purdue University

Weather information

Weather Data and Forecasts

Address and zip code information

National Postal Address and Zip Code Server


Electronic Dictionaries and Magazines

Hypertext Webster's Dictionary from Carnegie-Mellon University

Oxford English Dictionary

ELECTRONIC MAGAZINES (E-Zines)...

NetscapeWorld

JavaWorld

Educause publications (particularly Edupage) are very informative

Scout Report, primarily for researchers and educators

WebWeek

Web Developer Magazine

WEBster -- The E-Zine for the World-Wide Web


US Government Resources

US Government has done great job of getting useful information onto World-Wide Web

Some examples...

White House

U.S. Government Resources via the FedWorld Information Network

National Science Foundation

Department of Education

Internal Revenue Service


Search Tools for the Internet

Too time-consuming to search throughout the WWW for information on a topic

Can use one of the many available online Search Engines

Allow you to search for information in many different ways

Some engines search titles or headers of documents on Internet

Others search documents themselves

Others just search other indexes or directories

There are two types of processes for supporting searches...


Contributor-Indexing Process

"Submit something along with key terms and we will include it in our index"

Creators of documents submit URL and terms for indexing: George Patton -- Patton, World War II, Third Army, tanks

Harvest Information Discovery and Access System, University of Colorado -- searches other indexes

CUI W3 catalog (University of Geneva, Switzerland) searches through directories (summaries) of other Web documents


Robots, Crawlers, Worms, and Spiders "Crawl around and find what we can"

Continuously running program (robot, crawler, worm, spider) pursuing hyper-links throughout Internet

Start with set of documents. Identify new places to explore by looking at outbound links. Visit those links. Index most useful terms.

Lycos project at Carnegie-Mellon

"Lycos" comes from arachnid family Lycosidae -- large ground spiders, very speedy and active at night, catching their prey by pursuit rather than in a web

Lycos has database of over 50 million Web pages

Receives about 4 million visits a day


Search Engines

http://www.lycos.com

Lycos -- receives about 4 million visits a day

Created by Dr. Michael Mauldin, Carnegie-Mellon University in 1994

http://www.yahoo.com/

Yahoo -- receives about 6 million visits a day

Created by Stanford graduate students Jerry Yang and David Filo in 1994

Name "Yahoo" is either corruption of "YangFilo" or "Yet Another Hierarchical Officious Oracle"

http://www.infoseek.com/

InfoSeek -- receives about 4 million visits a day

Excite

WebCrawler

HotBot

AltaVista

MetaCrawler calls several other search engines and reports their collective results