‹header›
‹date/time›
Click to edit Master text styles
Second level
Third level
Fourth level
Fifth level
‹footer›
‹#›
Introduce ourselves and let students know that they can feel free to contact us -- we're available as resources.
Also, tell the students about ourselves, so that they're aware of our background, how we ended up where we are now, experience, etc.
These goals are quite broad.  The main point to remember today is, however, that librarians must consider ways to integrate the use of the Internet as a resource for themselves and for their patrons.  As you can imagine, this integration poses a number of challenges and issues:
user education
computer literacy & access to machines
collection development
budgeting
Web site development & maintenance
etc.
The WWW is the whole constellation of Internet resources.  It provides the protocol for text files and multi-media.  What the WWW can do which none of the other Internet modes can is handle "hypermedia" such as graphics, moving and still images, audio, real-time chat or sound, etc.  The WWW uses the hypertext transfer protocol (http) to enable sounds, images, text, etc. to be mixed together in the same documents.  Although most WWW documents are written in the html (hypertext markup language), gopher and ftp sites can also be accessed via the WWW.  The WWW also allows any computer to become a client, and any computer to become a server.  Plus, the WWW allows different kinds of machines to communicate with eachother by providing a simple, common language (via the browser). 
In order to use the WWW, all you need is a "browser" program.  Mosaic was the first browser.  It was produced in 1993 by the NCSA, and took-off overnight!  Then, some of the people from NCSA who created Mosiac broke off and created their own company, which produced the popular Netscape browser.  At about that same time, Microsoft began creating its own browser, Internet Explorer.  As you've probably heard in the news, Netscape & Microsoft are is a legal battle because Microsoft Explorer is accused of trying to monopolize the browser market.
A URL is an address for a WWW site.  Just as every home has a number, street, city, state address, every WWW page has a URL.  The URL is what you use to locate a particular site when you're on the Web.  The URL provides information that tells your computer (client) exactly where to go (server). 
The format is:
protocol://server & domain/directory/file & extension.  So, the URL that's on this slide tells the client to go to the med-libwww server at the boston university (which is an educational institution) domain.  The directory is called "library", and within that directory, is a page or file called home.  The html tag indicates that this file was written in the html language (as opposed to a gif (graphics) or other kind of file.)
At this point, does anyone have any questions or comments about the history and background of the Internet & WWW?
A URL is an address for a WWW site.  Just as every home has a number, street, city, state address, every WWW page has a URL.  The URL is what you use to locate a particular site when you're on the Web.  The URL provides information that tells your computer (client) exactly where to go (server). 
The format is:
protocol://server & domain/directory/file & extension.  So, the URL that's on this slide tells the client to go to the med-libwww server at the boston university (which is an educational institution) domain.  The directory is called "library", and within that directory, is a page or file called home.  The html tag indicates that this file was written in the html language (as opposed to a gif (graphics) or other kind of file.)
At this point, does anyone have any questions or comments about the history and background of the Internet & WWW?
Okay, let's switch gears and move into the nitty gritty.  On most of your pre-class questionnaires, you indicated that you were interested in learning more about search engines, so let's discuss them. 
Just so you know, there are lots of sites on the WWW that give detailed summaries, evaluations, analyses and comparisons of various search engines.  The site URLs for some of these are listed on the handout.
Let’s go back to our original question.  Now that we have some idea of the types of information that is usually found on the WWW, we can determine how to go about retrieving those documents.
This first slide presents data from that same 1997 survey, giving us some indication of how people find out about new WWW sites.  And, as you can see, using search engines is a popular method of finding information.
We'll use the term "search engine" generically to describe all WWW search tools.  However, it is important to know that there is a difference between the types of search tools that are available.  A "search engine" is defined very specifically, and is different from a "Web directory".  But as I said, people use the term search engine to describe both.
Hotbot http://www.hotbot.com/ is an example of a true search engine.  Parts of a search engine include the "spider" and the software.  A search engine works by sending out "spiders" to record information about each site. If you change your web pages, search engines eventually find these changes, and that can affect how you are listed. Page titles, body copy and other elements all play a role.  When doing a search, the software looks for matches based on the elements of the sites that it has stored in its index.  
Yahoo is a good example of a Web directory. The Yahoo! directory is organized by subject. Most sites in it are suggested by Web users like all of us. Sites are placed in categories by Yahoo! surfers, who evaluate your suggestions and decide whether to add the site to Yahoo!.  This process tries to ensure that Yahoo! is organized in the best possible way, and includes quality sites.
Go to http://add.yahoo.com/fast/add?4210919 to see the "suggest a site" form.
But remember, the person who suggests a site writes the description, and in a "cold" search, Yahoo would only retrieve that site based on matching your search term(s) to the search terms in the description!  This is why "cold" searching is so sloppy.  But what Web directories do to make up for this is to create extensive categories and sub-categories in which you can browse the sites that are included.  But, keep in mind that these categories are not subject headings based on a controlled vocabulary.  They're simply logical headings which may be familiar to most people.
Take a look at headings in Yahoo! at http://www.yahoo.com or in AltaVista at http://www.altavista.com/
A meta-search engine is like a middle-man or agent that acts as a go-between between the searcher and the search engines.  Meta search engines were developed by Web-savvy people who recognized the fact that people often search several engines for one topic.  This is time consuming, plus you'll find lots of duplication among the various engines.  So, using a meta search engine saves time by searching several engines at once, eliminating duplication, and presenting the results to the user.
Two important things to remember when using meta search engines:
1.  each meta-search engine searches particular engines. 
For example, MetaCrawler searches AltaVista, Yahoo, Thunderstorm, Exite, Lycos, Infoseek & Webcrawler.  It even allows you to eliminate those that you don't want it to search (http://www.metacrawler.com/help/customize.html).  MetaCrawler does NOT search HotBot and many others, however. 
2.  The second important thing to remember about meta search engines is that when you enter a complex search, there's no guarantee that every search engine queried will be able to understand the search you've entered.
Take a look at metacrawler, for example:
http://www.metacrawler.com
If you enter (animal rights NOT (dogs or cats)) you'll find nothing from the Metacrawler search.  But if you make a small change in the search format by eliminating the first set of parentheses:  animal rights not (dogs or cats), you will get some results.  But when you analyze those results, you will immediately notice that the NOT was ignored, and you find lots of sites that mention dogs and cats.
Let's try the witchcraft search in MetaCrawler  (http://www.metacrawler.com) and DogPile (http://www.dogpile.com). 
It may go by too quickly for you to see, but MetaCrawler does list which engines it's searching as the search is being completed.  The results page ranks the hits (1000 = high relevancy and so on).  The results page also indicates from which search engine the hit was found.  MetaCrawler also provides the annotation from the original search engine.  At the end of every annotation, there is a link back to the original search engines, which can be used if you want to go back to that search engine and see the results for the search you've entered.
Dogpile provides the annotations and ranked hits, but it doesn't indicate from which engine each hit came.
Search engines also rank relevancy by the frequency of a term in a site.  But beware:  people have caught on to this and "spam" their sites with terms in the hopes that their sites will be retrieved frequently.  This spamming is considered a violation of Netiquette, so most of the search engines "punish" sites that appear to have been spammed by not loading them in the search engine's database. 
Legitimate frequency of a word or phrase is considered by a search engine, however.  For example, if you're looking for "witchcraft" the sites that are retrieved by a search should use that term frequently. 
(DO A SEARCH IN HOTBOT FOR WITCHCRAFT.  LOOK AT THE FIRST SEVERAL HITS.  ALSO NOTE THAT HOTBOT HAS A FEATURE THAT RETRIEVES THE TOP TEN MOST FREQUENTLY VISITED SITES FOR EVERY SEARCH.)
http://witchcraft.simplenet.com/present.html
You must always remember, however, that the WWW has no structure or organization.  It’s basically a free-for-all.  What the search engines attempt to do is impose some order and logic.  But in reality, you can’t artificially impose order on something that is chaotic.  This chaos will always be reflected in results of searches.
I recommend only using search engines when you’re looking for something with a proper name.  For example, the “cuban missle crisis” is agood one.  But if you’re looking for a general topic -- like the “witchcraft” serach, my recommendation would be to only use a search engine if you’re stuck.
Instead of relying on search engines, you may want to try the following approaches:
1.  Associations/professional organizations
For example, if you’re looking for information on “wine”, you can first use the Encyclopedia of Associations to find a few organizations dedicated to wine.  The Encyclopedia will also help you rule out certain organizations based on their point-of-view or intended audience.  For example, the "American Wine Society" may be generic enough to appeal to everyone, but the "Wine Growers Association of Sonoma Valley" is probably focused on wineries. Then, go to the WWW to determine whether these organizations have WWW sites --- pick a search engine and look for the association name.  You then use the information posted on the association's page.  Or, you can try the second approach, which is to:   
(CONTINUED ON NEXT SLIDE)
Think about information by CATEGORY, then use the resources compiled by libraries, professional organizations & associations, gov. agencies, etc.   A few examples to look at:
Real & Virtual Libraries
Alumni Medical Library http://med-libwww.bu.edu/
Internet Public Library http://www.ipl.org/
Associations & Prof. Organizations
ALA http://www.ala.org/library/weblinks.html
American Cancer Society http://www3.cancer.org/cancerinfo/specific.asp
Gov. Agencies
U.S. Census Bureau http://www.census.gov/
Mass. Board of Registration Physician Profiles
http://www.docboard.org/ma/ma_home.htm
World Health Organization http://www.who.int/
Cities, counties & states
Massachusetts Online http://www.magnet.state.ma.us/
Boston Online http://www.boston-online.com/index.html
An issue is, however, that no one librarian can be aware of all of the information that is available in all of these sources and sites.  Sure you can go out and search the WWW and look around, but if you have a patron waiting for help, you probably won’t have the time to browse -- risking that the information you need may not even be on the WWW. This is why it becomes increasingly important to create your own collection of WWW resources with which you are familiar, that you can go to more quickly and easily.
Think about information by CATEGORY, then use the resources compiled by libraries, professional organizations & associations, gov. agencies, etc.   A few examples to look at:
Real & Virtual Libraries
Alumni Medical Library http://med-libwww.bu.edu/
Internet Public Library http://www.ipl.org/
Associations & Prof. Organizations
ALA http://www.ala.org/library/weblinks.html
American Cancer Society http://www3.cancer.org/cancerinfo/specific.asp
Gov. Agencies
U.S. Census Bureau http://www.census.gov/
Mass. Board of Registration Physician Profiles
http://www.docboard.org/ma/ma_home.htm
World Health Organization http://www.who.int/
Cities, counties & states
Massachusetts Online http://www.magnet.state.ma.us/
Boston Online http://www.boston-online.com/index.html
An issue is, however, that no one librarian can be aware of all of the information that is available in all of these sources and sites.  Sure you can go out and search the WWW and look around, but if you have a patron waiting for help, you probably won’t have the time to browse -- risking that the information you need may not even be on the WWW. This is why it becomes increasingly important to create your own collection of WWW resources with which you are familiar, that you can go to more quickly and easily.
Once you’ve done a search and have located a number of sites, you must determine whether the information you’ve found is relevant, authoritative, reliable, etc. 
Let’s talk about evaluating Internet resources:
(Based on their experience using printed resources, the students should be able to list some of the criteria used to evaluate resources.  Most of the criterion that applies to printed resources also applies to Internet resources.)
Now take a look at these?
http://www.whitehouse.net
VS.
http://www.whitehouse.gov
Which is the REAL whitehouse?  How long would it take the average user to figure out that the 1st is a parody of the REAL whitehouse page (whitehouse.gov)
Example of a questionable site:
http://www.ceri.com/intro.htm
(This is an  editorial from a supposed expert.  But, who is this guy, really?  Notice that his resume is included on the site; but the resume excludes his education and is very vague.  Is CERI a legitimate organization?)
Would you use the information  in this site?
Example of a good site:  RR Bowker's Books Out of Print  http://www.bowker.com/bop/home/boop.html
Other sites which can be used to discuss accuracy & authority:
OncoLink (http://oncolink.upenn.edu)
Tobacco Control Archives (http://www.library.ucsf.edu/tobacco/index.html)
Smoker's Home Page (http://www.tezcat.com/~smokers/)
Essays on the Anti-Smoking Movement (http://www.tezcat.com/~smokers/issues1.html)
The True but Little Known Facts About Women with Aids, with documentation
(http://147.129.1.10/library/lib2/AIDSFACTS.html)
Example of questionable site:
search for “dihydrogen monoxide” -- some of these sites contain misleading information:
http://www.lrsm.upenn.edu/~sherman/humor/water.html
(This site provides inaccurate information regarding the supposed dangers of this chemical.)
http://www.cis.udel.edu/~way/DMRD/facts.html
(This site refutes CDC findings)
So what other approach would I take?
One site mentions the CDC, so I’d try searching their WWW site for more information.  Or, I’d try one of the toxicology databases on Internet Grateful Med for more information.) http://www.chem.utah.edu/MSDS/msds.html  (This site sells herbal PhenFen, despite the FDA’s warnings against it.)
Example of a good site:
Bureau of Labor Statistics
http://www.bls.gov/blshome.html
Example of questionable site:
search for “dihydrogen monoxide” -- some of these sites contain misleading information:
http://www.lrsm.upenn.edu/~sherman/humor/water.html
(This site provides inaccurate information regarding the supposed dangers of this chemical.)
http://www.cis.udel.edu/~way/DMRD/facts.html
(This site refutes CDC findings)
So what other approach would I take?
One site mentions the CDC, so I’d try searching their WWW site for more information.  Or, I’d try one of the toxicology databases on Internet Grateful Med for more information.) http://www.chem.utah.edu/MSDS/msds.html  (This site sells herbal PhenFen, despite the FDA’s warnings against it.)
Example of a good site:
Bureau of Labor Statistics
http://www.bls.gov/blshome.html
Example of questionable site:
search for “dihydrogen monoxide” -- some of these sites contain misleading information:
http://www.lrsm.upenn.edu/~sherman/humor/water.html
(This site provides inaccurate information regarding the supposed dangers of this chemical.)
http://www.cis.udel.edu/~way/DMRD/facts.html
(This site refutes CDC findings)
So what other approach would I take?
One site mentions the CDC, so I’d try searching their WWW site for more information.  Or, I’d try one of the toxicology databases on Internet Grateful Med for more information.) http://www.chem.utah.edu/MSDS/msds.html  (This site sells herbal PhenFen, despite the FDA’s warnings against it.)
Example of a good site:
Bureau of Labor Statistics
http://www.bls.gov/blshome.html
http://www.eb.com:180/   
Encyclopedia Brittanica (note that it says that it contains ALL articles from the printed editions; but for illustations, biographies, yearbooks, etc., it doesn't say that it contains ALL, so you should assume that many are missing.) 
http://www.bu.edu/library/refshelf.html
Mugar Library's "reference shelf" provides a link to WWWebster Dictionary, the online version of Merriam Webster's Collegiate Dictionary.  Notice that Mugar clearly states that the dictionary is an online version of the 10th edition of Merriam Webster's Collegiate Dictionary.  But when you link to the dictionary itself (http://www.m-w.com/netdict.htm), it's a lot tougher to find out what edition is contained and whether it's complete.
http://www.ahd.com/ 
American Hospital Directory clearly states how the information is gathered.  States that it includes infomration on MOST hospitals.  When you enter the "free services" section, it immediately presents a notice regarding contents, and a disclaimer.
http://www.eb.com:180/   
Encyclopedia Brittanica (note that it says that it contains ALL articles from the printed editions; but for illustations, biographies, yearbooks, etc., it doesn't say that it contains ALL, so you should assume that many are missing.) 
http://www.bu.edu/library/refshelf.html
Mugar Library's "reference shelf" provides a link to WWWebster Dictionary, the online version of Merriam Webster's Collegiate Dictionary.  Notice that Mugar clearly states that the dictionary is an online version of the 10th edition of Merriam Webster's Collegiate Dictionary.  But when you link to the dictionary itself (http://www.m-w.com/netdict.htm), it's a lot tougher to find out what edition is contained and whether it's complete.
http://www.ahd.com/ 
American Hospital Directory clearly states how the information is gathered.  States that it includes infomration on MOST hospitals.  When you enter the "free services" section, it immediately presents a notice regarding contents, and a disclaimer.
http://www.eb.com:180/   
Encyclopedia Brittanica (note that it says that it contains ALL articles from the printed editions; but for illustations, biographies, yearbooks, etc., it doesn't say that it contains ALL, so you should assume that many are missing.) 
http://www.bu.edu/library/refshelf.html
Mugar Library's "reference shelf" provides a link to WWWebster Dictionary, the online version of Merriam Webster's Collegiate Dictionary.  Notice that Mugar clearly states that the dictionary is an online version of the 10th edition of Merriam Webster's Collegiate Dictionary.  But when you link to the dictionary itself (http://www.m-w.com/netdict.htm), it's a lot tougher to find out what edition is contained and whether it's complete.
http://www.ahd.com/ 
American Hospital Directory clearly states how the information is gathered.  States that it includes infomration on MOST hospitals.  When you enter the "free services" section, it immediately presents a notice regarding contents, and a disclaimer.
Examples of Questionable sites:
http://www.hans.org/  (If you look at the fact sheets on this site -- none are dated.)
http://www.cc.columbia.edu/acis/bartleby/bartlett/
Bartlett's Familiar Quotations -- this is a terrific resource, but you may or may not notice that it's the 1901 edition!  (The header, bibliographic link and front matter link all clearly say 1901, but people could miss that.  What do you think?)
http://www.chem.utah.edu/MSDS/msds.html
This site actually warns you that the information may be outdated.  It also has a thorough disclaimer.
A good WWW site should be logically organized  so that a novice user can easily find his or her way around it.
Use my library's WWW site as an example:  the old site
http://med-libwww.bu.edu/library/OLDhome.html
was re-organized to function more easily, and also to match the BUMC site
http://www.bumc.bu.edu/   
and even the library's new WWW site http://med-libwww.bu.edu/library/home.html underwent changes.  See the BUMC MEDLINE link?  In the original re-design,   (on handout), the MEDLINE link is not as clearly delineated as a clickable box.  When we mounted this page, we immediately got calls from patrons who couldn't find the link to MEDLINE.  We were surprised that they didn't see it, but decided to make a change anyway.  If you compare the handout to the new site, you'll see that the links under the library's name are much clearer. Also notice on the other handout that the header and menu column on the left are small.  Patrons also complained about this, so we enlarged it.
Other sites for currency & coverage:
USA Today (http://www.usatoday.com)
About USA Today Online - the FAQ (Frequently Asked Questions) (http://www.usatoday.com/leadpage/about.htm)
Article by Hope Tillman entitled "Evaluating Quality on the Net" (http://www.tiac.net/users/hope/findqual.html)
Roget's Thesaurus (http://humanities.uchicago.edu/forms_unrest/ROGET.html)
Webster's Revised Unabridged Dictionary, 1913 Edition (http://humanities.uchicago.edu/forms_unrest/webster.form.html
CNN (htp://www.cnn.com)
 This site looks benign enough -- looks like a potentially useful consumer health site -- http://www.healthfront.com/  
But, it's sponsored solely by Advil.
Other sites:
     Pages Which Blend Entertainment, Information and Advertising
     The "Alternative" White House (http://www.whitehouse.net)
     The Official White House (http://www.whitehouse.gov)
     Chips Ahoy (http://www.chipsahoy.com)
     Joe Boxer (http://www.joeboxer.com)
     CourtTV (http://www.courttv.com)
Many newspapers, for example, provide what I call “teasers” -- headline or the first few sentences from an article. 
For example, look at the Philadelphia Inquirer
http://www.phillynews.com/
and you’ll see that the full text of the news is NOT available online.