Notes for Week 5

Notes on Final Project

Title Selection: Make sure the title is an official title (from the Web page itself, or the title bar)

Multiple use of the same Site: Sometimes subparts of a site can be used in addition to the main site. Don't overdo this. Example: National Library of Medicine (http://www.nlm.nih.gov/); Pubmed (http://www.ncbi.nlm.nih.gov/sites/entrez?db=pubmed); MeSH Database (http://www.ncbi.nlm.nih.gov/mesh?itool=sidebar).

URL Selection: Generally prefer the shortest form of the URL. The Coca-Cola Web site initially comes up as http://www.coca-cola.com/index.jsp. But the URL you should give is: http://www.coca-cola.com/.

Annotations. Annotations are brief summaries, about 3-4 sentences. You do not need to write abstracts, which are longer descriptions that can be many paragraphs in length. You need to write informative annotations as opposed to evaluative annotations. You do not want to say things like, "this is the best web site...", or "I really like the search feature on the site". Keep your annotations objective and original (your own words, not copy and paste from the web site). Try to vary your writing style. Your writing needs attention if you find yourself starting every annotation like this, "This Web site ...". Pay attention to features available on the site: "The site offers both searching and browsing capabilities"; A database of available reports can be searched from the home page"; The topics are broken down by geographic areas".

 

Video: http://vodpod.com/watch/25484-the-machine-is-using-us-final-version?mp=1&pod=dymaxionq 

Produced by Michael Wesch, Assistant Professor of Cultural Anthropology, Kansas State University, Manhattan, KS.

 

Google Searching:

Google is always beta testing new ways to search Internet content. Here are some examples:

a) Weather search: just type "weather" followed by a city and state, zip code, or city and county. Example: weather 80210

b) Current Market Data: type a stock symbol Example: CSCO ; MSFT; CMG

c) Time in other time zones. Example: time tokyo

d) Sports data: Examples: Denver Broncos ; Colorado Rockies

e) Calculations: 5*9+(sqrt 10)^3=; (34*56 + 4)/(75.4*8.43)

f) Natural Disasters: earthquake

g) Measurement Conversions: 10.5 cm in inches

h) Trends for states or counties: population Denver ; unemployment rate colorado ; population arapahoe co

i) Synonym Search: If you want to search not only for your search term but also for its synonyms, place the tilde sign (~) immediately in front of your search term. Example: ~fast food

j) Definitions. Example: Define:library For some reason, lower case define gives a different look: define:library

k) Local Searches: italian food 80210 ; movies 80210

l) Airline Travel Info: american airlines 18 ; ua875

m) Currency Conversion: 150 GBP in USD ; 100 japanese yen in usd ; 1 usd in japanese yen

n) Maps: denver map ; tokyo map ; centennial co map

o) News Timeline: http://newstimeline.googlelabs.com/

p) Google Sets: http://labs.google.com/sets

Additional Notes: http://www.google.com/intl/en/help/features.html

Hidden Internet not so hidden after all: Search Engine Optimization

Google has new technologies that allow for search engine optimization (SEO). This means that webmasters can give Google a site map of what their site looks like, and how to make calls to hidden Internet content. See: http://www.google.com/support/webmasters/bin/answer.py?hl=en&answer=35291

This record (http://adr.coalliance.org/codu/fez/view/codu:4982) is from a hidden Internet database. Yet it is indexed by search engines (Bing; Google). That's because we gave search engines our site map, telling them how to crawl our database in this case.

Here is another example where I am guessing they did a similar thing: http://catalogue.nla.gov.au/Record/2198741

Search Share, August 2009: http://searchenginewatch.com/3634991

 

 

Other Search Engines - New search engines come and go. Here are just a few:

http://www.wolframalpha.com/

http://hakia.com/

http://lexxe.com/ Very interesting

http://www.deeppeep.org/ Gone

http://search.yippy.com/ Facets

http://www.oggleus.com/ Maybe Oggleus means "slow" in some language

http://www.chacha.com/ Q & A service with bothersome ads

http://www.gnod.net/

http://www.lurpo.com/ Gone

http://www.mooter.com/ Gone

 

Evaluating Internet Resources

Criteria for Evaluation

Scope

Is the purpose of the resource clearly stated? A site's purpose should be clear and its content should reflect its purpose, whether it's to educate, persuade, entertain, or sell. Is the site appropriate for the intended audience? Is the site for a subject expert, a layperson, a school student? How does the site compare with other sites on the same subject?

What items are included in the resource? Is the scope only implied, or is it stated through metainformation such as an introduction? Does the actual scope of the resource match expectations? Aspects of the scope include:

Breadth: What aspects of the subject are covered? Is the resource focused on a narrow area or does it include related topics?

Depth: What is the level of detail provided about the subject? This is related to the level of audience for which the resource has been designed, mentioned below. Time: Is the information in the resource limited to certain time periods?

Format: A resource that provides links may restrict its scope to certain classes of resources. For example, Telnet, Gopher, or FTP (File Transfer Protocol) resources may be excluded from a WWW-oriented site.

What is the scope of these sites?

 

Authority

Who (person or agency) is responsible for the content? What are their credentials? Why should we believe what they say?

What are the author's reputation and qualifications in the subject covered? Are sources of information stated? Can the author be contacted for clarification or to be informed of new information? Examining the URL can give clues as to the author's relationship to the organization. For instance, a tilde "~" often indicates a personal Web directory, rather than part of the organization's official Web site.

Here is Senator  Edward M. Kennedy's Web site from 1997, as recovered from the Wayback Machine:

http://web.archive.org/web/19970613224316/http://www.senate.gov/~kennedy/

Notice the ~ in the URL, yet this was the official Web site.

Many universities (such as DU) offer personal Web sites. These bear personal opinions of the authors, not the official representation of the University:

Here is an authority question. Which of these Web sites is the best authority?

 

Content

If the resource is fee-based or selling a product, is there substantial content offered for free? Is the information on the site accurate? Is the information factual or opinion? Ask "why is the author placing this information on the Internet?" Frequently the answer is that the information is placed to advertise or support a particular point of view. Does the site contribute something unique to the subject? How current is the information? How frequently is the resource updated? Are dates of update stated? Browsers may allow you to view the date of creation and modification of a file [in Netscape View|Document Info]. Remember this may not be the date that the actual information was created or reviewed. If there are links to other sites, do they work? Are they appropriate? Is the content well-written (easy to read, correct spelling and grammar)?

Here is where you should pay attention to the TLD. Sites like .gov, .mil, state.co.us would likely represent official opinions of government agencies (but not necessarily in all cases!). Sites like .com, .co.jp, or .co.uk might be company sites, but might also be personal sites (as these can easily be purchased by individuals).

Is there any bias in the content?

Is the information fact or opinion?

Is the writing logical? Is it written in bad style? Is it easy to read? Is the content repetitive?

What is the intended audience? Is the resource too elementary, too advanced or technical, or is it just right?

Does the site contain original information or simply links?

Does the site consist of internally maintained links or does it consist of external links over which there is no control?.

Currency

Is currency important to the topic of the site/page?

When was the Web item produced? When was the Web item mounted? When was the Web item last revised? How up to date are the links? How reliable are the links; are there blind links, or references to sites which have moved? Is contact information for the author or producer included in the document?

http://memory.loc.gov/ammem/aap/aaphome.html

Test for Currency:

1. Go to official government Web site (such as epa.gov; commerce.gov; fema.gov; bls.gov, etc).

2. Look for press releases (sometimes labelled newsroom, releases, news, etc.) and retrieve the most recent one you can find.

3. Search for a phrase within the release with a site limit in place (ex: site:fema.gov "Payment of not less than 75 percent of the eligible costs for repairing or replacing damaged public facilities")

Utilities for Webmasters:

Net Mechanic: http://www.netmechanic.com/

Dr. Watson: http://watson.addy.com/

Example sites to evaluate:

 Class Exercise:

Write an annotation for the following Web sites:

http://tpot.ucsd.edu/msd/catpolicies/cattoolsresources/index.html

http://www.cfa.org/

http://www.brookings.edu/

 

Citation Verification

One of the most daunting aspects of librarianship is mastering the plethora of tools available for bibliographic verification. Bibliographic verification, the process of ensuring the accuracy and authenticity of bibliographic citations, is of great importance in the academic library. Academic reference departments deal with faculty and students in every discipline of the sciences, social sciences, and humanities. They must know how to verify everything from sixteenth century English books to physics pre-prints that may just recently have been posted on the Internet.

Verification always comes first: verification before location.

Freely Available Citation Verification Tools & Strategies

Union Databases

National Libraries

Union Lists of Periodicals

Union lists aren't used much any more, given the rise of union catalogs and cross-searching (meta-searching). But some still exist.

Tables of Contents Services

Tables of contents services for journals have largely been replaced with online indexing and FT services. But TOC services for books remains valuable, as citations to book chapters can often be difficult to track down. Even these services are not so necessary as Google Books and other such services provide deep access into book content.

Google Scholar: http://scholar.google.com/

Read: Google Scholar: Finding Fulltext through U. of Denver Databases

Citation Databases

It is becoming more popular for commercial content providers to link to other articles within their products.

Examples within freely-available resources:

These are not properly within the scope of this class, because they are fee-based resources (and will be covered in LIS4011), but I bring them up to illustrate the point.

Here are some examples:

 

Google Scholar - See Research Guide to configure

 

 

 

 

 

Favorite Google Tricks: [in class]

 

Article about Google and federal agencies:

http://www.gcn.com/print/26_08/43511-1.html?topic=&CMP=OTC-RSS