URLVExpt


Tom Tyler, University of Denver Library

 


Requirements:

 

·         URLVerify.html file from Innopac library using the URLVerify product

·         Microsoft Windows 9x

·         URLVExpt executable

 

Brief Description:

 

Using Innopac’s URLVerify.html file, URLVExpt creates two new files that may be used for general URL maintenance in Innopac bibliographic databases.

 

The first is an HTML file of hyperlinks that can be used with third party link checking software.  Hyperlinks in this file are constucted from the URLVerify.html file’s URL column for all errors except redirects.  Hyperlinks for temporary and permanent redirects are created from URLs in the New URL column.

 

The second file created by URLVExpt contains delimited data from an Innopac URLVerify.html  file.  Data in this file can be easily and quickly imported into a spreadsheet or into a database which can be used effectively as a working environment for URL maintenance in the Innopac database.

 

Documentation:

 

Documentation for URLVExpt, in MS Word format, is included among the files in the Zipped package available at the Download heading.

 

Download:

 

The URLVExpt executable and documentation in MS Word format are available in Zipped format at http://www.du.edu/~ttyler/freeware/urlvexpt.zip.

 

Unzip the file urlvexpt.zip to a directory location you want to use for URL maintenance activity.  Two files will be unzipped: urlvexpt.exe – the executable, and urlvexpt.doc – the documentation file in MS Word format.

 

Setup:

 

There is no special setup required for URLVExpt. 

 

Running the Program:

 

Before you run the program the source file should be saved to the directory where urlvexpt is found.  Because the program is unable to recognize long filenames, the 8+3 filename convention should be used.  In the illustration below, the Innopac URLVerify.html file was saved to the working directory with the name uv000515.htm.  You may want to use a similar name-date convention  such as uvyymmdd.htm (where yymmdd is the date).

 

You can initiate the program in two ways – in the command mode from the DOS prompt where you enter urlvexpt  or from Windows by clicking on the  filename urlvexpt.exe in the Explorer display.  Screen 1 illustrated below should appear.


 

+-----------------------------------------------------------------------+

¦                                                                       ¦

¦             URLVEXPT - a /screens/urlverify.html utility              ¦

¦                                                                       ¦

¦      Tom Tyler, University of Denver Library - ttyler@du.edu          ¦

¦                                                                       ¦

¦    Before proceeding with this program you need to have saved your    ¦

¦    Innopac .../screens/urlverify.html page to the directory           ¦

¦    location on your workstation where URLVEXPT.EXE is located.        ¦

¦                                                                       ¦

¦    It is recommended that you name the file XXyymmdd.htm where        ¦

¦    XX is an abbreviation of your library name, yy is the last two     ¦

¦    digits of the year, mm is the month, and dd is the date.  The file ¦

¦    extension may be anything, but '.htm' is recommended               ¦

¦                                                                       ¦

+-----------------------------------------------------------------------+

 

Enter the full name (with extension) of the URLVerify file

from which you want to export to CSV & HTML formats ? uv000515.htm

 

Screen 1:  Enter the complete name of the source file

 

Enter the complete filename; press ENTER.

 

 

+-----------------------------------------------------------------------+

¦                                                                       ¦

¦             URLVEXPT - a /screens/urlverify.html utility              ¦

¦                                                                       ¦

¦      Tom Tyler, University of Denver Library - ttyler@du.edu          ¦

¦                                                                       ¦

¦    Before proceeding with this program you need to have saved your    ¦

¦    Innopac .../screens/urlverify.html page to the directory           ¦

¦    location on your workstation where URLVEXPT.EXE is located.        ¦

¦                                                                       ¦

¦    It is recommended that you name the file XXyymmdd.htm where        ¦

¦    XX is an abbreviation of your library name, yy is the last two     ¦

¦    digits of the year, mm is the month, and dd is the date.  The file ¦

¦    extension may be anything, but '.htm' is recommended               ¦

¦                                                                       ¦

+-----------------------------------------------------------------------+

 

Enter the full name (with extension) of the URLVerify file

from which you want to export to CSV & HTML formats ? uv000515.htm

 

Note: two files will be created by this program. 1) URLVOUT.TXT,

a CSV file for use with MS Access, and 2) URLVOUT.HTM, a file in

HTML which contains URLS that may be check with a 3rd party

URL checker.  Press ENTER to continue.?

 

Screen 2: Press ENTER to continue

 

 

A reminder note appears at the bottom of the screen.  Press ENTER again to start the program.

 


 

Completed: b2518529a

Completed: b2518624a

Completed: b2518690a

Completed: b2518981a

Completed: b2519017a

Completed: b2519108a

Completed: b2520613a

Completed: b2520886a

Completed: b2520904a

Completed: b2521965a

Completed: b2524078a

Completed: b2524466a

Completed: b2529790a

Completed: b2529853a

Completed: b2538835a

Completed: b2544154a

Completed: b2544340a

Completed: b2512294a

 

Program concluded normally.  Two files have been created:

 

     URLVOUT.TXT - delimited data from the URLVERIFY.HTML file

     URLVOUT.HTM - an HTML file for use with 3rd party link checker

>_

 

Screen 3: Conclusion of Program

 

 

At the conclusion of  the program you should see a display similar to the one in illustration Screen 3, above.  If the program concludes any other way, it may suggest an error condition.

 

Errors and Potential Problems:

 

URLVexpt has been designed to work with Innovative’s URLVerirfy.html format as of Spring, 2000. 

 

If  Innovative makes changes to this format, URLVExpt will probably not work until it has been adapted to these changes. 

 

Contact Tom Tyler (ttyler@du.edu) if this occurs.

 

 

Working with the Output file: URLVOUT.TXT

 

The intended use of the URLVOUT.TXT is as an input file for MS Access or similar relational database applications. 

 

The following fields will be found in the delimited data file URLVOUT.TXT

 

RECORD#                    - Data from the URLVerify.html “Title” column

OCLC#                - An empty field; for possible use in relational database

ERROR                - Data from the URLVerify.html “Error” column

URL                  - Data from the URLVerify.html “URL” column

REDIRECT URL         - Data from the URLVerify.html “New URL” column

FLAG                 - An empty field; for possible use in relational database

NOTE                 - An empty field; for possible use in relational database

REPLACEMENT URL      - An empty field; for possible use in relational database

TITLE                - An empty field; for possible use in relational database

 

Data elements in the URLVOUT.TXT delimited file


 

Two of the URLVerify.html data elements have been “normalized” by the program.

 

Record#, which appears in the original source file as the Innopac bibliographic record number without its check digit, has been “normalized” by the addition of the letter “a”.

 

The Redirect URL (New URL in URLVerify.html) has cleaned up the GPO PURL redirect information by removing everything except the actual URL.  Some non-standard GPO PURL hexadecimal representation has been converted to the HTTP character set.

 

 

 

Working with the Output file: URLVOUT.HTM

 

The URLVOUT.HTM file is in HTML format.  Its intended use is as a source file for third-party link checking.

 

URLs in the URLVOUT.HTM, like those in the URLVOUT.TXT file, are taken from the URL column if the error reported is not a redirect; for redirects, the URL is a “sanitized” version of the URL in the New URL column.