URLVExpt
Tom Tyler, University of Denver Library
![]()
Requirements:
· URLVerify.html file from Innopac library using the URLVerify product
·
Microsoft
Windows 9x
·
URLVExpt executable
Brief Description:
Using Innopac’s URLVerify.html file, URLVExpt creates two new files that may be used for general URL maintenance in Innopac bibliographic databases.
The first is an HTML file of hyperlinks that can be used with third party link checking software. Hyperlinks in this file are constucted from the URLVerify.html file’s URL column for all errors except redirects. Hyperlinks for temporary and permanent redirects are created from URLs in the New URL column.
The second file created by URLVExpt contains delimited data from an Innopac URLVerify.html file. Data in this file can be easily and quickly imported into a spreadsheet or into a database which can be used effectively as a working environment for URL maintenance in the Innopac database.
Documentation:
Documentation for URLVExpt, in MS Word format, is included among the files in the Zipped package available at the Download heading.
Download:
The URLVExpt executable and documentation in MS Word format are available in Zipped format at http://www.du.edu/~ttyler/freeware/urlvexpt.zip.
Unzip the file urlvexpt.zip to a directory location you want to use for URL maintenance activity. Two files will be unzipped: urlvexpt.exe – the executable, and urlvexpt.doc – the documentation file in MS Word format.
Setup:
There is no special setup required for URLVExpt.
Running the Program:
Before you run the program the source file should be saved to the directory where urlvexpt is found. Because the program is unable to recognize long filenames, the 8+3 filename convention should be used. In the illustration below, the Innopac URLVerify.html file was saved to the working directory with the name uv000515.htm. You may want to use a similar name-date convention such as uvyymmdd.htm (where yymmdd is the date).
You can initiate the program in two ways – in the command mode from the DOS prompt where you enter urlvexpt or from Windows by clicking on the filename urlvexpt.exe in the Explorer display. Screen 1 illustrated below should appear.
+-----------------------------------------------------------------------+
¦
¦
¦
URLVEXPT - a /screens/urlverify.html utility ¦
¦ ¦
¦ Tom Tyler,
University of Denver Library - ttyler@du.edu ¦
¦
¦
¦ Before
proceeding with this program you need to have saved your ¦
¦ Innopac
.../screens/urlverify.html page to the directory ¦
¦ location on your
workstation where URLVEXPT.EXE is located. ¦
¦
¦
¦ It is
recommended that you name the file XXyymmdd.htm where ¦
¦ XX is an
abbreviation of your library name, yy is the last two ¦
¦ digits of the
year, mm is the month, and dd is the date.
The file ¦
¦ extension may be
anything, but '.htm' is recommended ¦
¦
¦
+-----------------------------------------------------------------------+
Enter the full name (with extension) of the URLVerify file
from which you want to export to CSV & HTML formats ? uv000515.htm
Enter the complete filename; press ENTER.
+-----------------------------------------------------------------------+
¦
¦
¦
URLVEXPT - a /screens/urlverify.html utility ¦
¦
¦
¦ Tom Tyler,
University of Denver Library - ttyler@du.edu ¦
¦
¦
¦ Before
proceeding with this program you need to have saved your ¦
¦ Innopac
.../screens/urlverify.html page to the directory ¦
¦ location on your
workstation where URLVEXPT.EXE is located. ¦
¦
¦
¦ It is
recommended that you name the file XXyymmdd.htm where ¦
¦ XX is an
abbreviation of your library name, yy is the last two ¦
¦ digits of the
year, mm is the month, and dd is the date.
The file ¦
¦ extension may be
anything, but '.htm' is recommended ¦
¦
¦
+-----------------------------------------------------------------------+
Enter the full name (with extension) of the URLVerify file
from which you want to export to CSV & HTML formats ? uv000515.htm
Note: two files will be created by this program. 1)
URLVOUT.TXT,
a CSV file for use with MS Access, and 2) URLVOUT.HTM, a
file in
HTML which contains URLS that may be check with a 3rd party
URL checker. Press
ENTER to continue.?
A reminder note appears at the bottom of the screen. Press ENTER again to start the program.
Completed: b2518529a
Completed: b2518624a
Completed: b2518690a
Completed: b2518981a
Completed: b2519017a
Completed: b2519108a
Completed: b2520613a
Completed: b2520886a
Completed: b2520904a
Completed: b2521965a
Completed: b2524078a
Completed: b2524466a
Completed: b2529790a
Completed: b2529853a
Completed: b2538835a
Completed: b2544154a
Completed: b2544340a
Completed: b2512294a
Program concluded normally.
Two files have been created:
URLVOUT.TXT -
delimited data from the URLVERIFY.HTML file
URLVOUT.HTM - an
HTML file for use with 3rd party link checker
>_
At the conclusion of the program you should see a display similar to the one in illustration Screen 3, above. If the program concludes any other way, it may suggest an error condition.
Errors and Potential
Problems:
URLVexpt has been designed to work with Innovative’s URLVerirfy.html format as of Spring, 2000.
If Innovative makes changes to this format, URLVExpt will probably not work until it has been adapted to these changes.
Contact Tom Tyler (ttyler@du.edu) if this occurs.
The intended use of the URLVOUT.TXT is as an input file for MS Access or similar relational database applications.
The following fields will be found in the delimited data file URLVOUT.TXT
RECORD# -
Data from the URLVerify.html “Title” column
OCLC# -
An empty field; for possible use in relational database
ERROR -
Data from the URLVerify.html “Error” column
URL -
Data from the URLVerify.html “URL” column
REDIRECT URL -
Data from the URLVerify.html “New URL” column
FLAG -
An empty field; for possible use in relational database
NOTE -
An empty field; for possible use in relational database
REPLACEMENT URL -
An empty field; for possible use in relational database
TITLE -
An empty field; for possible use in relational database
Two of the URLVerify.html data elements have been “normalized” by the program.
Record#, which appears in the original source file as the Innopac bibliographic record number without its check digit, has been “normalized” by the addition of the letter “a”.
The Redirect URL
(New URL in URLVerify.html) has cleaned up the GPO PURL redirect
information by removing everything except the actual URL. Some non-standard GPO PURL hexadecimal
representation has been converted to the HTTP character set.
The URLVOUT.HTM file is in HTML format. Its intended use is as a source file for third-party link checking.
URLs in the URLVOUT.HTM, like those in the URLVOUT.TXT file, are taken from the URL column if the error reported is not a redirect; for redirects, the URL is a “sanitized” version of the URL in the New URL column.