FTP or not FTP? That is the Question
Jack Solock, Special Librarian
When we do the Scout Report (http://scout.cs.wisc.edu/scout/report/, we like to think of ourselves as guides, allowing users to start up their information vehicles and ride down the highway, perhaps stopping here and there to load small items into their trunks if they feel the need. The web is like that. It is a very pretty road to travel, and has many beautiful and useful sites to see. It is analogous to taking a nice Sunday drive (when the traffic isn't too terrible), stopping the car here and there to enjoy the beautiful vistas, and maybe even picking up a souvenir now and then.
Before I came to InterNIC, I was a librarian at a Special Library at the University of Wisconsin. My job there was not information guide, but information hauler. The professors I worked for appreciated the nice tours I occasionally provided them. But they enjoyed it much more when I backed my information eighteen wheeler at their dock and unloaded a truckful of information that they could process into new knowledge.
The Internet is about two things, communication and sharing of information. While one can get information from the web, its main function, it could be argued, is communication. Another Internet access method, FTP (File Transfer Protocol), is much more effective for industrial strength information sharing. It allows you to to trade in your nice Sunday car for an eighteen wheeler, fifty car freight train, or even a super tanker. It is the best way to quickly obtain enormous amounts of information from the Internet, and one of the great drawbacks of the Net is that information providers don't realize how much more useful sites could be if they simply provided FTP access as well as web access.
This column will be a tour rather than a tutorial on FTP, although we will show the basic steps of how to obtain information via this access method. By taking you to just a few sites, we will demonstrate how you can use FTP to take full advantage of Internet resources. Not only that, but with just a little practice, you will be able to tell your friends that not only do you "surf the web," but that you also know how to drive an eighteen wheeler.
Before the tour, it is important to point out the key difference between FTP and web access, which is the ability to download multiple files (the
First, for those who need to know how to use anonymous FTP (a type of FTP that allows any user access to Internet FTP resources), the best place to start is the FTP FAQ (Frequently Asked Questions) at the Usenet FAQ archives at Massachussetts Institute of Technology (MIT) (ftp://rtfm.mit.edu/pub/usenet/news.answers/ftp-list/faq). If you already have an FTP client, now is as good as any time to use it.
ftp rtfm.mit.edu login: anonymous password: your email address cd pub/usenet/news.answers/ftp-list/ get faq bye
(cd means change directories)
(Note that in this and all cases, directories are separated by / and you may have to change directories individually, depending on your client.)
For those who don't have FTP access tools, they can be obtained many places, one of the most effective of which is the PBS (Public Broadcasting System) Beginner's Guide to the Internet FTP section (http://www.pbs.org/uti/guide/ftp.html). Here you can find not only FTP information, but also connections to FTP programs for Windows® and Macintosh®, and file decompression software that you may need. It is very important, especially if you are new to FTP, to obtain this information before you continue.
Now, let's take a look at a well-maintained FTP archive as a model for FTP maintenance, as well as to see advantages of the FTP access method.
The 15 Minute Series (http://rs.internic.net/nic-support/15min)
This is the InterNIC Information and Education Services' set of materials for Internet trainers. If you access the 15 Minute Series through the web, you can do many interesting things, such as search or browse the materials, or even download each set in HTML or PowerPoint format. The site is also useful in that it provides exhaustive instructions about decompressing and using the materials. However, if you were interested in downloading all the materials in the Index and Search Services section, for example, FTP would be a much more effective way to do it.
ftp rs.internic.net login: anonymous password: your email address cd NIC-support/15Min
At this point, if you didn't know where the Index and Search materials were, you would download (or view, if your FTP client were able to) the files called "
get table-of-contents.txt get instructions.txt bye
If your client doesn't support viewing files, you must download these files (index and help files) and look at them first to see what files you want to download. Admittedly, this is cumbersome, but sometimes driving an eighteen wheeler is cumbersome. Remember that it is the data you can obtain that is the advantage of FTP.
In this case, InterNIC has provided the information you need to know about where to find the index-search materials, as well as instructions on how to differentiate the HTML from PowerPoint files. This is good FTP netiquette, and any effective FTP archive will have some sort of table of contents or instruction file that identifies the files in the archive and how to use them. Now, if you viewed the two files in your client, you can simply go to the index-search directory. If not, you might need to open another FTP session to get the files.
ftp rs.internic.net login: anonymous password: your email address cd NIC-support/15min/index-search
Here we see that there are some text files (instructional in nature), some
Note that you told the client to download in
You have left your car and are now driving a small truck. You could have done all this through a web browser (using the format ftp://...), and the browser would even recognize the binary format when downloading, but you would only be able to download one file at a time.
The advantages to FTP become clear when you decide you want to download the entire 15 Minute Series (31 modules at present.)
ftp rs.internic.net login: anonymous password: your email address cd NIC-Support/15min/modules binary prompt mget *.zip OR mget *.tar.gz
Sit back and have a cup of coffee while FTP loads your truck with the 15 Minute Series.
The above is a case where the web was a good place to find out what the 15 Minute Series is about, how it is organized, and what a module looks like. Once you have seen that, use FTP to get the goods.
Now, for just a few examples of how FTP can help load the information goods in your eighteen wheeler. We will look at both the web and ftp sites of these information repositories, in order to see how you can use both to aid in your information mining.
Users should note that in all the above examples save the 15 Minute Series, it was difficult to directly correlate the web to the associated FTP information. If you want to use FTP for downloading lots of information, you should expect this, and expect to contact the information maintainers or your librarian to help you. Driving an eighteen wheeler is often more difficult than driving in your car. It will continue to be so until information maintainers realize that easy FTP access is as important as easy web access. Unfortunately, this is not a widely held principle in the Internet community.
That said, if you really want to exploit the resources of the Internet, a working knowledge of FTP is required. And by the way, webmasters use it extensively to set up the pretty web scenery that we all enjoy. That is called uploading FTP files, but that is something for another place and time.
Copyright Susan Calcari and the University of Wisconsin Board of Regents, 1994-1998. Permission is granted to make and distribute verbatim copies of the End User's Corner provided the copyright notice and this paragraph is preserved on all copies. The Internet Scout Project provides information about the Internet to the US research and education community under a grant from the National Science Foundation, number NCR-9712140. The Government has certain rights in this material.
Any opinions, findings, and conclusions or recommendations expressed in this publication are those of the author(s) and do not necessarily reflect the views of the University of Wisconsin - Madison or the National Science Foundation.
© 1997 Internet Scout Project