diff options
author | Thomas Voss <mail@thomasvoss.com> | 2024-11-27 20:54:24 +0100 |
---|---|---|
committer | Thomas Voss <mail@thomasvoss.com> | 2024-11-27 20:54:24 +0100 |
commit | 4bfd864f10b68b71482b35c818559068ef8d5797 (patch) | |
tree | e3989f47a7994642eb325063d46e8f08ffa681dc /doc/rfc/rfc1614.txt | |
parent | ea76e11061bda059ae9f9ad130a9895cc85607db (diff) |
doc: Add RFC documents
Diffstat (limited to 'doc/rfc/rfc1614.txt')
-rw-r--r-- | doc/rfc/rfc1614.txt | 4427 |
1 files changed, 4427 insertions, 0 deletions
diff --git a/doc/rfc/rfc1614.txt b/doc/rfc/rfc1614.txt new file mode 100644 index 0000000..3ab44d0 --- /dev/null +++ b/doc/rfc/rfc1614.txt @@ -0,0 +1,4427 @@ + + + + + + +Network Working Group C. Adie +Request for Comments: 1614 Edinburgh University Computing Service +RARE Technical Report: 8 May 1994 +Category: Informational + + + Network Access to Multimedia Information + +Status of this Memo + + This memo provides information for the Internet community. This memo + does not specify an Internet standard of any kind. Distribution of + this memo is unlimited. + +Abstract + + This report summarises the requirements of research and academic + network users for network access to multimedia information. It does + this by investigating some of the projects planned or currently + underway in the community. Existing information systems such as + Gopher, WAIS and World-Wide Web are examined from the point of view + of multimedia support, and some interesting hypermedia systems + emerging from the research community are also studied. Relevant + existing and developing standards in this area are discussed. The + report identifies the gaps between the capabilities of + currentlydeployed systems and the user requirements, and proposes + further work centred on the World-Wide Web system to rectify this. + + The report is in some places very detailed, so it is preceded by an + extended summary, which outlines the findings of the report. + +Publication History + + The first edition was released on 29 June 1993. This second edition + contains minor changes, corrections and updates. + +Table of Contents + + Acknowledgements 2 + Disclaimer 2 + Availability 3 + 0. Extended Summary 3 + 1. Introduction 10 + 1.1. Background 10 + 1.2. Terminology 11 + 2. User Requirements 13 + 2.1. Applications 13 + 2.2. Data Characteristics 18 + + + +Adie [Page 1] + +RFC 1614 Network Access to Multimedia Information May 1994 + + + 2.3. Requirements Definition 19 + 3. Existing Systems 24 + 3.1. Gopher 24 + 3.2. Wide Area Information Server 30 + 3.3. World-Wide Web 34 + 3.4. Evaluating Existing Tools 42 + 4. Research 47 + 4.1. Hyper-G 47 + 4.2. Microcosm 48 + 4.3. AthenaMuse 2 50 + 4.4. CEC Research Programmes 51 + 4.5. Other 53 + 5. Standards 55 + 5.1. Structuring Standards 55 + 5.2. Access Mechanisms 62 + 5.3. Other Standards 63 + 5.4. Trade Associations 66 + 6. Future Directions 68 + 6.1. General Comments on the State-of-the-Art 68 + 6.2. Quality of Service 70 + 6.3. Recommended Further Work 71 + 7. References 76 + 8. Security Considerations 79 + 9. Author's Address 79 + +Acknowledgements + + The following people have (knowingly or unknowingly) helped in the + preparation of this report: Tim Berners-Lee, John Dyer, Aydin Edguer, + Anton Eliens, Tony Gibbons, Stewart Granger, Wendy Hall, Gary Hill, + Brian Marquardt, Gunnar Moan, Michael Neuman, Ari Ollikainen, David + Pullinger, John Smith, Edward Vielmetti, and Jane Williams. The + useful role which NCSA's XMosaic information browser tool played in + assembling the information on which this report was based should also + be acknowledged - many thanks to its developers. + + All trademarks are hereby acknowledged as being the property of their + respective owners. + +Disclaimer + + This report is based on information supplied to or obtained by + Edinburgh University Computing Service (EUCS) in good faith. Neither + EUCS nor RARE nor any of their staff may be held liable for any + inaccuracies or omissions, or any loss or damage arising from or out + of the use of this report. + + + + + +Adie [Page 2] + +RFC 1614 Network Access to Multimedia Information May 1994 + + + The opinions expressed in this report are personal opinions of the + author. They do not necessarily represent the policy either of RARE + or of ECUS. + + Mention of a product in this report does not constitute endorsement + either by EUCS or by RARE. + +Availability + + This document is available in various forms (PostScript, text, + Microsoft Word for Windows 2) by anonymous FTP through the following + URL: + + ftp://ftp.edinburgh.ac.uk/pub/mmaccess/ + + ftp://ftp.rare.nl/rare/pub/rtr/rtr8-rfc.../ + + Paper copies are available from the RARE Secretariat. + +0. Extended Summary + + Introduction + + This report is concerned with issues in the intersection of + networked information retrieval, database and multimedia + technologies. It aims to establish research and academic user + requirements for network access to multimedia data, to look at + existing systems which offer partial solutions, and to identify + what needs to be done to satisfy the most pressing requirements. + + User Requirements + + There are a number of reasons why multimedia data may need to be + accessed remotely (as opposed to physically distributing the data, + e.g., on CD-ROM). These reasons centre on the cost of physical + distribution, versus the timeliness of network distribution. Of + course, there is a cost associated with network distribution, but + this tends to be hidden from the end user. + + User requirements have been determined by studying existing and + proposed projects involving networked multimedia data. It has + proved convenient to divide the applications into four classes + according to their requirements: multimedia database applications, + academic (particularly scientific) publishing applications, cal + (computeraided learning), and general multimedia information + services. + + + + + +Adie [Page 3] + +RFC 1614 Network Access to Multimedia Information May 1994 + + + Database applications typically involve large collections of + monomedia (non-text) data with associated textual and numeric + fields. They require a range of search and retrieval techniques. + + Publishing applications require a range of media types, + hyperlinking, and the capability to access the same data using + different access paradigms (search, browse, hierarchical, links). + Authentication and charging facilities are required. + + Cal applications require sophisticated presentation and + synchronisation capabilities, of the type found in existing + multimedia authoring tools. Authentication and monitoring + facilities are required. + + General multimedia information services include on-line + documentation, campus-wide information systems, and other systems + which don't conveniently fall into the preceding categories. + Hyperlinking is perhaps the most common requirement in this area. + + The analysis of these application areas allows a number of + important user requirements to be identified: + + o Support for the Apple Macintosh, UNIX and PC/MS Windows + environments. + + o Support for a wide range of media types - text, image, + graphics and application-specific media being most + important, followed by video and sound. + + o Support for hyperlinking, and for multiple access structures + to be built on the same underlying data. + + o Support for sophisticated synchronisation and presentation + facilities. + + o Support for a range of database searching techniques. + + o Support for user annotation of information, and for user- + controlled display of sequenced media. + + o Adequate responsiveness - the maximum time taken to retrieve + a node should not exceed 20s. + + o Support for user authentication, a charging mechanism, and + monitoring facilities. + + o The ability to execute scripts. + + + + +Adie [Page 4] + +RFC 1614 Network Access to Multimedia Information May 1994 + + + o Support for mail-based access to multimedia documents, and + (where appropriate) for printing multimedia documents. + + o Powerful, easy-to-use authoring tools. + + Existing Systems + + The main information retrieval systems in use on the Internet are + Gopher, Wais, and the World-Wide Web. All work on a client-server + paradigm, and all provide some degree of support for multimedia data. + + Gopher presents the user with a hierarchical arrangement of nodes + which are either directories (menus), leaf nodes (documents + containing text or other media types), or search nodes (allowing some + set of documents to be searched using keywords, possibly using WAIS). + A range of media types is supported. Extensions currently being + developed for Gopher (Gopher+) provide better support for multimedia + data. Gopher has a very high penetration (there are over 1000 Gopher + servers on the Internet), but it does not provide hyperlinks and is + inflexibly hierarchical. + + Wais (Wide Area Information Server) allows users to search for + documents in remote databases. Full-text indexing of the databases + allows all documents containing particular (combinations of) words to + be identified and retrieved. Non-text data (principally image data) + can be handled, but indexing such documents is only performed on the + document file name, severely limiting its usefulness. However, WAIS + is ideally suited to text search applications. + + World-Wide Web (WWW) is a large-scale distributed hypermedia system. + The Web consists of nodes (also called documents) and links. Links + are connections between documents: to follow a link, the user clicks + on a highlighted word in the source document, which causes the + linkedto document to be retrieved and displayed. A document can be + one of a variety of media types, or it can be a search node in a + similar sense to Gopher. The WWW addressing method means that WAIS + and Gopher servers may also be accessed from (indeed, form part of) + the Web. WWW has a smaller penetration than Gopher, but is growing + faster. The Web technology is currently being revised to take better + account of the needs of multimedia information. + + These systems all go some way to meet the user requirements. + + o Support for multiple platforms and for a wide range of media + types (through "viewer" software external to the client + program) is good. + + o Only WWW has hyperlinks. + + + +Adie [Page 5] + +RFC 1614 Network Access to Multimedia Information May 1994 + + + + o There is little or no support for sophisticated presentation + and synchronisation requirements. + + o Support for database querying tends to be limited to + "keyword" searches, but current developments in Gopher and + WWW should make more sophisticated queries possible. + + o Some clients support user annotation of documents. + + o Response times for all three systems vary substantially + depending on the network distance between client and server, + and there is no support for isochronous data transfer. + + o There is little in the way of authentication, charging and + monitoring facilities, although these are planned for WWW. + + o Scripting is not supported because of security issues + + o WWW supports a mail responder. + + o The only system sufficiently complex to warrant an authoring + tool is WWW, which has editors to support its hypertext + markup language. + + Research + + There are a number of research projects which are of significant + interest. + + Hyper-G is an ambitious distributed hypermedia research project at + the University of Graz. It combines concepts of hypermedia, + information retrieval systems and documentation systems with aspects + of communication and collaboration, and computer-supported teaching + and learning. Automatic generation of hyperlinks is supported, and + there is a concept of generic structures which can exist in parallel + with the hyperlink structure. Hyper-G is based on UNIX, and is in + use as a CWIS at Graz. Gateways between Hyper-G and WWW exist. + + Microcosm is a PC-based hypermedia system developed at the University + of Southampton. It can be viewed as an integrating hypermedia + framework - a layer on top of a range of existing applications which + enables relationships between different documents to be established. + Hyperlinks are maintained separately from the data. Networking + support for Microcosm is currently under development, as are versions + of Microcosm for the Apple Macintosh and for UNIX. Microcosm is + currently being "commercialised". + + + + +Adie [Page 6] + +RFC 1614 Network Access to Multimedia Information May 1994 + + + AthenaMuse 2 is an ambitious distributed hypermedia authoring and + presentation system under development by a university/industry + consortium based at MIT. It will have good facilities for + presentation and synchronisation of multimedia data, strong authoring + support, and will include support for networking isochronous data. It + will be a commercial product. Initial versions will support UNIX and + X windows, with a PC/MS Windows version following. Apple Macintosh + support has lower priority. + + The "Xanadu" project is designing and building an "open, social + hypermedia" distributed environment, but shows no sign of delivering + anything after several years of work. + + The European Commission sponsors a number of peripherally relevant + projects through its Esprit and RACE research programmes. These + programmes tend to be oriented towards commercial markets, and are + thus not directly relevant. An exception is the Esprit IDOMENEUS + project, which brings together workers in the database, information + retrieval and multimedia fields. It is recommended that RARE + establish a liaison with this project. + + There are a variety of other academic and commercial research + projects which are also of interest. None of them are as directly + relevant as those outlined above. + + Standards + + There are a number of existing and emerging standards for structuring + hypermedia applications. Of these, the most important are SGML, + HyTime, MHEG, ODA, PREMO and Acrobat. All bar the last are de jure + standards, while Acrobat is a commercial product which is being + proposed as a de facto standard. + + SGML (Standard Generalized Markup Language) is a markup language for + delimiting the logical and semantic content of text documents. + Because of its flexibility, it has become an important tool in + hypermedia systems. HyTime is an ISO standardised infrastructure for + representing integrated, open hypermedia documents, and is based on + SGML. HyTime has great expressive power, but is not optimised for + run-time efficiency. It is recommended that future RARE work on + networked hypermedia should take account of the importance of SGML + and HyTime. + + MHEG (Multimedia and Hypermedia information coding Experts Group) is + a draft ISO standard for representing hypermedia applications in a + platform-independent form. It uses an object-oriented approach, and + is optimised for run-time efficiency. Full IS status for MHEG is + expected in 1994. It is recommended that RARE keep a watching brief + + + +Adie [Page 7] + +RFC 1614 Network Access to Multimedia Information May 1994 + + + on MHEG. + + The ODA (Open Document Architecture) standard is being enhanced to + incorporate multimedia and hypermedia features. However, interest in + ODA is perceived to be decreasing, and it is recommended that ODA + should not form a basis for further RARE work in networked + hypermedia. + + PREMO is a new work item in the ISO graphics standardisation + community, which appears to overlap with MHEG and HyTime. It is not + clear that the PREMO work, which is at a very early stage, is + worthwhile in view of the existence of those standards. + + Acrobat PDF is a format for representing multimedia (printable) + documents in a portable, revisable form. It is based on Postscript, + and is being proposed by Adobe Inc (originators of Postscript) as an + industry standard. RARE should maintain awareness of this technology + in view of its potential impact on multimedia information systems. + + There are various standards which have relevance to the way + multimedia data is accessed across the network. Many of these have + been described in a previous report [1]. Two further access + protocols are the proposed multimedia extensions to SQL, and the + Document Filing and Retrieval protocol. Neither of these are likely + to have major significance for networked multimedia information + systems. + + Other standards of importance include: + + o MIME, a multimedia email standard which defines a range of + media types and encoding methods for those types which are + useful in a wider context. + + o AVIs (Audio-Visual Interactive services) and the associated + multimedia scripting language SMSL, which form a + standardisation initiative within CCITT (now ITU-TSS) to + specify interactive multimedia services which can be + provided across telephone/ISDN networks. + + There are two important trade associations which are involved in + standardisation work. The Interactive Multimedia Association (IMA) + has a Compatibility Project which is developing a specification for + platform-independent interactive multimedia systems, including + networking aspects. A newly-formed group, the Multimedia + Communications Forum (MMCF), plans to provide input to the standards + bodies. It is recommended that RARE become an Observing Member of + the MMCF. A third trade association - the Multimedia Communications + Community of Interest - has also just been formed. + + + +Adie [Page 8] + +RFC 1614 Network Access to Multimedia Information May 1994 + + + Future Directions + + Three common design approaches emerge from the variety of systems and + standards analysed in this report. They can be described in terms of + distinctions between different aspects of the system: + + o content is distinct from hyperstructure + + o media type is distinct from media encoding + + o data is distinct from protocol + + Distributed hypermedia systems are emerging from the + research/development phase into the experimental deployment phase. + However, the existing global information systems (Gopher, WAIS and + WWW) are still largely limited to the use of external viewers for + nontextual data. The most significant mismatches between the + capabilities of currently-deployed systems and user requirements are + in the areas of presentation and quality of service (i.e., + responsiveness). + + Improving QOS is significantly more difficult than improving + presentation capabilities, but there are a number of possible ways in + which this could be addressed. Improving feedback to the user, + greater multi-threading of applications, pre-fetching, caching, the + use of alternative "views" of a node, and the use of isochronous data + streams are all avenues which are worth exploring. + + In order to address these problems, it is recommended that RARE seek + to adapt and enhance existing tools, rather than develop new ones. + + In particular, it is recommended that RARE select the World-Wide Web + to concentrate its efforts on. The reasons for this choice revolve + around the flexibility of the WWW design, the availability of + hyperlinks, the existing effort which is already going into + multimedia support in WWW, the fact that it is an integrating + solution incorporating both WAIS and Gopher support, and its high + rate of growth compared to Gopher (despite Gopher's wider + deployment). Gopher is the main competitor to WWW, but its + inflexibly hierarchical structure and the absence of hyperlinks make + it difficult to use for highly-interactive multimedia applications. + + It is recommended that RARE should invite proposals for and + subsequently commission work to: + + o Develop conversion tools from commercial multimedia + authoring packages to WWW, and accompanying authoring + guidelines. + + + +Adie [Page 9] + +RFC 1614 Network Access to Multimedia Information May 1994 + + + o Implement and evaluate the most promising ways of overcoming + the QOS problem. + + o Implement a specific user project using these tools, to + validate that the facilities being developed are truly + relevant to real applications. + + o Use the experience gained to inform and influence the + development of the WWW technology. + + o Contribute to the development of PC/MS Windows and Apple + Macintosh WWW clients, particularly in the multimedia data + handling area. + + It is noted that the rapid growth of WWW may in the future lead to + problems through the implementation of multiple, uncoordinated and + mutually incompatible add-on features. To guard against this trend, + it may be appropriate for RARE, in coordination with CERN and other + interested parties such as NCSA, to: + + o Encourage the formation of a consortium to coordinate WWW + technical development. + +1. Introduction + +1.1. Background + + This study was inspired by the realisation that while some aspects of + distributed multimedia technology are being actively introduced into + the European research community (for instance, audiovisual + conferencing, through the MICE project), other aspects are receiving + less attention. In particular, one category in which there seems to + be relatively little activity is providing solutions to ease remote + access to multimedia resources (for instance, accessing stored + audio/video clips or images, or indeed entire multimedia + applications, across the network). Few commercial products address + this, and the relevance of existing standards in this area is + unclear. + + Of the 50 or so research projects documented in the recent RARE + distributed multimedia survey [1], only about six have a direct + relevance to this application area. Where stated in the survey, the + main research effort in these projects is often directed towards the + "difficult" problems, such as the transfer of isochronous data and + the design and implementation of object-oriented multimedia + databases, rather than towards user-oriented issues. + + + + + +Adie [Page 10] + +RFC 1614 Network Access to Multimedia Information May 1994 + + + This report is concerned with practical issues in the intersection of + networked information retrieval, database and multimedia + technologies. It aims to establish actual user requirements in this + area, to look at existing systems which offer partial solutions, and + to identify what additional work needs to be done to satisfy the most + pressing requirements. + +1.2. Terminology + + In order to discuss multimedia information systems, we need a + consistent terminology. The vocabulary defined below embodies some + of the concepts of the Dexter hypertext reference model [2]. This + model is sufficiently general to be useful for describing most of the + facilities and requirements of the multimedia information systems + described in this report. (However, the Dexter model does not + describe searchable index objects - it is not a database reference + model.) + + anchor An identified portion of a node. E.g., in a text + node, an anchor might be a string of one or more + adjacent characters, while in an image node it + might be a rectangular area of the image. + + composite node A node containing data of multiple media types. + + document Often used loosely as a synonym for node. + + hyperdocument We refer to a collection of related nodes, + linked internally with hyperlinks, as a + "hyperdocument". Examples are a database of + medical images and associated text; a module + from a suite of teaching material; or an article + in a scientific journal. A hyperdocument may + contain hyperlinks to other data which exists in + internally with hyperlinks, as a + "hyperdocument". Examples are a other + hyperdocuments, but can be viewed as largely + self-contained. It is a highlevel "unit of + authoring", but is not necessarily perceived as + a distinct unit by a reader (although it may be + so perceived, particularly if it contains few + hyperlinks to outside entities). + + hyperlink Set of one or more source anchors and one or + more target anchors. Also known simply as a + "link". + + + + + +Adie [Page 11] + +RFC 1614 Network Access to Multimedia Information May 1994 + + + isochronous (adjective) Describes a continuous flow of data which + is required to be delivered by the network under + critical time constraints. + + leaf node A node which contains no source anchors. + + media type An attribute of data which describes the general + nature of its expected presentation. The value + of this attribute could be one of the following + (not exhaustive) list: + + o Text + + o Sound + + o Image (e.g., a "photograph") + + o Graphics (e.g., a "drawing") + + o Animation (i.e., moving graphics) + + o Movie (i.e., moving image) + + monomedia (adjective) Said of data which is all of the same media + type. + + multimedia (adjective) Said of data which contains different media + types. This definition is stricter than general + usage, where "multimedia" is often used as a + generic term for non-textual data, and where it + may even be used as a noun. + + physical media Magnetic or optical storage. Not to be confused + with media type! + + [simple] node A monomedia object which may be retrieved and + displayed as a single unit. + + source anchor An anchor which may be "actioned" by the user, + causing the node(s) containing the target + anchor(s) in the same hyperlink to be retrieved + and displayed. This process is called + "traversing the link". + + target anchor an anchor forming part of a hyperlink, whose + containing node is retrieved and displayed when + the hyperlink is traversed. + + + + +Adie [Page 12] + +RFC 1614 Network Access to Multimedia Information May 1994 + + +2. User Requirements + + User requirements in an area such as networking, which is subject to + rapid technological change, are sometimes difficult to identify. To + an extent, technology leads applications, and users will exploit what + is possible. + +2.1. Applications + + Awareness of the range of networked multimedia applications which are + currently being envisaged by computer users in the academic and + research community leads to a better understanding of the technical + requirements. This section outlines some projects which require + remote access to multimedia information across research networks, and + which are currently either at a preliminary stage or underway. The + projects are divided into broad categories according to their + characteristics. + + Multimedia Databases + + Here are several examples of multimedia projects which have a + "database" character. + + The Peirce Telecommunity Project + + This project centres on the construction of a multimedia (text and + image) database of the works of the American philosopher Peirce, + together with tools to process the data and to make it available + over the Internet. A sub-project at Brown University focuses on + adapting existing client/server network tools for this purpose. + The requirements for network access include facilities for + structured viewing, intelligent retrieval, navigation, linking, + and annotation, as well as for domainspecific processing. + + Museum Object Databases + + The RAMA (Remote Access to Museum Archives) project is funded + under the EEC RACE II programme. Its objective is to develop a + system which allows museums to make multimedia information about + their exhibits and archived material available over an ISDN + network. The requirements capture and technical architecture + design phases are now complete, and a prototype system will be + delivered in June 1993 to link the Ashmolean Museum (Oxford, GB), + the Musee d'Orsay (Paris, FR) and the Museum Archeological + National (Madrid, ES). Image data is the main media type of + interest, although video and sound may also play a part. + + + + + +Adie [Page 13] + +RFC 1614 Network Access to Multimedia Information May 1994 + + + The Bristol Biomedical Videodisk Project + + The Bristol Biomedical Videodisc is a collection of Medical, + Veterinary and Dental images. The collection holds some 24,000 + still images and is continuously growing. Textual information + regarding the images is included as part of the database and this + can be searched on any keyword, number or other data type, or a + combination of any of these. The images are currently delivered + in analogue form on a videodisc, but many institutions are unable + to afford the cost of videodisc players. Investigations into + making this image and text database available across the network + are underway. + + ArchiGopher + + ArchiGopher is a Gopher server at the College of Architecture, + University of Michigan, dedicated to the dissemination of + architectural knowledge. Presently in its infancy, ArchiGopher is + intended to become a multimedia resource for all architecture + faculty and students world-wide. Some of the available or planned + resources are: + + o The College's image bank. + + o The CAD group's collection of computer models (already + started). + + o The Doctoral Program's recent dissertation proposals and + abstracts. + + o Example archive of Kandinsky paintings. + + o Images of 3D CAD projects. + + The principal media type in ArchiGopher is image. Files are + stored in both TIFF and GIF format. + + Vatican Library Exhibit + + In January 1993, the US Library of Congress mounted an electronic + version of the exhibition ROME REBORN: THE VATICAN LIBRARY AND + RENAISSANCE CULTURE. The exhibition was subsequently processed by + the University of Virginia Library. The text files were broken + into individual captions associated directly with each image and a + WAIS-searchable version of the object index generated. This has + been made available on Gopher by the University of Virginia + Library. + + + + +Adie [Page 14] + +RFC 1614 Network Access to Multimedia Information May 1994 + + + This project is particularly interesting, as it demonstrates some + limitations of the Gopher system. The principal media types are + image and text, and it is difficult to associate a caption with + its image - each must be fetched separately, and using the XMosaic + or xgopher client software it is not possible to tell which menu + entry is the image and which the caption. (This may be a + consequence of how the data has been configured for the Gopher + server; if so, a requirement for better publishing tools may be + indicated.) Furthermore, searching the object index will result + in a Gopher menu containing references to catalogue entries for + relevant exhibits, but not to the online images of the exhibits + themselves, which severely limits the usefulness of the index. + + It is interesting to note that during the preparation of this + report, the Vatican Exhibition has been mounted on the WorldWide + Web (WWW). The hypermedia presentation on the Web is very much + more attractive to use than the Gopher version. + + Jukebox + + Jukebox is a project supported by the EEC libraries program. The + project aims to evaluate a pilot service providing library users + with on-line access to a database of digital sound recordings. + The database will support multi-user access and use suitable + storage media to make available sound recordings in a compressed + format. Users will access the service with a personal computer + connected to a telematic network. + + Scientific Publishing + + There are several refereed electronic academic journals presently + distributed on the Internet. These tend to be text-only journals, + and have not really addressed the issues of delivering and + manipulating non-text data. + + Many scientific publishers have plans for electronic publishing of + existing academic journals and conference proceedings, either on + physical media or on the network. The Journal of Biological + Chemistry is now published on CD-ROM, for instance. Some publishers + view CD-ROM as an interim step to the ultimate goal of making + journals available on-line on the Internet. + + The main types of non-text data which are envisaged are: + + o Images. In many cases, image data (a microphotograph, say) + is central to an article. Software which recognises that + the text may be of secondary importance to the image is + required. + + + +Adie [Page 15] + +RFC 1614 Network Access to Multimedia Information May 1994 + + + + o Application-specific data. The ChemLab and MoleculeLab + applications are widely used, and the integration of + corresponding data types with journal articles will enhance + readers' ability to visualise molecular structures. + Similarly, mathematics appearing in scientific papers could + be represented in a form suitable for processing by + applications such as Mathematica. Mathematical content + could then become a much more interactive and dynamic aspect + of research publications. + + o Tabular data. The ability for a reader to extract tabular + data from a research paper, to produce a graphical + representation, to subset the data, and to further process + it in a number of different ways, is viewed as an essential + part of scientific electronic publishing. + + o Movies. The American Astronomical Society regularly + publishes videos to go with its academic journals. + Electronic publishing can improve on this "hard copy" + publishing by integrating video data much more closely with + the source article. + + o Sound. There is perhaps slightly less demand for audio + information in scientific publishing, but the requirement + does exist in particular specialities (such as acoustics and + zoology journals). + + Access to academic journals using at least four different paradigms + is envisaged. Hierarchical access, perhaps using a traditional + journal/volume/issue/article model, is perhaps the most obvious. + Keyword searching (or full-text indexing) will be required. Browsing + is another useful and often underestimated access model - to support + browsing it is essential that "eye-catching" data (unlikely to be + textual) is prominently accessible. The final method of access is + perhaps the most important - the use of interactive viewing tools. + Such tools would enable navigation of hypermedia links within and + between articles, with gateways to special-purpose applications as + described above. The use of these disparate access methods implies + more than one structure being applied to the same underlying data. + + Standards, particularly SGML, are becoming important to publishers, + and it is clear that the SGML-based HyTime standard will be a front + runner in providing the kind of hypermedia facilities which are being + envisaged. However, progress towards a common SGML Document Type + Definition (DTD) for scientific articles, even within individual + publishing houses and for text-only documents, is slow. + + + + +Adie [Page 16] + +RFC 1614 Network Access to Multimedia Information May 1994 + + + A specific initiative involving interested parties will be required + to formalise detailed requirements and to pilot standards in this + area. A preliminary demonstrator project, funded by publishers and + by the British Library Research and Development Department, involves + making about 30 sample scientific articles available over the + SuperJANET network, using a range of different software products. The + demonstrator project is being managed by IOP Publishing and is being + carried out at Edinburgh University Computing Service. + + Existing tools, particularly WAIS and WWW, are relevant, but adequate + security and charging mechanisms are required if commercial + publishers are to use them. Many research groups are now making the + text of preprints and published research papers available on Gopher + servers. + + It is interesting to note that the proceedings of the Multimedia 93 + conference run by the ACM will be published electronically (on CD + ROM), using a multimedia document format designed specifically for + the event. + + Computer-aided Learning + + The ready availability of user-friendly multimedia authoring tools + such as AuthorWare Professional, Asymmetrix Multimedia Toolbook, + Macromind Director and many more, has stimulated much interest in + multimedia for computer-aided learning applications within the user + community. Sophisticated interactive multimedia courseware + applications are being developed in many disparate subjects + throughout the European academic community. Users are now beginning + to ask network technologists, "how can I make my multimedia + application available to others across the network?". + + There is considerable interest in using the network to enhance + delivery of multimedia teaching materials - for instance to allow + students to take courses remotely (distance learning) and for their + learning process to be supported, monitored and assessed remotely. + + The requirements which flow from this type of network application + include the ability to identify and authenticate the students using + the material, to monitor their progress, and to supply on-line + assessment exercises for the student to complete. Multimedia + authoring tools allow very attractive presentation environments to be + created, which encourages learning; this is viewed as essential by + course developers. Easy-to-use authoring tools (preferably existing + commercial ones) are also essential. + + Finally, some learning applications involve simulations - examples + include meteorological modelling and economic simulations. Network + + + +Adie [Page 17] + +RFC 1614 Network Access to Multimedia Information May 1994 + + + delivery of teaching materials should cope with this requirement + (perhaps by acknowledging that executable scripts are just another + media type). + + General Information Services + + There are many other possible uses of multimedia data in networked + information servers which don't conveniently fall into any of the + above categories. Some examples are given below. + + o On-line documentation. Manuals and instruction books often + rely heavily on pictorial information, and are enhanced by + dynamic media types (sound, video). The ability to access + centrally-held manuals across a network makes it much easier + to keep the information up-to-date. + + o Campus-wide information systems (CWIS) are an important + growth area. The opportunities for enhancing such a + service with multimedia data (e.g., maps) is obvious. + + o Multimedia news bulletins (e.g., the Internet Talk Radio, + which is sound only). + + o Product information (the multimedia equivalent of paper + advertising matter). + + o Consumer systems - e.g., tourist information servers. The + utility of such systems in an academic/research environment + is perhaps questionable, but it is likely that such systems + will address problems which will also be met in this + environment. We should be prepared to learn from such + projects. + +2.2. Data Characteristics + + Some of the characteristics which make data more appropriate for + network publication rather than publication on physical media are + listed below. + + o The data may change frequently. + + o Implementing corrections and improvements to the data is + very much easier. + + o It is more readily available to the data user - no + purchase/delivery cycle need exist. + + + + + +Adie [Page 18] + +RFC 1614 Network Access to Multimedia Information May 1994 + + + o Publication on physical media may not be cost-effective for + very large volumes of data. (Of course, there is a cost in + networking the data as well, but the research/academic user + is normally insulated from this.) + + o Access for large user communities can be established without + requiring each user to purchase a potentially expensive + physical media peripheral (such as a laser disk player). + This is particularly helpful in classroom situations. + + o It may require less effort from the data publisher to make + data available over a network, rather than set up a manual + mechanism for distributing physical media. + + o If related data from many different sources is to be + published, it may be more efficient to leave the data in + situ, and simply publish the network addresses of the data. + + There are counter-reasons which may make physical media distribution + more appropriate: + + o Easier to charge for. (However, charging mechanisms do + exist in some network information systems. It may be that + potential information providers need to be made more aware + of this.) + + o Easier to deter or prevent copyright infringement, using + traditional copy-protection techniques. + +2.3. Requirements Definition + + From studying the applications described in the preceding section, + and from discussions with the people involved with the applications, + it is possible to draw up a list of general requirements which a + distributed multimedia information system for the academic and + research community should satisfy. These requirements are informally + described in the following subsections. The descriptions are + necessarily informal and incomplete: every individual application + will have its own detailed requirements, which would take a great + deal of effort to determine (and indeed some of the requirements may + not become apparent until the application is into its development + phase). + + Platforms + + It is clear that the European academic community, in common with + other such communities, requires support for three main platforms: + UNIX, Apple Macintosh, and PC/Windows. For multimedia client/server + + + +Adie [Page 19] + +RFC 1614 Network Access to Multimedia Information May 1994 + + + systems, the latter two are less appropriate as server platforms, but + client support for all three is vital. UNIX will be most often used + as the server platform. + + There are other systems, such as VAX/VMS, which are also important in + some sectors. + + Media Types + + Unsurprisingly, all applications require text data to be supported as + a basic media type. Image and graphic media types are next in + importance, followed by "application-specific" data (such as tabular + scientific data, mathematical equations, chemical data types, etc). + Sound and video media types are becoming more important as users + discover how these can enhance applications. + + Many different encodings are possible for each media type (e.g., + image data can be encoded as TIFF, PCX, GIF, PICT and many more). An + information system should not constrain the type of encoding used, + and should ideally offer either a range of alternative encodings, or + conversion facilities between the stored encoding and an encoding + suitable for display by the client workstation. + + Hyperlinks + + It is clear that many applications require their users to be able to + navigate through the information base according to relationships + determined by the information provider - in other words, hyperlinks. + Academic publishing, CAL, on-line documentation and CWIS systems all + require this capability. The user should be able, by some action + such as clicking on a highlighted word in a text node or on a button, + to cause another node or nodes to be retrieved and displayed. + + Some "hypermedia" systems are in fact simply hypertext, in that they + require the source anchor of a hyperlink to be in a text node. A + true hypermedia system allows hyperlinks to have their source anchors + in nodes of any media type. This allows a user to click the mouse on + a component of a diagram or on part of a video sequence to cause one + or more related nodes to be retrieved and displayed. + + Some hypermedia systems allow target anchors of a hyperlinks to be + finer-grained than a whole node - e.g., the target anchor could be a + word or a paragraph within a text document. Without such a + capability, it is necessary for target nodes to be quite small if + precision is required in a hyperlink. This may be difficult to + manage, and fine-grained target anchors are therefore better. + + + + + +Adie [Page 20] + +RFC 1614 Network Access to Multimedia Information May 1994 + + + Additional structure above or orthogonal to the underlying + hyperlinked data is required in some applications. This allows the + same (generally non-textual) data to be used in several different + applications, or the implementation of different access paradigms. + + Presentation + + Related information of different media types must be capable of + synchronised display. Commercial multimedia authoring packages + provide many different ways of presenting, synchronising and + interacting with media elements. Some of these are summarised below. + + o Backdrops. An application may present all its visual + information against a single background bitmap - e.g., + a CAL application might use a background image of an open + textbook, with graphics, text and video data all presented + on the open pages of the book. + + o Buttons. A "button" can be defined as an explicitly- + delimited area of the display, within which a mouse click + will cause an action to occur. Typically, the action will + be (or can be modelled as) a hyperlink traversal. + Applications use different styles of button - some may use + "tabs" as in a notebook, or perhaps "bookmarks" in + conjunction with the open textbook backdrop mentioned above. + Others may use plain buttons in a style conforming to the + conventions of the host platform, or may simply highlight a + word or phrase in a text display to indicate it is "active". + + o Synchronisation in space. When two or more nodes are + presented together (e.g., because a link with more than one + target anchor has been traversed), the author of the + hyperdocument may wish to specify that they be presented in + a spatially-related way. This may involve: x/y + synchronisation - e.g., a video node being displayed + immediately above its text caption; it may involve + contextual synchronisation - e.g., an image being displayed in + a specific location within a text node; or it may involve z- + axis synchronisation as well - for instance a text node + containing a simple title being displayed on top of an + image, with the text background being transparent so that + the image shows through. + + o Synchronisation in time. Isochronous data may require + synchronisation - the obvious case being audio and video + tracks (where these are held separately). Other examples + are: the synchronisation of an automatically-scrolling text + panel to a video clip (for subtitling); or to an audio clip + + + +Adie [Page 21] + +RFC 1614 Network Access to Multimedia Information May 1994 + + + (e.g., a translation); or synchronising an animation to an + explanatory audio track. + + Searching + + Database-type applications require varying degrees of sophistication + in retrieval techniques. For applications addressed in this report, + non-text nodes form the major data of interest. Such nodes have + associated descriptions, which may be plain text, or may be + structured into fields. Users need to be able to search the + descriptions, obtain a list of "hits", and select nodes from that + list to display. Searching requirements vary from simple keyword + searching, via full-text indexing (with or without Boolean + combinations of search words), to full SQL-style database retrieval + languages. + + Interaction + + The user must be able to annotate documents retrieved from the + information server. The annotations may be stored locally. + Similarly, the user may wish to add his own (locally-held) hyperlinks + to documents. (Actual modification of documents in the information + system itself, or shared annotations to documents - i.e., the + information system as a CSCW environment - is viewed as separate + issue which this report does not address.) + + If an information provider has included contact details (such as a + mail address) in a document, it should be possible for the reader to + invoke a program (such as a mailer) which initiates communication + with the author. + + In some applications, it may make sense for a user to be able to + specify a region of interest in an image or movie clip, and to + request a more detailed view of (or other information about) that + region. + + Some applications require a sequence of images to be presented under + control of the user. For instance, a three-dimensional microscopic + structure could be represented as a sequence of images taken with the + microscope focused on a different plane for each image. For display, + the user could control which image was displayed using some kind of + slider control, giving the illusion of focusing a microscope. (This + particular example has been taken from the Theseus project at John + Moore's University, Liverpool, GB.) + + + + + + + +Adie [Page 22] + +RFC 1614 Network Access to Multimedia Information May 1994 + + + Quality of Service + + Research has shown [3] that user toleration of delay in computer + systems depends on user perception of the nature of the requested + action. If the user believes that no computation is required, + tolerable delays are of the order of 0.2s. If the user believes the + action he or she has requested the computer to perform is "difficult" + - for instance a computation of some form - then a tolerable delay is + of the order of 2s. Users tend to give up waiting for a response + after about 20s. Networked multimedia information systems must be + able to provide this level of responsiveness. + + Management + + In order to support applications involving real-money information + services (e.g., academic publishing) and learning/assessment + applications, there must be a reliable and secure access control + mechanism. A simple password is unlikely to suffice - Kerberos + authentication procedures are a possibility. + + Users must be able to determine the charge for an item before + retrieving it (assuming that pay-per-item will be a common paradigm + alternatives such as pay-per-call, pay-per-duration are also + possible). Access records must be kept by the information server for + charging purposes. + + Learning applications have similar requirements, except that the + purpose here is not to charge for information retrieved, but to + monitor and perhaps assess a student's progress. + + Scripting + + Many authoring packages provide scripting languages. In most cases, + these languages are used to manage the presentation environment and + control navigation within the hypermedia document. There are other, + declarative rather than procedural, methods for achieving this, so + scripting of this type is not necessarily a requirement. However, + some application areas require executable scripts for other purposes + (e.g., simulations in CAL applications). Care in providing such a + facility is required, because of the potential for abuse (the + possibility of "trojan" scripts). However, there is work going on to + produce "safe" scripting languages - an example is "safe tcl", being + developed by Borenstein and Ousterhout (contact + ouster@cs.berkeley.edu). + + + + + + + +Adie [Page 23] + +RFC 1614 Network Access to Multimedia Information May 1994 + + + Bytestream Format + + For the easy transfer and handling of a hyperdocument, it must be + capable of being encoded into a bytestream form, in such a way that + the structure of the document is preserved and it can be decoded + without loss of information. + + This facility makes it possible for such documents to be supplied to + a user over electronic mail, in such a way that he or she can browse + them at his or her own site. This may be appropriate where the user + does not have a direct connection to the Internet. It will also be + useful for printing the hyperdocument. + + Authoring + + It is essential that a multimedia information system should have + adequate authoring tools which make it easy to prepare and publish + hypermedia information. Such tools need similar power to existing + commercial multimedia authoring software for stand-alone multimedia + applications. + +3. Existing Systems + + This chapter describes some existing distributed information systems + in sufficient detail to reveal how they handle multimedia data, and + analyses how well they meet the requirements outlined in the + preceding chapter. + +3.1. Gopher + + The Internet Gopher is a distributed document delivery service. It + allows a neophyte user to access various types of data residing on + multiple hosts in a seamless fashion. This is accomplished by + presenting the user with a hierarchical arrangement of nodes and by + using a client-server communications model. The Gopher server + accepts simple queries, and responds by sending the client a node + (usually called a document in this context). + + Client software is available for a large number of systems, + including: + + o UNIX (character terminals) + + o X windows + + o Apple Macintosh + + o MS DOS + + + +Adie [Page 24] + +RFC 1614 Network Access to Multimedia Information May 1994 + + + + o NeXT + + o VM/CMS + + o VMS + + o OS/2 + + o MVS/XA + + Servers are available for systems such as: + + o UNIX + + o VMS + + o Apple Macintosh + + o VM/CMS + + o MVS + + o MS DOS + + Gopher was developed at the University of Minnesota. + + Gopher User Image + + A Gopher client offers an interface into "gopherspace", which appears + to the user as a hierarchy of menus and document nodes, similar in + some ways to a file system hierarchy of directories and files. + Selecting an entry from a menu node causes a further menu to appear, + or causes a document to be retrieved and displayed. + + As well as "ordinary" document nodes, Gopher has "search nodes" when + one of these is selected from a menu, the user is prompted for one or + more words to search on. The result of the search is a "virtual" + menu, containing entries for document nodes (within some subset of + gopherspace) which match the search. A special type of Gopher search + server called "veronica" provides access to a database of all + directory nodes in gopherspace. This allows a user to construct a + virtual menu of all Gopher menu items containing a particular word. + WAIS databases may also be located at Gopher search nodes, since some + Gopher servers understand the format of WAIS index files. + + + + + + +Adie [Page 25] + +RFC 1614 Network Access to Multimedia Information May 1994 + + + Gopher Protocol + + Gopher uses a client-server paradigm. The Gopher protocol runs over + a reliable data stream service, typically TCP, and is fully defined + in RFC 1436. The following paragraphs give an overview which is + sufficient for understanding how multimedia data is handled in + Gopher. + + A Gopher client opens a TCP connection to a Gopher server (defined by + machine name and TCP port number), and sends a line of text known as + the "selector" to request information from the server. The server + responds with a block of data, and then closes the connection. No + state is retained by the server. A null (empty) selector tells the + Gopher server to return its "root" menu node, containing pointers to + other information in gopherspace. + + A menu is returned from a Gopher server as a sequence of lines of + text, each corresponding to one entry in the menu. Each line (which + is sometimes called a "Gopher reference") contains the following + data, which can be used by the client software to retrieve and + display the corresponding node in gopherspace. + + o A single character which identifies the type of the node. + Possible values of this type ID are given below. + + o A human-readable string which is used by the client software + when it displays the menu entry to the user. + + o The selector which should be used by client software to + retrieve the node. It is treated as opaque by the client + software. + + o The domain name of the host on which the node is held. + + o The port number to use for the TCP connection. + + A document node is sent by a Gopher server simply as lines of text + terminated by a dot on a line by itself, or as raw binary data, with + the end of the data indicated by the server closing the TCP + connection. The choice depends on the type of node. + + The currently-defined type IDs are as follows: + + 0 Node is a file. + + 1 Node is a directory. + + 2 Node is a CSO phone book server. + + + +Adie [Page 26] + +RFC 1614 Network Access to Multimedia Information May 1994 + + + + 3 Error. + + 4 Node is a BinHexed Macintosh file. + + 5 Node is DOS binary archive of some sort. + + 6 Node is a UNIX uuencoded file. + + 7 Node is a search server. + + 8 Node points to a text-based telnet session. + + 9 Node is a binary file. + + T Node points to a TN3270 connection. + + Some experimental IDs are also in use: + + s Node contains -law sound data. + + g Node contains GIF data. + + M Node contains MIME data. + + h Node contains HTML data. + + I Node contains image data of some kind. + + i In-line text type. + + The process for defining new data types and corresponding IDs is not + clear. + + Gopher+ Protocol + + The Gopher+ protocol is an extension of the Gopher protocol. Gopher+ + is defined informally in [4]. It is designed to be downwards + compatible with the original protocol, so that old Gopher clients may + access Gopher+ servers (without being able to take advantage of the + new facilities), and Gopher+ clients may access old Gopher servers. + Gopher+ is still at the experimental stage, and is liable to change. + + The most important new feature is the introduction of "attributes" + associated with individual nodes. The client may retrieve the + attributes of a node instead of the node contents. Attributes + defined so far include: + + + + +Adie [Page 27] + +RFC 1614 Network Access to Multimedia Information May 1994 + + + INFO Contains the Gopher reference of the node. + Mandatory. + + ADMIN Contains administrative information, including + the mail address of the server administrator and + the last-modified date of the node. Mandatory. + + VIEWS Contains a list of one or more "view + descriptors", each of which describes an + alternate view of the node. For instance, an + image node may contain a TIFF view, a GIF view, + a JPEG view, etc. The client software (or the + user) may choose which view to retrieve. The + size of the view is also (optionally) available + in this attribute. The Gopher+ Attribute + Registry (see below) defines the permitted view + types. + + ABSTRACT This attribute contains a short description of + the item. It may also include a Gopher + reference to a longer abstract, held in a + separate Gopher node. + + ASK This attribute is used for the interactive query + extension. The interactive query facility in + Gopher+ is used to obtain information from a + user before retrieving the contents of a node. + The client fetches the ASK attribute, which + contains a list of questions for the user. His + or her responses to those questions are sent + along with the selector to the server, which + then returns the contents of the node. This + facility could be used as a very simple way of + querying a database, for instance. Using the + interactive query facility to supply a password + for access control purposes is not a good idea - + there are too many opportunities for + masquerading. + + The University of Minnesota maintains a registry of Gopher+ attribute + types. For the VIEWS attribute, the registry contains a list of + permitted view types. Note that these view types have a similar + function to the type identifier described in the preceding section. + + The general format of a Gopher+ view descriptor is: + + xxx/yyy zzz: <nnnK> + + + + +Adie [Page 28] + +RFC 1614 Network Access to Multimedia Information May 1994 + + + where xxx is a general type-of-information advisory, yyy is what + information format you need understand to interpret this information, + zzz is a language advisory (coded using POSIX definitions), and nnn + is the approximate size in bytes. Possible values for xxx include + text, file, image, audio, video, terminal. + + (It now appears that the University of Minnesota Gopher Team accepts + the need to be consistent in the use of type/encoding attributes with + the MIME specification. The Gopher+ Type Registry may thus + eventually disappear, together with the set of xxx/yyy values it + currently contains.) + + No view descriptors for directory nodes are currently registered. + + In order to make use of the information available in attributes, it + is necessary to fetch the attributes before fetching the contents of + a node. Gopher+ provides a way of fetching the attributes for each + entry in a menu at the same time as the menu is retrieved. This + saves having to establish two successive TCP connections to fetch a + single document, at the expense of some additional client software + complexity. + + Gopher Publishing + + The procedure for making data available using the Unix Gopher server + "gopherd" is very straightforward. The hierarchical nature of the + Unix file system closely matches the Gopher concept of menus and + documents. The gopherd program exploits this - Unix directories are + represented as Gopher menu nodes, and Unix files as Gopher document + nodes. The names of directories and files are the entries in Gopher + menus. This can lead to awkward file names containing spaces, so + gopherd provides an aliasing mechanism (the \.cap directory) to get + round this. + + To represent menu entries pointing to Gopher nodes on other servers, + special "link" files (starting with a dot) are used. + + The type ID for a document node is determined from the extension of + its Unix filename. If a client requests a file containing a shell + script, the script is executed and the output returned to the client. + + The Gopher+ version of gopherd is similar, but the .cap directory is + replaced by a configuration file gopherd.conf. This file is used to + specify administration attributes, and the mapping between filename + extensions and view descriptors. Some limited access control (based + on the client's IP address/domain name) is also provided by the + Gopher+ version of gopherd. + + + + +Adie [Page 29] + +RFC 1614 Network Access to Multimedia Information May 1994 + + + Published Non-text Data + + There is already some useful non-text data published on Gopher almost + exclusively image data. See for example the Vatican Library + Exhibition at the University of Virginia Library, the ArchiGopher at + the University of Michigan, the weather machine at the University of + Illinois. Some of these are described in the User Requirements + chapter of this report. + + There seem to be rather fewer sound archives in gopherspace, but + interested users may access the Edinburgh University Computing + Service Gopher server on gopher.ed.ac.uk, where the Testing Area + contains 20 or 30 short audio files in Sun audio format. Note - the + availability of this archive is not guaranteed. + + Advantages + + The main factor in favour of Gopher is its widespread penetration. + There are over 1000 Gopher servers world-wide. This popularity is + due in part to the ease of setting up a Gopher server and making + information available on it, particularly on a Unix platform. + + Limitations + + It is unfortunate that the relatively well-defined MIME types were + not adopted in Gopher+. As mentioned above, this may yet happen, + although there appear to be reasons for keeping the set of MIME types + small whereas Gopher requires a wide range of types to offer to + clients. The latest word is that the MIME registry will be expanded + to include the types which the Gopher+ developers want. + + Gopher is inflexibly hierarchical in nature. Hypertext or hypermedia + it is not - links to other nodes from within document nodes are not + possible. There is a suggestion in the Gopher+ specification that + alternate views of directory nodes could be used to provide some kind + of hypermedia capability, but this does not yet exist, and it is + unlikely that it could be made to work as easily as the WWW hypertext + model. + + There is no access control at the user level - anyone can retrieve + anything on a Gopher server. There is no provision for charging for + information. + +3.2. Wide Area Information Server + + The Wide Area Information Server (WAIS) system allows users to search + for and retrieve information from databases anywhere on the Internet. + WAIS uses a client-server paradigm, and client and server software is + + + +Adie [Page 30] + +RFC 1614 Network Access to Multimedia Information May 1994 + + + available for a wide range of platforms. Client applications are + able to retrieve text or other media documents stored on the servers, + by specifying keywords. The server software searches a full-text + index of the documents, and returns a list of documents containing + the keywords (ranked according to a heuristic algorithm). The client + may then request the server to send a copy of any of the documents + found. Relevant documents can be fed back to a server to refine the + search. Successful searches can be automatically re-run, to alert + the user when new information becomes available. + + WAIS was developed by Thinking Machines Corporation of Cambridge, + Massachusetts, in collaboration with Apple Computer Inc., Dow Jones + and company, and KPMG Peat Marwick. The WAIS software has been made + freely available; however Thinking Machines has announced that they + will stop support for their publicly-distributed WAIS as of version + 8b5.1. Future support and development of the publicly-distributed + WAIS has been taken over by CNIDR (Clearinghouse for Networked + Information Discovery and Retrieval) in the USA. Future CNIDR + releases will be called FreeWAIS. A new company, WAIS Inc, has been + formed by Thinking Machines to take over commercial exploitation of + the Thinking Machines WAIS software. + + WAIS server software is available for the following platforms: + + o UNIX + + o VAX/VMS + + Client software is available for the following platforms: + + o UNIX (versions for X, Motif, Open Look, Sun View) + + o NeXT + + o Macintosh + + o MS DOS + + o MS Windows + + o VAX/VMS + + There are currently over 400 WAIS databases available on the + Internet. WAIS is also the basis of some commercial information + services on private networks. + + + + + + +Adie [Page 31] + +RFC 1614 Network Access to Multimedia Information May 1994 + + + WAIS User Image + + In order to ask a question, the user must first select one or more + databases in which to look for the answer. (The list of all + available databases is available from a number of well-known sites.) + The next step is to enter one or more keywords as the basis of the + search. The search will return a list of documents (the "result + set") which contain any of the keywords. Each document is given a + ranking (a number between 1 and 1000) which indicates how relevant to + the user's question the server believes the document to be. The size + of each document is also shown in the list. The user may limit the + size of the result set - the default limit is typically 40 documents. + + The user may then choose to retrieve and display one or more + documents from the list. Alternatively, he or she may designate one + or more documents in the list as "relevant", and perform another + search to find "more documents like this". This is called "relevance + feedback". + + The user may retrieve general information about the database, and may + examine the catalogue of all documents in the database. There is + also a "database of databases", which may be searched to identify + WAIS databases which may be relevant to a subject. + + WAIS Protocol + + The user interface (client) talks to the server using an extended + version of a standard ANSI protocol called Z39.50. This is now + aligned with the ISO SR (Search and Retrieval) protocol for + bibliographic (library) applications, which is part of OSI. The + present WAIS protocol does not utilise a full OSI stack - APDUs are + transferred directly over a TCP/IP connection. The WAIS protocol is + described in [5]. + + WAIS does not, at this time, implement the full Z39.50-1992 + specification - in particular, WAIS does not permit Boolean searches + (e.g., "find all documents containing 'chalk' and 'cheese' but not + 'green'"). However, Boolean search capability is being added to the + FreeWAIS implementation. There are facilities in the Z39.50 protocol + for access control and charging, but these are not currently + implemented in WAIS. + + The WAIS extensions to Z39.50 are mainly to provide the relevance + feedback capability. + + Note that the Z39.50 protocol is not stateless - the result set may + in some circumstances be retained by the server for the user to + further refine or refer to. However, the subset of Z39.50 used by + + + +Adie [Page 32] + +RFC 1614 Network Access to Multimedia Information May 1994 + + + current WAIS implementations mean that server implementations may be + stateless. + + Document type is determined by the server from information in the + database index (see below), and is sent to the client as part of the + result set. + + WAIS Publishing + + The first step in preparing data for publishing in a WAIS database is + to use the 'waisindex' utility. This takes a set of text files, and + produces an index file which contains an occurrence list of words of + three or more letters in every file. This index file is used by the + WAIS server software to resolve search requests from clients. + + The 'waisindex' utility indexes files in a wide range of text + formats, as well as postscript and image files in various encodings + (only the file name is indexed for image files). Some of the text + formats involve a file as being treated as a collection of documents + for the purposes of WAIS access. Note that there appears to be no + formal "registry of types" - just whatever the waisindex program + supports. There is no distinction between media type and encoding + format. + + Published Non-text Data + + There is relatively little non-text data available in WAIS databases. + + o URL=wais://quake.think.com:210/CM-images is a database of + TIFF images from the Connection Machine. + + o URL=wais://mpcc3.rpms.ac.uk:210/home/images/pathology/RPMS- + pathology is a database of histo-pathological images and + documentation on mammalian endocrine tissue. + + o URL=wais://starhawk.jpl.nasa.gov:210/pio contains GIF images + from NASA planetary probe missions, together with their + captions. The presence of the caption index information + makes it difficult to construct a search which returns + images in the result set increasing the maximum result set + size may help. + + Advantages + + WAIS is ideally suited for its intended purpose of searching + databases of textual information on the basis of keywords. It + appears to have the potential to satisfy the requirements of some of + the "database" category of applications mentioned in Chapter 1. + + + +Adie [Page 33] + +RFC 1614 Network Access to Multimedia Information May 1994 + + + Limitations + + WAIS is not (and does not pretend to be) a general-purpose + information system, as Gopher and WWW are. WAIS does not have + hyperlinking, and offers a purely flat structure. + + A limitation which is particularly apparent is the way that the + current version of FreeWAIS indexes non-text files - using only the + filename! However, it does seem that simply changing the indexing + program to allow a list of keywords to be attached to non-text files + would suffice to allow sensible indexing of non-text data. The + commercial (WAIS Inc) version of WAIS allows several files to be + associated together for indexing and retrieval purposes. + Furthermode, the UCSF Centre for Knowlege Management is modifying the + FreeWAIS code to support the indexing of multiple content types. The + document returned by WAIS will be an HTML document containing + pointers to the multimedia data. Contact dcmartin@library.ucsf.edu + for further information. + + WAIS is not a fully-featured query/response protocol such as SQL. It + has no concept of fields, or numeric data types. + + It appears to be impossible to retrieve a document from its catalogue + entry in many of the existing databases. + +3.3. World-Wide Web + + The World-Wide Web project (also known as WWW or W3), started and + driven by CERN, is a large-scale distributed hypertext system. It + uses the standard client-server paradigm, with client "browser" + software responsible for fetching and displaying data. Originally + aimed at the High Energy Physics community, it has spread to other + areas. + + Browser software is available for a large number of systems + including: + + o Line-mode dumb terminal. + + o Terminal with Curses support + + o Macintosh + + o X/Motif + + o X11 + + o PC/MS Windows + + + +Adie [Page 34] + +RFC 1614 Network Access to Multimedia Information May 1994 + + + + o NeXT + + There is server software available for: + + o VM mainframes. + + o UNIX + + o Macintosh + + o VMS + + WWW User Image + + The WWW world consists of nodes (usually called documents) and links. + Links are connections between documents: to follow a link, a reader + clicks with a mouse on a word in the source document, which causes + the linked-to document to be retrieved and displayed. (On systems + without a mouse, the user types a number instead.) + + Indexes are special documents which, rather than being read, may be + searched. To search an index, a reader supplies keywords (or other + search criteria). The result of a search is a "virtual" document + containing links to the documents found. All documents, whether + real, virtual or indexes, look similar to the reader. + + The WWW addressing mechanism means that an interface to Gopher and + anonymous FTP information sources may be established, in a way which + is transparent to the user. Thus, the whole of gopherspace is part + of the Web. Transparent gateways to other systems, including Hyper-G + and WAIS, are also available. + + URL + + All nodes on the Web are addressed using the "Universal [or Uniform] + Resource Locator" (URL) syntax, defined in [6]. This is an Internet + Draft produced by the IETF URL Working Group. + + A URL is a name for an object (which may be a document or an index) + on the Internet. It has the general form: + + <scheme> : <path> [ # <anchorid> ] + + The <scheme> identifies an access protocol or method for the object. + Some of the schemes are HTTP (the native WWW protocol), anonymous + FTP, Andrew file system, news, WAIS, Gopher. The <path> component + locates the document in a way significant for the access method. + + + +Adie [Page 35] + +RFC 1614 Network Access to Multimedia Information May 1994 + + + Thus for instance for anonymous FTP, the path includes the fully + qualified domain name of the host on which the document resides, and + the directory and file name under which it may be found. For some + schemes, the <path> may include a search string (or combination of + strings) which is used to address a "virtual" object formed by + searching an index of some kind. The HTTP, WAIS and Gopher schemes + can use search strings, which usually follow the rest of the path, + separated from it by a ?. + + The optional <anchorid> is used for addressing within an object. Its + interpretation is not defined in the URL specification. + + "Partial" URLs may be specified. These are used within a document on + the Web to refer to another "nearby" document - for instance to a + document in another file on the same machine. Certain parts of the + URL (e.g., the scheme and machine name) may be omitted, according to + well-defined rules. This makes it much easier to move groups of + documents around, while maintaining the links within and between + them. + + A URL locates one and only one object on the Internet. However, more + than one URL may point to the same object. Given two URLs, it is not + in general possible to determine whether they refer to the same + object. Furthermore, there is no guarantee that a single URL will + refer to the same object at different times (the object may change + incrementally, or it may be completely replaced with something + different, or it may indeed be removed). + + HTTP + + HTTP (HyperText Transfer Protocol) is the protocol employed between + server and client. It is defined in [7]. The protocol is currently + being revised (see the Future Developments section below), and will + eventually be proposed as an Internet standard. + + The original protocol is extremely simple, and requires only a + reliable connection-oriented transport service, typically TCP/IP. + + The client establishes a connection with the server, and sends a + request containing the word GET, a space, and the partial URL of the + node to be retrieved, terminated by CR LF. The server responds with + the node contents, comprising a text document in the Hypertext Markup + Language (HTML). The end of the contents is indicated by the server + closing the connection. + + + + + + + +Adie [Page 36] + +RFC 1614 Network Access to Multimedia Information May 1994 + + + HTML + + HTML (HyperText Markup Language) is the way in which text documents + must be structured if they are to contain links to other documents. + Non-HTML text documents may of course be made available on the Web, + but they may not contain links to other documents (i.e., they are + leaf nodes), and they will be displayed by browsers without + formatting, probably using a fixed-width font. Like HTTP, HTML is + also undergoing enhancement, but the original version is defined in + [7], and is being submitted as an Internet draft. + + HTML is an application of SGML (Standard Generalized Markup + Language). It defines a range of useful tags for indicating a node + title, paragraph boundaries, headings of several different levels, + highlighting, lists, etc. Anchors are represented using an <A> tag. + + For instance, here is an example of HTML containing an anchor: + + The HTTP protocol implements the WWW <A NAME=13 + HREF="../../Administration/DataModel.html">data model</A> . + + The location of the anchor is the text "data model". It is a source + anchor, with a target given by the URL in the HREF attribute, so the + text would appear highlighted in some way in a client's window, to + indicate that clicking on it would cause a hyperlink to be traversed. + It is also a target anchor, with an anchor ID given by the NAME + attribute. A source anchor referring to this target would specify + #13 at the end of the node's URL. Traversing a hyperlink to this + node would cause the entire node to be retrieved, but the target + anchor text would be displayed in some emphasised way - for instance + if the retrieved text is displayed in a scrolling window, it might be + positioned such that the target anchor appears at the top of the + window. + + Another attribute of the <A> element, TYPE, is also available, which + is intended to describe the nature of the relationship modelled by + the link. However, this is not in extensive use, and there appears + to be no registry of the possible values of such types. + + Future Developments + + HTTP and HTML are currently being extended in a backward-compatible + way to add multimedia facilities. [8] describes the HTTP2 protocol. + The revised HTML is defined in [9]. Both documents are subject to + change (and indeed the HTML2 specification has changed substantially + during the preparation of this report). + + + + + +Adie [Page 37] + +RFC 1614 Network Access to Multimedia Information May 1994 + + + The revised HTML contains many enhancements which are useful for + multimedia support. Some of the most relevant are listed below. + + o "Universal Resource Numbers" are a proposed system for + unique, timeless identifiers of network-accessible files + presently being designed by IETF Working Groups. URNs must + be distinguished from URLs, which contain information + sufficient to locate the document. URNs may be allocated to + nodes and may be represented in source anchors. This saves + client software from retrieving a copy of something it + already has - allowing sensible caching of large video + clips, for instance. The disadvantage is that when + something is changed and given a new URN, the source anchors + of all links which point to it must be changed (and the URNs + of these documents must therefore be changed, and so on). + Therefore, it makes sense to allocate URNs only to very + large documents which change rarely, and not to the + documents which reference them. + + o The title of a destination document may be included as + anattribute of a source anchor. This allows a client to + display the title to the user before or during retrieval, + and also allows data which does not itself contain a title + (e.g., image data) to be given one. + + o There is provision for in-line non-text data (e.g., images, + video, graphics, mathematical equations), which appears in + the samewindow as the main textual material in the node. + + o The concept of the relationship expressed by a hyperlink is + expanded. Both source and target anchors may contain + relation attributes which point forwards and backwards + respectively. Possible relationships include "is an index + for", "is a glossary for", "annotates", "is a reply to", "is + embedded in", "is presented with". The last two are useful + for multimedia - for instance, the "embed" relationship + could cause a retrieved image to be fetched and embedded in + the display of a text node, and the "present" relationship + could cause a sound clip to be automatically retrieved and + presented along with a text node. + + The HTTP2 protocol maintains the same stateless + connect/request/response/close procedure as the current HTTP + protocol. Data is transferred in MIME-shaped messages, allowing all + MIME data formats (including HTML) to be used. As well as the GET + operation, HTTP2 has operations such as: + + + + + +Adie [Page 38] + +RFC 1614 Network Access to Multimedia Information May 1994 + + + HEAD Fetch attribute information about a node + (including the media type and encoding) + + CHECKOUT/CHECKIN/PUT/POST + + These allow nodes to be checked out for updating + and checked back in again, and new nodes to be + created. New node data is supplied in MIME + shape with the request. + + The request from the client can contain a list of formats which the + client is prepared to accept, user identification, authorisation + information (a placeholder at present), an account name to charge any + costs to, and identification of the source anchor of the hyperlink + through which the node was accessed. + + The response from the server may contain a range of useful attributes + (e.g., date, cost, length - but only for non-text data). The server + may redirect the query, indicating a new URL to use instead. It may + also refuse the request because of authorisation failure or absence + of a charge account in the request. + + The protocol also contains a mechanism which is designed to allow the + server to make an intelligent decision about the most appropriate + format in which to return data, based on information supplied in the + request by the client. This may for instance allow a powerful server + to store the uncompressed bitmap of an image, but to compress it on + request using an appropriate encoding, according to the decoding + capabilities announced by the client. + + An HTTP2 server and client are currently under test. Some HTML2 + features are already fitted to the XMosaic browser. + + Mosaic + + The Mosaic project, located at the US National Centre for + Supercomputing Applications (NCSA) at the University of Illinois, is + developing a networked information system intended for wide-area + distributed asynchronous collaboration and hypermedia-based + information discovery and retrieval. Mosaic, which is specifically + oriented towards scientific research workers, has adopted the World + Wide Web as the core of the system, and the first Mosaic software to + appear was the XMosaic WWW client for UNIX with X. Other clients of + similar functionality are under development for the Apple Macintosh + and the PC with Windows. + + The capabilities of the XMosaic browser include: + + + + +Adie [Page 39] + +RFC 1614 Network Access to Multimedia Information May 1994 + + + o Support for NCSA's Data Management Facility (DMF) for + scientific data. + + o Support for transferring data with other NCSA tools such + asCollage, using NCSA's Data Transfer Mechanism (DTM). + + o The ability to "check out" documents for revision, and to + check them back in again. + + o Local and remote annotation of Web documents. + + Future planned functionality includes: + + o In-line non-text data (in addition to images). + + o Information space graphical representation and control. + + o Hypermedia document editing. + + o Information filtering. + + NCSA intends to make the entire Mosaic system publicly available and + distributable. + + The XMosaic browser was used extensively for finding and retrieving + information used to prepare this report. + + Web Publishing + + Making a web is as simple as writing a few SGML files which point to + your existing data. Making it public involves running the FTP or HTTP + daemon, and making at least one link into your web from another. In + fact, any file available by anonymous FTP can be immediately linked + into a web. The very small start-up effort is designed to allow small + contributions. + + At the other end of the scale, large information providers may + provide an HTTP server with full text or keyword indexing. This may + allow access to a large existing database without changing the way + that database is managed. Such gateways have already been made into + Digital's VMS/Help, Technical University of Graz's "Hyper-G", and + Thinking Machine's WAIS systems. + + There are a few editors which understand HTML - for instance on UNIX + and on the NeXT platform. + + + + + + +Adie [Page 40] + +RFC 1614 Network Access to Multimedia Information May 1994 + + + Published non-text data + + See the multimedia demo node on: + + http://hoohoo.ncsa.uiuc.edu:80/mosaic-docs/multimedia.html + + This contains links to images, sound, movies and postscript media + types. The media type is determined by the filename extension in the + URL specification of the target node. The (XMosaic) client uses this + to invoke a separate program appropriate for displaying the media + type, or in some cases it can be displayed embedded within the source + document. The latter method uses an <IMG> tag, which is part of + HTML2. + + Advantages + + WWW is a hypertext system and its underlying technology is thus + richer than Gopher. The use of SGML, which is of increasing + importance in hypermedia systems, allows a great deal of + expressiveness and structure, and enables text to be presented in an + attractive way. The facilities for multimedia data in the extended + versions of HTTP and HTML are excellent. It also seems that QOS and + management issues identified in Chapter 2 are to some degree catered + for in these extensions. + + Limitations + + There is no indication in the source anchor of the media type of the + destination node, or of its size (this has been ruled out on the + argument that the information is likely to degrade with time). It is + necessary to perform a HEAD request (in HTTP2) to deduce this. + + Link source anchors must be in text documents, so non-text nodes must + be leaf nodes. However, with HTML2 using the <IMG> tag, an embedded + bitmap may be used as a source anchor, and the position of the mouse + click within the image is passed to the server, which can then choose + to return a different document depending on where in the image the + mouse was clicked. + + WWW is much less prevalent than Gopher, partly because of an + (erroneous?) perception that setting up an HTTP server is more + complex than setting up a Gopher server. There are only about 60 + servers world-wide; however the growth in the use of WWW is much + faster than the growth in the use of Gopher. The availability of + sophisticated WWW clients such as XMosaic is fuelling this growth. + + + + + + +Adie [Page 41] + +RFC 1614 Network Access to Multimedia Information May 1994 + + +3.4. Evaluating Existing Tools + + This section compares the capabilities of the Gopher, WAIS and + WorldWide Web systems (abbreviated as GWW) to the informal + requirements defined in section 2.3. + + Platforms + + The table below gives the names of the most important client software + for each of GWW on the three most important platforms of interest. + WWW is the weakest, with clients for the Macintosh and the PC still + under development. The main PC Gopher client is "PC Gopher III", + which is a DOS program, not a Windows program. + + CLIENTS Gopher WAIS WWW + + Macintosh TurboGopher WAIStation (No name) + (beta version + available) + + PC with HGopher (two WAIS for Cello (beta + Windows others also Windows, WAIS version + available) Manager available), + Mosaic (beta due + 3Q93) + + UNIX with X Xgopher, XWAIS XMosaic + XMosaic + + At present, multimedia support in most of these clients (where it + exists) is limited to the invocation of external "viewer" programs + for particular media types. The exception is XMosaic, which supports + in-line images in WWW documents. + + Media Types + + The GWW tools can all handle multiple media types well. + + o Text is very well supported by all three tools. WWW offers + facilities for displaying "richer" text, supporting + headings, lists, emphasised text etc., in a standardised way. + + o Image data is also well supported, using either external + viewers (e.g., the TurboGopher client software on a Macintosh + might invoke the JPEGView program to display an image); or + in-line display within a text document (WWW with XMosaic on + UNIX). + + + + +Adie [Page 42] + +RFC 1614 Network Access to Multimedia Information May 1994 + + + o There is little direct support for application-specific + data, but most systems allow data of a nominated type to be + passed to an external viewer or editor program. This tends + to be a function of the client software rather than being + built in to the protocol or server. There has been + discussion in the WWW community about using TeX for + representing mathematical equations, and about providing + "panels" within a text document where a separate application + could render its application-specific data (or indeed any + data which can be represented spatially). This latter + suggestion fits well with the OLE (Object Linking and + Embedding) approach used in Microsoft Windows. + + o Sound can be supported through the external "viewer" + concept. Some platforms don't have readily-available + "viewers" with "tape recorder"-style controls for replaying. + There is no single commonly-accepted sound encoding format. + + o Video data can be handled using external viewers. MPEG and + QuickTime are the most common encodings. + + One essential capability of a client/server protocol is the ability + for the client to determine the type of a node (and a list of + available encodings) before downloading it. WAIS and Gopher transfer + this information in the result set and menu respectively. WWW + clients currently determine this information either from analysing + the URL of a target node, or by the occurrence of the <IMG> tag. The + new WWW HTTP2 protocol allows the media type and encoding of a node + to be determined through a separate interaction with the server. + + The GWW systems all use different methods for expressing type and + encoding. WAIS does not distinguish the encoding from the media + type. WWW is moving to the MIME type/encoding system. Gopher does + not distinguish type and encoding, but Gopher+ does, and is also + moving to the MIME type/encoding system. + + Hyperlinks + + Only the WWW system has hyperlinks. Source anchors may be text, + images, or points within an image. Target anchors may be entire + nodes of any media type, or points within (with HTTP2, portions of) + text nodes. + + Gopher+ could potentially be enhanced to include hyperlinks, but + there seems to be no development effort going towards this - those + who need hyperlinking are using WWW. + + + + + +Adie [Page 43] + +RFC 1614 Network Access to Multimedia Information May 1994 + + + Gopher menus can be constructed to allow alternative views of + gopherspace. For instance, a geographically-organised menu tree of + gopherspace is in place, but a parallel subject-based menu tree could + be added as an alternative way of access to the same data. (There + are in fact moves to set this up.) Since WWW offers a superset of + Gopher functionality, these comments also apply to the Web. In fact, + the Web already has a rudimentary subject tree. + + In both Gopher and WWW, non-textual data may be used in different + information structures without having to maintain more than one copy. + + Presentation + + There is little support in GWW for controlling the presentation of + non-text data. + + o Backdrops are not supported by GWW. + + o Buttons are supported in a limited way - typically, a node + is retrieved by clicking on a highlighted text phrase, or on + an entry in a list. In XMosaic, bitmap images can be used + as buttons. However, there is no support for different + styles of button. Client software may have generic + navigation buttons (e.g., "Back", "Next", "Home") which are + always available and don't form part of a node. + + o Synchronisation in space is not supported by GWW, except + that WWW supports contextual synchronisation of images using + the <IMG> tag. + + o Synchronisation in time is not supported by GWW. + + Searching + + WAIS supports keyword searching, and is very well suited for that + task. The Gopher+ protocol could potentially support multimedia + database querying applications through the ASK attribute, but there + is as yet no server implementation which supports such database + applications. In the WWW project, there are ongoing discussions on + how best to extend HTML to cope with database query applications - an + <INPUT> tag has been suggested - but no consensus has yet emerged. + + Both Gopher and WWW can make use of WAIS-type keyword searching: + either by incorporating WAIS code into the server (enabling WAIS + index files to be searched); or through WAIS gateways, which run + searches on remote WAIS servers in response to queries from non-WAIS + clients. + + + + +Adie [Page 44] + +RFC 1614 Network Access to Multimedia Information May 1994 + + + Interaction + + XMosaic allows users to make text (or on some platforms, audio) + annotations to any text node. The annotations appear at the end of + the text display.. They are held locally - other users of the node + do not see the annotations (but a recently added facility allows + globally-visible annotations held on an "annotation server"). Text + annotations may include hyperlinks to other nodes (provided the user + knows how to use HTML). Other clients do not provide such + facilities. + + There is a move to add an "email" address notation to URL. This + would allow WWW client software to invoke a mail program when a user + selects an anchor with such a URL. + + There are plans to allow WWW users to delineate a rectangular area of + interest within an image for use in an HTTP request. + + There is no support in GWW clients for interacting with sequences of + images in the way described in section 2.3.6. + + Quality of Service + + The user expectations for responsiveness mentioned in section 2.3.7 + are difficult to meet with currently-deployed wide-area network (or + even LAN) technology, particularly for voluminous multimedia data. + None of the GWW systems currently exploit the emerging isochronous + data transfer capabilities of protocols such as RTP and technologies + such as ATM. None of them make serious attempts to alleviate the + problem in other ways (except for WWW, which defines some mechanisms + in HTTP2 for format negotiation based on size and available bandwidth + considerations). + + Management + + The following table shows the support for three key management + facilities in the GWW systems. The first two facilities require + support in the client/server protocol, the third requires support in + the server, but depends on authentication being available. + + Gopher WAIS WWW + + + Access control No No1 Yes, in + and HTTP2 + authentication + + + + + +Adie [Page 45] + +RFC 1614 Network Access to Multimedia Information May 1994 + + + Charging support No No Yes, in + HTTP2 + + Monitoring for No No No + statistical and + assessment + purposes + + Note: + + 1. "Access-control-facility" is a feature of Z39.50 which is not used + by the current WAIS implementations. + + Scripting Requirements + + None of the GWW systems have facilities for the execution of scripts + by the client, because of security issues (it would be too easy for a + malicious "trojan" script to be executed). Gopher and WWW servers + have the ability for a UNIX script to be run by the server, with the + script output returned to the client. Scripting as understood in the + context of stand-alone multimedia applications does not exist in GWW. + + Bytestream Format + + None of the three GWW systems use a bytestream format for + interchanging collections of material. There has been some talk + about setting up a system akin to the "Trickle" mail server, for + retrieving single document nodes from GWW using mail. Such a system + has been implemented for WWW. + + Authoring tools + + Gopher is sufficiently simple to set up that no special authoring + tools are required. WAIS requires only an indexing program (as + discussed in section 3.2) for preparing material for publication. + + WWW, because it uses a sophisticated authoring language (HTML), + benefits from the availability of authoring tools. There are HTML + editors for UNIX (using the tk toolkit) and the NeXT system. There + are no authoring tools designed specifically for exploiting the + multimedia capabilities of WWW, mainly because these capabilities are + still evolving. + + + + + + + + + +Adie [Page 46] + +RFC 1614 Network Access to Multimedia Information May 1994 + + +4. Research + + This section describes some current research projects in the area of + distributed hypermedia information systems. + +4.1. Hyper-G + + Hyper-G [10] is an ambitious distributed hypermedia research project + at a number of institutes of the IIG (Institutes for Information- + Processing Graz), the Computing and Information Services Centre of + the Graz University of Technology, and the Austrian Computer Society. + It is funded by the Austrian Ministry of Science. It combines + concepts of hypermedia, information retrieval systems and + documentation systems with aspects of communication and + collaboration, and computer-supported teaching and learning. + + Unlike WWW, Hyper-G supports bi-directional links. This enables + users to see which other documents reference the one they are using, + and also allows the system to avoid dangling pointers when a linkedto + document is deleted. Another difference from WWW is that links are + kept separately from their source and target nodes, to allow easy + linking of read-only documents and for ease of link maintenance. In + addition to manually defined links, Hyper-G supports automatic static + and dynamic (i.e., view-time) generation and maintenance of links. + + Hyper-G has a concept of generic "structures" - an additional layer + of relationships imposed on (and orthogonal to) the web of documents + and links. A document can be part of more than one structure, and + structures may be hierarchically related. Types of structure + include: + + o "Clusters" are a set of documents which are all + presentedtogether. + + o "Collections" are unordered sets of documents or other + structures, and can be used as query domains or to construct + gopher-like menus. + + o "Paths" are ordered sets of documents or structures, which + must be visited sequentially. + + One application of the structure concept is the provision of "guided + tours" through the information space. + + In addition to hypernavigation, the collection hierarchy and guided + tours, another strategy for interaction with the system is the use of + database queries. Two kinds of query are supported: keyword + searching in a user-defined list of databases; and collection + + + +Adie [Page 47] + +RFC 1614 Network Access to Multimedia Information May 1994 + + + specific form-filling queries. In the latter case, the answer to the + query may appear dynamically as the form is filled out. + + Four modes of user identification are supported: "identified", where + a userid is publicly associated through name and address information + with a particular individual; "semi-identified", where a userid is + associated by the system with an individual, but the user is only + known to other users through a pseudonym; "anonymously identified", + where the userid is not associated by the system with any individual; + and "anonymous", where there is no userid (or a generic userid such + as "guest"). Possible operations in the system depend on the user's + mode of identification. Users may access the system in any desired + mode, and switch to other modes only when necessary. + + Hyper-G contains specific support for multilingual documents and + document clusters. Users may specify an ordered list of preferred + languages, for instance. There are plans to experiment with + automatic translation programs. + + Integration of other, external, systems such as WWW into Hyper-G in a + seamless manner is possible. + + Hyper-G is in use as a CWIS within Graz Technical University. Client + software is available for UNIX workstations from DEC, HP, SGI, and + SUN. The system is still in an experimental state, but it has been + used by about 200 students as part of a course on the social impact + of information technology. + +4.2. Microcosm + + Microcosm [11] is an open hypermedia system developed at the + University of Southampton. It is implemented on the PC under MS + Windows, and versions for the Apple Macintosh and for UNIX with X are + under development. + + Microcosm consists of a number of autonomous processes which + communicate with each other by a message-passing system. Information + about hyperlinks between documents is stored in a link database, or + "linkbase", and is not stored in the documents themselves. This has + the advantages that: + + o Links to and from read-only documents (perhaps stored on CD- + ROM) are possible. + + o Documents need undergo no conversion process to be imported + into the system - they can still be viewed and edited using + the original application which created them, without the + link information getting in the way. + + + +Adie [Page 48] + +RFC 1614 Network Access to Multimedia Information May 1994 + + + + o It is as easy to establish links to and from non-text + documents as text documents. + + In Microcosm, the user interacts with a "viewer" program for a + particular media type. Such programs may be specifically written for + use with Microcosm (about 10 such viewers have been written for a + number of common media types and encodings); or they may be a program + adapted for use with Microcosm (the programmability of Microsoft Word + for Windows has allowed it to be so adapted); or it may even be a + program with no knowledge of Microcosm. + + The user selects an object (e.g., a piece of text) in the viewer, and + requests Microcosm to perform an action with the object - typically + to follow a link to another document. This may involve executing + another viewer to display the target document. + + Microcosm link source anchors may be specific (denoting a unique + point in a particular document), local (denoting any occurrence of a + particular object in a particular document) or generic (denoting any + occurrence of an object in any document). Target anchors may specify + specific objects within a document. Other link styles are + textretrieval links (looking up a full-text index , as WAIS does), + and relevance links to a set of documents using similar vocabulary to + the source document (again, similar to WAIS's relevance feedback). + + Links may be created by readers as well as by authors. Dynamically + computed links may be added to the permanent linkbase for later use. + A history of link traversal is maintained, and "guided tours" may be + established through the system which allow the reader to stray from + and return to the tour. + + Microcosm viewers operate by sending messages to the Microcosm + system. In MS Windows, these messages are transferred using DDE + (Dynamic Data Exchange); in the Apple Macintosh version Apple Events + are used, and sockets are used on UNIX. For viewers which are not + Microcosm aware, the user must transfer the selected object to the + system clipboard before being able to follow a link from it. + + Networking support in Microcosm is currently under development. + Components of Microcosm may be distributed to multiple machines there + is not necessarily a concept of "client" and "server". + + There are problems with the Microcosm approach, common to systems + which maintain link information separately from documents, and which + use external viewers. + + + + + +Adie [Page 49] + +RFC 1614 Network Access to Multimedia Information May 1994 + + + o Documents move and change, thus invalidating links. + Microcosm datestamps links to help to detect (but not + correct) such problems. + + o It is not always clear what links are available to be + followed from a document, since the viewer program is + unaware of the contents of the linkbase. + + o It is not always possible to indicate the object within a + document which is the target anchor of a link. Many viewers + automatically show the start of the document (e.g., a word + processor), or perhaps the entire document (e.g., a picture + viewer). The user has no way of knowing which part of the + target document the link just followed points to. + + Microcosm may be viewed as an integrating hypermedia framework - a + layer on top of a range of existing applications which enables + relationships between different documents to be established. + + Microcosm is currently being "commercialised". + +4.3. AthenaMuse 2 + + AthenaMuse 2 (AM2) is an ambitious distributed hypermedia authoring + and presentation system under development by the AthenaMuse Software + Consortium based at MIT. It is based on the earlier AM1 system + developed as part of MIT's Project Athena. The first version of AM2 + is scheduled for January 1994, and will be "pre-commercial software", + with a fully-commercialised version due about 6 months later. Both + the educational and commercial sectors are the intended market. The + system will initially be based on X and UNIX workstations, but + PC/Windows will also be supported in a second phase. Apple Macintosh + support has a lower priority. + + The specifications of AM2 are available in [12]. Some of the key + points are: + + o AM2 will support import and export of application from and + tostandard forms. The project is watching standards such as + HyTime, MHEG and ODA. + + o Several "application themes", or frequently-occurring + collections of functionality, are viewed as useful. These + are as follows: + + Application Theme Interactive? + Presentation of multimedia data No + Exploration of a rich multimedia Yes + + + +Adie [Page 50] + +RFC 1614 Network Access to Multimedia Information May 1994 + + + environment + Simulation of a real-world scenario Partially + Communication of real-time No + information to the user + Authoring Yes + Annotation of material Yes + + o "Interface templates" allow a multimedia application to make + use of a common format for presenting a range of content. + This is similar to the "backdrop" concept mentioned in + section 2.3.4. + + o A range of link types will be supported. + + o Media content editors and interface/application editors for + structuring will be provided. A third class of editor, the + "hypermedia notebook", will allow readers to excerpt and + annotate media from AM2 applications. + + The project is developing multimedia network services, including the + transmission of digital video, using a client-server paradigm. + +4.4. CEC Research Programmes + + Some of the research programmes sponsored by the Commission for the + European Community (CEC) contain apparently relevant projects. [1] + has further details of some of these projects. + + RACE programme + + The RACE programme is outlined in [13], which should be consulted for + further information about the projects described below. The RACE + programme targets the industrial, commercial and domestic sectors, + and results are not necessarily directly applicable to the research + and academic community. RACE project numbers are given. + + RACE Phase I projects, which have mostly completed: + + R1038 MCPR - Multimedia Communication, Processing and + Representation. This project developed a demonstrator + multimedia system with communications capability for travel + agents. + + R1061 DIMPE - Distributed Integrated Multimedia Publishing + Environment. The project designed and implemented interim + services for compound document handling, and defined a + distributed publishing architecture. + + + + +Adie [Page 51] + +RFC 1614 Network Access to Multimedia Information May 1994 + + + R1078 European Museums Network. This project aimed to demonstrate + interactive navigation through a pool of multimedia museum + objects, using ISDN as the communications network. + + RACE Phase II projects: + + R2008 EuroBridge. + + Aims to demonstrate multi-point multimedia applications + running over DQDB, FDDI and ATM test networks. + + R2043 RAMA - Remote Access to Museum Archives + + This project follows on from R1078. + + R2060 CIO - Coordination, Implementation and Operation of + Multimedia Services. + + One aspect of this project is JVTOS - a "Joint Viewing and + Teleoperation Service". This aims to integrate standard + multimedia applications running on a range of heterogeneous + machines into a cooperative working environment, allowing + individuals to view and interact with multimedia data on + colleague's machines. + + ESPRIT Programme + + The ESPRIT research programme is outlined in [14], which should be + consulted for further information about the projects listed below. + ESPRIT project numbers are given. + + 28 MULTOS - A Multimedia Filing System + + This project, which ran from 1985 to 1990, developed a + client/server system for filing and retrieval of multimedia + documents using the ODA interchange format standard (ODIF). + + 5252 HYTEA - HyperText Authoring + + This project, which runs from 1991 to 1994, aims to develop + a set of authoring tools for large and complex hypermedia + applications. + + 5398 SHAPE - Second Generation Hypermedia Application Project + + This project is developing a portable software environment + comparable to a CASE tool intended to facilitate the + realisation of complex hypermedia applications. + + + +Adie [Page 52] + +RFC 1614 Network Access to Multimedia Information May 1994 + + + + 5633 HYTECH - Hypertextual and Hypermedial Technical + Documentation This project, which ran from 1990-1991, was to + assess the feasibility of hypermedia technology and to + devise needed extensions to it in order to support + applications dealing with technical documentation + management. + + 6586 PEGASUS - Distributed Multimedia Operating System for the + 1990s This project is aimed at the design of an operating + system architecture for scalable distributed multimedia + systems and the development of a validating prototype, the + design and implementation of a distributed complex-object + service and a global name service, the development of + mechanisms for the creation, communication and rendering of + fully digital multimedia documents in real time and in a + distributed fashion, and the design and implementation of an + application for the system: a digital TV director. + + 6606 IDOMENEUS - Information and Data on Open Media for Networks + of Users. This project, which started January 1993, brings + together workers in the database, information retrieval, + networking and hypermedia research communities in the + development of an "ultimate information machine". It "will + coordinate and improve European efforts in the development + of next-generation information environments capable of + maintaining and communicating a largely extended class of + information on an open set of media". Because of the close + match between the subject of the IDOMENEUS project and the + RARE WG-IMM, it is recommended that RARE establish a liaison + with this project. + +4.5. Other + + Some other research projects of less immediate relevance are listed + below. Some of these projects are described further in [1]. + + o Xanadu is a project to develop an "open, social hypermedia" + distributed database server, incorporating CSCW features. + It has been in existance for many years and has been funded + by a number of companies. The current status of this + project is not known, and although iminent availability of + alpha-test versions has been announced more than once, no + software has been delivered. + + o CMIFed [15] is an editing and presentation environment for + portable hypermedia documents being developed at CWI, + Amsterdam, NL. It is based on the "Amsterdam Model" of + + + +Adie [Page 53] + +RFC 1614 Network Access to Multimedia Information May 1994 + + + hypermedia [16], which is an extension of the Dexter + hypertext reference model incorporating "channels" for media + delivery and synchronisation constraints. + + o Deja Vu [17] is a proposed "intelligent" distributed + hypermedia application framework. It is intended as a + vehicle for research in the areas of: hypermedia systems, + object-oriented programming, distributed logic programming, + and intelligent information systems. Proposed techniques + for use in the Deja Vu framework include "inferential + links", defined automatically according to predefined rules. + A scripting language for use both by information providers + and users is planned. This project is at a very early + (proposal) stage, and as yet relatively little software has + been developed. Deja Vu is intended principally as a + research framework rather than as a service tool. + + o Demon is a project at Bellcore, US, investigating the + network requirements of near-term residential multimedia + services. The project is designing and implementing an + experimental application which serves the needs of casual + multimedia users. + + o InfoNote is a distributed, multiuser hypermedia system from + Japan, implemented on a NEC EWS4800 running UNIX and X. + InfoNote has an editor which can create Japanese texts, + figures, and raster images. The same windows are used both + for editors and browsers. The functionality of the window + can be changed at any time if data is not write-protected. + + o MADE - Multimedia Application Demonstration Environment - is + a project at British Telecom's research laboratory which + centres on the use of the developing MHEG standard to access + a multimedia object server. The server platform is a Sun + SPARCstation with an object-oriented database package + (ONTOS). Audio, video, text and graphical media types are + covered. The University of Kent is working on a sub- + project: "Multi-user Indexing in a Distributed Multimedia + Database". + + o Zenith aimed to establish a set of principles to assist + designers and developers of object management systems + intended for distributed multimedia design environments. + The project implemented a prototype generalised multimedia + object management system. + + + + + + +Adie [Page 54] + +RFC 1614 Network Access to Multimedia Information May 1994 + + +5. Standards + +5.1. Structuring Standards + + This section describes some of the important standards for providing + hyperstructure to multimedia data. + + SGML + + SGML (Standard Generalized Markup Language - ISO 8879) is a + metalanguage for defining markup notations for text. SGML is used to + write Document Type Definitions or DTDs, to which individual document + instances must conform. It finds application in a wide and + increasing range of text processing applications. + + The relevance of SGML to distributed hypermedia systems is + surprisingly high, mainly because of the great expressive power of + SGML, and its ability to handle non-textual data using "external + entities" and "notations". + + o The World-Wide Web is an SGML application with its own DTD. + + o The important HyTime hypermedia structuring standard (see + below) is based on SGML. + + o The forthcoming MHEG hypermedia structuring standard (see + below) has an SGML encoding. + + o SGML has been used in research hypermedia systems - for + example Microcosm. + + o SGML is used in some commercial hypermedia systems - for + example DynaText. + + o SGML is of increasing importance for academic publishing + houses. + + It was interesting to note that at a recent (CEC-sponsored) workshop + on Hypertext and Hypermedia standards, most of the speakers were + conversant with and supportive of the use of SGML for such systems. + + A related standard which may become important for SGML on networks is + SDIF (SGML Data Interchange Format - ISO 9069). This standard + specifies how an SGML document, which may exist in a number of + separate files of different media types, may be encoded using ASN.1 + into a single bytestream. The entity structure is preserved, so that + the bytestream may be decoded by the recipient into the same set of + files. + + + +Adie [Page 55] + +RFC 1614 Network Access to Multimedia Information May 1994 + + + HyTime + + HyTime (Hypermedia/Time-Based Structuring Language) is a standardised + infrastructure for the representation of integrated, open hypermedia + documents. It was developed principally by ANSI committee X3V1.8M, + and was subsequently adopted by ISO and published as ISO 10744. + + HyTime is based on SGML. It is not itself an SGML DTD, but provides + constructs and guidelines ("architectural forms") for making DTDs for + describing Hypermedia documents. For instance, the Standard Music + Description Language (SMDL: ISO/IEC Committee Draft 10743) defines a + (meta-)DTD which is an application of HyTime. In fact, HyTime + started as an attempt to produce a markup scheme for music publishing + purposes. + + HyTime specifies how certain concepts common to all hypermedia + documents can be represented using SGML. These concepts include: + + o association of objects within documents with hyperlinks + + o placement and interrelation of objects in space and time + + o logical structure of the document + + o inclusion of non-textual data in the document + + An "object" in HyTime is part of a document, and is unrestricted in + form - it may be video, audio, text, a program, graphics, etc. The + terminology used in HyTime (and in this section) thus differs + slightly from the terminology used in the rest of this report. A + HyTime object corresponds roughly to a node as defined in section + 1.2, and a HyTime document is a hyperdocument in the terminology of + this report. + + HyTime consists of six modules, which are very briefly and + selectively described below: + + o Base module. This provides facilities required by other + modules, including a lexical model for describing element + contents; facilities for identifying policies for coping + with changes to a document, or traversing a link ("activity + tracking"); and the ability to define "container entities" + which can hold multiple data objects. This last was added + to the HyTime standard at a late stage, at the instigation + of Apple Computers Inc, as a "hook" for their Bento + specification [18]. + + + + + +Adie [Page 56] + +RFC 1614 Network Access to Multimedia Information May 1994 + + + o Measurement module. This allows for an object to be located + in time and/or space (which HyTime treats equivalently), or + any other domain which can be represented by a finite + coordinate space, within a bounding box called an "event", + defined by a set of coordinate points. Coordinates may be + expressed in any units (predefined units include + femtoseconds, fortnights, millenia, angstroms, Northern feet + and lightyears!). + + o Location Address module. In addition to the fundamental + ability of SGML to identify and refer to elements, this + module provides a special "named location address" + architectural form which can be used to refer indirectly to + data which spans elements, or which is located in external + entities. Data may also be addressed indirectly through the + use of "queries", which return addresses of objects within + some domain which have properties matching the query. A + "HyQ" notation is provided for defining the query. + + o Hyperlinks module. Two basic types of hyperlink are + defined: the contextual link (clink) has two anchors, one of + which is embedded in a document to explicitly denote the + anchor location; and the independent link (ilink) which may + have more than two anchors, and which does not require the + anchors to be embedded in the document. ilinks thus allow + hyperlink information to be maintained separately from + document content. + + o Scheduling module. This specifies how events in a source + finite coordinate space (FCS) are to be mapped onto a target + FCS. For instance, events on a time axis could be projected + onto a spatial axis for graphical display purposes, or a + "virtual" time axis as used in music could be projected onto + a physical time axis. + + o Rendition module. This allows for individual objects to be + modified before rendition, in an object-specific way. One + example is modification of colours in image so that it can + be displayed using the currently-selected colour map on a + graphics terminal, or changing the volume of an audio + channel according to a user's requirements. + + It is not envisaged that a hypermedia application would need to use + the entire range of HyTime facilities. An application designer is + able to choose appropriate HyTime architectural forms, and to add + application-specific constraints to them. The designer may also of + course use non-HyTime SGML elements and attributes, but these aspects + of the application can't be understood by a "HyTime engine". Even in + + + +Adie [Page 57] + +RFC 1614 Network Access to Multimedia Information May 1994 + + + the absence of a HyTime engine, the HyTime architectural forms + provide a useful base of ideas from which a hypermedia system + designer may wish to work. + + The role of a HyTime engine is not specified in the standard, but + essentially it is a (sub)program which recognises HyTime constructs + in document instances and performs application-independent processing + on them. For instance, it could interact with multimedia network + servers to resolve and access hyperlink anchors. A commercial HyTime + engine (HyMinder) is under development by TechnoTeacher in the US, + and the Interactive Multimedia Group at the University of + Massachusetts - Lowell (contact lrutledg@cs.ulowell.edu) is also + working on a HyTime engine (HyOctane). + + The Davenport group (a loose consortium of interested companies and + individuals) is producing a series of standards on hypermedia which + further constrain the HyTime architectural forms. One example is the + SOFABED module [19], which standardises the representation of certain + kinds of navigational information - tables of contents, indexes and + glossaries. + + HyTime was envisaged as an interchange format rather than as a format + for directly-executable hypermedia applications. It is therefore + very expressive, but may be difficult to optimise for run-time + efficiency. + + An attempt has been made [20] to adapt the hyperlink structure in + WWW's existing HTML DTD to comply with HyTime's clink architectural + form. This requires changes to WWW document instances as well as to + browser software, and in the absence of any immediate benefit it has + found little favour with the WWW community. However, it is possible + that HTML2 will use some aspects of HyTime. + + It is recommended that any further RARE work on networked hypermedia + should take account of the importance of SGML and HyTime. + + MHEG + + MHEG stands for the Multimedia and Hypermedia information coding + Experts Group, also known as ISO/IEC JTC1/SC29/WG12 (it used to come + under SC2). This group is developing a standard "Coded + Representation of Multimedia and Hypermedia Information Objects" (ISO + CD 13522, or CCITT T.171), commonly called MHEG. The standard is to + be published in two parts - part 1 being the base notation, + representing objects using ASN.1, and part 2 being an alternate + notation which uses SGML. Part 1 has nearly (June 1993) achieved CD + status, and is intended to reach full IS in 1994. Part 2 is intended + to reach the CD stage in late 1993. + + + +Adie [Page 58] + +RFC 1614 Network Access to Multimedia Information May 1994 + + + MHEG is suited to interactive hypermedia applications such as on-line + textbooks and encyclopaedia. It is also suited for many of the + interactive multimedia applications currently available (in + platformspecific form) on CD-ROM. MHEG could for instance be used as + the data structuring standard for a future home entertainment + interactive multimedia appliance. Telecommunications operators are + interested in MHEG for providing interactive multimedia services + across ISDN. + + To address such markets, MHEG represents objects in a non-revisable + form, and is therefore unsuitable as an input format for hypermedia + authoring applications: its place is perhaps more as an output format + for such tools. MHEG is thus not a multimedia document processing + format - instead it provides rules for the structure of multimedia + objects which permits the objects to be represented in a convenient + "final" form with the aim of direct presentation. + + The MHEG draft standard is expressed in object-oriented terms. The + main object classes are outlined briefly below. + + o Content class. A content object contains the encoded + (monomedia) information to be presented, along with + attributes which identify the type of information and the + encoding method, and mediaspecific attributes such as fonts + used, sampling rate, image size, etc. + + o Selection class and Modification class. The user may + interact with MHEG objects which inherit interactive + behaviour from these classes. (The MHEG object model + supports multiple inheritance.) + + o Action class. Two types of action may be applied to + objects: projection, which controls how objects are + rendered; and status actions which affect the state of + objects. + + o Link class. MHEG hyperlinks connect a "start" object with + one or more "end" objects. Links consist of a set of + conditions relating to the state of the start object, and a + set of actions which are carried out when these conditions + are satisfied. Links also define the spatio-temporal + relationships between objects. + + o Script class. Script objects are used to describe more + complex interobject linkages (e.g., multiple-source links). + MHEG does not define a scripting language - instead it + provides a formalism for encapsulating scripts which may be + executed by an external program (see SMSL below). + + + +Adie [Page 59] + +RFC 1614 Network Access to Multimedia Information May 1994 + + + + o Composite class. Related objects may be grouped together + into a single composite object (recursively). The + relationships between content objects within a composite + object are determined by link and script objects which also + are members of the composite object. + + o Descriptor class. Descriptor objects contain general + information about sets of interchanged objects, so that a + target system can ensure it has adequate resources to run + the hypermedia application represented by the object set. + + The relationship between HyTime and MHEG has not yet been fully + established. One possible relationship [21] is that an MHEG + application could be the output of a compilation process which used + an equivalent HyTime document as input. This approach would benefit + both from the expressive power of HyTime and the run-time efficiency + of MHEG. However, it has yet to be shown that this is feasible, + since the capabilities of HyTime and MHEG do not completely overlap. + + There seems to be relatively little interest in or awareness of MHEG + within the Internet community, which is only just beginning to be + aware of HyTime. In view of the draft nature of the MHEG standard, + this report recommends that RARE should not invest substantial effort + in MHEG at this time. However, particularly in view of the interest + in it shown by PTTs, a watching brief should be kept on MHEG, as it + may well be relevant in the future. + + ODA + + The Open Document Architecture standard (ODA - ISO 8613 or T.140) is + a compound document interchange format designed for transferring + documents between open systems. It is able to represent documents in + both a formatted form and a processable (i.e., revisable) form, thus + allowing both the content and the printed appearance of the document + to be unambiguously transferred. + + In addition to text data, ODA supports graphics and image data. A + revised version to be published in 1993 will support colour. Future + developments include support for audio content (underway) and video + content (planned). An interface to MHEG is also planned. + + ODA differs from SGML in that the former concerns itself with the + physical appearance of the document, while SGML deliberately avoids + doing so. SGML concerns itself with semantic markup, and can be used + to describe a wide range of data and document architectures. ODA has + a more limited concept of a document. + + + + +Adie [Page 60] + +RFC 1614 Network Access to Multimedia Information May 1994 + + + Hypermedia extensions to ODA (HyperODA) are underway. The extensions + will support: + + o References to data held externally to the document (similar + to SGML's external entities?). + + o Non-linear structures, using contextual and independent + hyperlinks based on the HyTime model. + + o Temporal relationships between document components (e.g., + sequential, parallel, cyclic, duration, start delay). + + HyperODA is not being developed in competition to HyTime or MHEG its + purpose is to add hypermedia features to ODA rather than to be a + completely general framework for hypermedia applications. + + Bearing in mind that: + + o the HyperODA extensions are still under development; + + o in some senses ODA can be seen as a competitor to SGML, + which has greater presence in the hypermedia world; + + o there seems to be a lack of enthusiasm for ODA in the + Internet community (the IETF WG on piloting ODA has + disbanded); + + o Adobe's newly-released Acrobat technology (described below) + will have a significant effect on the marketplace; + + this report recommends that ODA should not form a basis for + investment in networked hypermedia technology by RARE. + + PREMO + + PREMO (Presentation Environment for Multimedia Objects) is a new work + item in ISO/IEC JTC1/SC24 (the graphics standards subcommittee). An + initial draft [22] exists, and the schedule calls for a CD by June + 1994, a DIS by June 1995, and the final IS by June 1996. + + PREMO addresses the construction of, presentation of, and interaction + with multimedia objects. It specifies techniques for creating + audiovisual interactive single and multiple media applications. It + is consistent with the principles of the Computer Graphics Reference + Model (CGRM, ISO 11072), and is defined in object-oriented terms. + + It is not clear how PREMO relates to HyTime and MHEG. Although these + standards are listed in section 2 (References) of the initial draft, + + + +Adie [Page 61] + +RFC 1614 Network Access to Multimedia Information May 1994 + + + they appear not to be mentioned in the text. The wisdom of + developing what appears to be yet another structuring standard for + multimedia data is doubtful. + + The PREMO work is not sufficiently advanced to permit a judgement of + its usefulness in satisfying the requirements under discussion. + + Acrobat + + Adobe, Inc. has introduced a new format called Acrobat PDF, which it + is putting forward as a potential de facto standard for portable + document representation. Based on the Postscript page description + language, Acrobat PDF is also designed to represent the printed + appearance of a document (which may include graphics and images as + well as text. Unlike postscript however, Acrobat PDF allows data to + be extracted from the document. It is thus a revisable format. It + includes support for annotations, hypertext links, bookmarks and + structured documents in markup languages such as SGML. PDF files can + represent both the logical and the formatting structure of the + document. + + Acrobat PFD thus appears to offer very similar functionality to ODA. + Adobe's successful Postscript de facto standard profoundly influenced + information technology - it is possible that if successful, Acrobat + PDF will be almost as important. RARE should be aware of this + technology and its potential impact on multimedia information + systems. + +5.2. Access Mechanisms + + This section describes some standards which are useful in providing + network access to multimedia data. Of course, there are many + multimedia transport protocols, which this report does not attempt to + describe (see [1] for further information). The protocols mentioned + below are search/retrieve protocols which were not mentioned in [1]. + + Multimedia Extensions to SQL + + A new work item in ISO (ISO/IEC JTC1 N2265) to extend the SQL + standard to include multimedia data is expected to be approved + shortly. Initially this work will concentrate on developing a + framework, and on free text data. Support for non-text data will be + added later, using a separate part of the standard for each media + type. + + The expected timescale for this standardisation work is lengthy (part + 1 - the framework - is targeted for completion in 1996). + + + + +Adie [Page 62] + +RFC 1614 Network Access to Multimedia Information May 1994 + + + There are suggestions that this standard could be used as a query + language in conjunction with the HyQ query component of the HyTime + standard. + + DFR + + DFR is the Document Filing and Retrieval system, specified in ISO + 10166-1 and ISO 10166-2. It is intended for office automation + applications, and falls within the Distributed Office Applications + (DOA) model of ISO 10031-1. DFR has design similarities to the ISO + Directory and to the X.400 Message Store, and it is likewise part of + OSI. + + DFR defines a Document Store, which provides a service to a DFR User + over an OSI protocol stack incorporating ROSE (and optionally RTSE). + A document in the Document Store may have a number of attributes + associated with it, including pointers to related documents. There + is support for multiple versions of the same document, and for + hierarchical groups of documents. The access protocol supports + searching for documents based on their attributes. DFR itself does + not restrict the content of documents in any way, but the natural + partner to DFR is the ODA standard for document content. + + It is not clear that DFR offers significantly more useful + functionality than is available from other, simpler access protocols + already in use on the Internet. + +5.3. Other Standards + + This section briefly describes other standards in this area and + discusses their relevance. + + MIME + + MIME (Multipurpose Internet Mail Extensions) is a mechanism for + transferring multimedia information in an RFC822 mail message. STD + 11, RFC 822 defines a message representation protocol which specifies + considerable detail about message headers, but which leaves the + message content as flat ASCII text. RFC 1341 redefines the format of + message bodies to allow multi-part textual and non-textual message + bodies to be represented and exchanged without loss of information. + Because RFC 822 said very little about message content, RFC 1341 is + largely orthogonal to (rather than a revision of) RFC 822. + + MIME provides facilities to include multiple objects in a single + message, to represent text in character sets other than US-ASCII, to + represent formatted multi-font text messages, to represent non + textual material such as images and audio fragments, and generally to + + + +Adie [Page 63] + +RFC 1614 Network Access to Multimedia Information May 1994 + + + facilitate later extensions defining new types of Internet mail for + use by co-operating mail agents. It does not define any structure to + allow relationships between body parts within a message to be + expressed. + + For the purposes of the requirements considered by this report, the + relevance of MIME is that it separates media type from media + encoding, and that it defines a procedure for registering values of + these attributes. + + The MIME construct of chief interest is the "Content-Type" field. + This contains a MIME "type" and "subtype", and any "parameters" which + further qualify the subtype. The register of MIME content-types is + maintained by the Internet Assigned Numbers Authority (IANA). Content + types defined in the MIME standard itself include: + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + +Adie [Page 64] + +RFC 1614 Network Access to Multimedia Information May 1994 + + + Type Subtype Parameters Meaning + + + text plain charset Plain text + + richtext charset Text with SGML-like + markup for + representing + formatting. + + image jpeg JPEG File Interchange + Format + + gif Graphics Interchange + Format + + audio basic 8-bit -law 8kHz PCM + encoding + + video mpeg + + application ODA profile Open Document + (used (Document Architecture + for Application document. + application Profile) + -specific + data) + + octet- name (e.g., General binary data + stream filename); such as an arbitrary + type (for binary file. + human + recipient), + etc. + + postscript Document in + postscript. + + Private experimental values of types and subtypes starting with X may + be used between consenting adults without registration with IANA. + + MIME also defines a "Content-Transfer-Encoding" field, which is used + to specify an invertible mapping between the "native" encoding of a + media type and a representation that may be readily exchanged using + 7bit mail transfer protocols. + + WWW's HTTP2 protocol makes use of MIME media type and encoding + attributes, and also uses MIME's message format for retrieving data + + + +Adie [Page 65] + +RFC 1614 Network Access to Multimedia Information May 1994 + + + from the server. It is the first MIME application to utilise the + 8bit Content-Transfer-Encoding, which essentially means no encoding. + + SMSL + + SMSL is the Standard Multimedia Scripting Language. It is a proposed + new work item for ISO/IEC JTC1/SC18/WG8 (HyTime) and JTC1/SC29/WG12 + (MHEG). The functional requirements are expected to be completed in + 1994, and the coding scheme completed in 1995. + + SMSL is designed as an open language with a similar purpose to + existing vendor-specific scripting languages such as Macromind's + "Lingo", Kaleida's "Script/X", and Gain's "GEL". The intention is to + offer an intermediate open multimedia scripting language which could + be used both for interchange purposes, and for controlling the + presentation of HyTime or MHEG multimedia structures. Several + different approaches to defining SMSL have been suggested, including + using the ANDF (Architecture-Neutral Distribution Format) approach, + and basing SMSL on SGML or on the Scheme language. + + The SMSL work is not sufficiently advanced to permit a judgement of + its usefulness in satisfying the requirements under discussion. + However, it is interesting to note that despite the descriptive power + of HyTime and MHEG, there is still perceived to be a role for + procedural scripting. + + AVIs + + The CCITT is defining a set of Audio Visual Interactive Services + (AVIs), intended for offering to domestic and business consumers over + a national network (e.g., by PTTs). These services will be specified + as T.17x recommendations, and will include MHEG. These services + would also make use of the SMSL work. + + Insufficient information is available about this area to allow its + relevance to be judged. + +5.4. Trade Associations + + This section mentions some trade associations which are involved in + standards making in the multimedia area. + + Interactive Multimedia Association + + The Interactive Multimedia Association (IMA) is an international + trade association with over 250 members, representing a wide spectrum + of multimedia industry players. Members include Apple, Microsoft, + MIT CECI (the developers of AthenaMuse 2), 3DO, and many other + + + +Adie [Page 66] + +RFC 1614 Network Access to Multimedia Information May 1994 + + + important market actors. + + In 1989, the IMA initiated a "Compatibility Project", tasked with + developing technical solutions to the cross-platform compatibility + problem. The Project has published two important documents: + + o "Recommended Practices for Multimedia Portability" [23] + outlines a specification for a common interface to be used + by interactive video delivery systems. It has been adopted + by the US Military as part of Military Standard 1379. + + o "Recommended Practices for Enhancing Digital Audio + Compatibility in Multimedia Systems" [24] defines four + standard digital audio data types and four sampling rates + (from low-end -law 8kHz mono encoding, up through ADPCM + modes to CD-quality 44kHz 16-bit stereo). + + Work is continuing to produce further recommendations on other + issues. + + The Compatibility Project has now initiated a procurement process by + publishing three Request for Technology (RFT) documents, defining the + requirements of a platform-independent interactive multimedia system, + including networking requirements. The RFTs cover "Multimedia System + Services", a "Scripting Language for Interactive Multimedia Titles", + and "Multimedia Data Exchange". An "Architecture Reference Model" + for cross-platform desktop and distributed multimedia systems + provides the framework for these RFTs, which are pragmatic documents + outlining the technical requirements for time-based media handling in + detail. Note that relatively little is said about non-time-based + data. + + A first reading of the Multimedia Data Exchange RFT reveals that the + Apple Bento standard [18] and the Microsoft/IBM RIFF format [25] both + influenced the development of this document. The selected system may + well be based on one or both of these technologies. + + A joint response to the Multimedia System Services RFT has been + received from HP, IBM and Sun. Two responses to the Scripting + Languages RFT have been received - from Kaleida (Script-X) and Gain + Technology (GEL). Two partial responses to the Multimedia Data + Exchange RFT have been received from Apple (Bento) and Avid (Open + Media Framework). + + Responses to the RFTs are currently being analysed by the IMA, and + the result will be announced in November 1993. The specifications + which will eventually result from this process will be important for + future commercial multimedia products. It is important that the + + + +Adie [Page 67] + +RFC 1614 Network Access to Multimedia Information May 1994 + + + community keep a watching brief on the IMA Compatibility Project and + its possible implications for distributed multimedia applications on + the Internet. + + Multimedia Communications Forum + + The Multi-Media [sic] Communications Forum (MMCF) is a recently + formed (June 1993) trade consortium whose initial members include + IBM, National Semiconductor, Apple, Siemens and AT&T. Intended to + complement the work of the IMA, the MMCF plans to develop guidelines + and recommendations for the industry to help ensure "end-to-end + network interconnectivity of multimedia applications, workstations + and devices". They also plan to provide input to standards bodies. + + It is still too early to say whether this forum will succeed. If the + IMA Compatibility Project specifications, when they are published, + leave networking issues open, then MMCF could have an important role + to play. It is recommended that RARE consider becoming an Observing + + Member ($350 US pa), entitling it to attend general and annual MMCF + meetings (but not committee meetings), and to receive minutes and + other general papers (but not working documents); with the prospect + of becoming an Auditing Member ($1200 US pa) later if relevant. + + Multimedia Communications Community of Interest + + This is a very new organisation formed at a meeting in France in June + 1993. Its charter is to promote the use of applications which let + people in different locations view documents, images, graphics and + full-motion video on a PC screen. The remit includes CSCW aspects. + Members of the organisation include IBM, Intel, Northern Telecom, + Telstra (Australia), BT, France Telecom and DB Telekom. The + companies plan field trials of multimedia services in 1Q94. + +6. Future Directions + +6.1. General Comments on the State-of-the-Art + + Distributed hypermedia systems are now emerging from the research + phase into the experimental deployment stage. Every project team + (and standards committee), almost without exception, hopes for their + system to become the de facto standard for hypermedia. + + As we've seen, Gopher and WWW already offer multimedia capability, + but they are still largely oriented to the use of external viewers + for non-text nodes. This "unintegrated" approach is in contrast to + typical stand-alone multimedia applications, where the presentation + of related information in different media is tightly integrated. The + + + +Adie [Page 68] + +RFC 1614 Network Access to Multimedia Information May 1994 + + + in-line image feature of XMosaic and the new version of HTML + currently under development may represent the start of a move towards + greater integration of different media in such distributed hypermedia + systems. + + Three important factors in the design of distributed hypermedia + systems appear to emerge from the preceding chapters of this report. + They can each be formulated in terms of distinctions between two + aspects of the system. + + o A common and apparently fruitful approach to hypermedia + systems is to distinguish the content from the + hyperstructure. Standards work clearly distinguishes + between these concepts, with standards such as MPEG, JPEG, + G.72x, etc, for content; and HyTime or MHEG for structure. + Currently-deployed systems also make this distinction, most + obviously in Gopher, where the structure/content split maps + onto the server filesystem's directory/file split. In a + similar way, the ability to maintain hyperlink information + separately from data is perceived in hypermedia research + circles as a "good thing". Research systems such as + Microcosm and Hyper-G do this, and HyTime with its ilink + element also supports it. WWW does not support this, but + requires link anchors to be edited into source data. There + are problems with this approach, however - see the section + on Microcosm for details. + + o A useful approach to content is to distinguish the media + type from the media encoding. The MIME standard (used by + HTTP2) illustrates how this can be done, and Gopher+ employs + a similar system. + + o The distinction between data and protocol is also important + for some systems. WWW for instance has clearly separate + protocol (HTTP) and data (HTML) specifications. However, + Gopher+ is specified without making this distinction. (The + original Gopher system is very simple and arguably has no + need for such separation.) + + The most significant mismatches between the capabilities of + currentlydeployed systems and user requirements are in the areas of + presentation and quality of service. Adding flexibility in + presentation capabilities to WWW or Gopher should be possible without + any major change to the protocols (although it may require changes to + data formats). Such capabilities could result from the progress + towards greater integration of media types presaged above. However, + improving QOS is significantly more difficult, as it may require + changes at a more fundamental level. The following section outlines + + + +Adie [Page 69] + +RFC 1614 Network Access to Multimedia Information May 1994 + + + some possible solutions to this problem. + +6.2. Quality of Service + + Meeting the responsiveness requirement is certainly the key factor + for the acceptance of networked multimedia information systems in the + user community. To reiterate the requirement given in a previous + section: + + o For simple actions such as "next page", tolerable delays are + of the order of 0.2s. + + o For more complex actions such as "search for documents + containing this word", then a tolerable delay is of the + order of 2s. + + o Users tend to give up waiting for a response after about + 20s. + + There are several methods which may alleviate the problem of poor + responsiveness (or cause the user to revise his or her expectations + of responsiveness!), some of which are described below. + + 1. Give clues that fetching a particular item might be time- + consuming - simply quoting the size (and/or location) may be + sufficient. WAIS and some Gopher clients already quote the + size. + + 2. Display a "progress" indicator while fetching data. + + 3. Allow the user to interact with other, previously fetched + information while waiting for data to be retrieved. The + inability to do this is an annoying limitation of XMosaic. + It can be difficult to implement, except on a multi-threaded + operating system such as OS/2 or Windows NT. + + 4. Allow several fetches to be performed in parallel. Again, + multithreading support makes this easier. This technique is + less likely to be useful if all the nodes being requested + come from the same server. + 5. Pre-fetch information which the client software believes the + user will wish to see next. This requires some "hints" in + the data about which nodes might be good candidates for pre- + fetching. + + 6. Cache information locally. The use of Universal Resource + Numbers (see the section on WWW) is relevant for managing + this. + + + +Adie [Page 70] + +RFC 1614 Network Access to Multimedia Information May 1994 + + + + 7. Where multiple copies of the same information are held in + different network locations, fetch the "nearest" copy. This + is sometimes known as "anycasting", and is a more general + case of local caching. The proposed URN-to-URL resolution + service [26] could be used to support this. + + 8. When retrieving a document, the client should be able to + display the first part of the document to the user. The + user can then start to read the document while the system is + still downloading it. Alternatively, the user may decide + that the document is not relevant and abort the retrieval. + + 9. Offer multiple views of image or video data at different + resolutions and therefore sizes. This enables the user to + select a balance between speed of retrieval and data + quality. Gopher+ and HTML2 both support this. + + 10. Future high-speed networks and protocols (ATM, RTP) will + allow real-time display of isochronous data. Information + systems should be able to take advantage of this. + + A useful description of the problem is given in [27]. This paper + rightly contends that the view, held by many hypermedia researchers + and implementors, that the network is simply a transparent data + highway which needs no special consideration in application design, + is wrong. It is argued that: + + "the very same structural characteristics that may make + a multimedia document appealing to the end user are the + characteristics that are extremely helpful during + dynamic network performance optimisation". + + This is a particularly relevant statement considered in the light of + suggestion 5 above. + +6.3. Recommended Further Work + + To meet the needs of applications such as those described in section + 2.1, the community must seek where possible to adapt and enhance + existing tools, not to build new ones. There is now an opportunity + for RARE to stimulate and encourage this process of adaptation and + enhancement, and the following subsections outline a strategy for + this. + + + + + + + +Adie [Page 71] + +RFC 1614 Network Access to Multimedia Information May 1994 + + + Selecting a System + + In order to have the greatest effect, RARE should concentrate its + efforts on only one of the existing tools. Candidate technologies + are those already outlined: Gopher, WWW, WAIS, Hyper-G, Microcosm and + AthenaMuse 2. + + It is recommended that RARE should select the World-Wide Web to + concentrate its efforts on. The reasons for this decision are as + follows. + + o Flexibility. The rich yet straightforward design of WWW, + with its clearly separable components (HTML, URL and HTTP), + means that it is a very flexible basis on which to develop + distributed multimedia applications. + + o Existing efforts. The WWW implementor community is already + discussing and designing extensions to HTML (HTML2), + intended (among other things) to support multimedia. There + is clearly much interest in this area, and RARE efforts + could complement existing work. + + o Hyperlinks. A clear requirement of many applications is the + availability of hyperlinking, which WWW supports well. + + o Integrated solution. Because WAIS, Gopher and Hyper-G (as + well as anonymous FTP servers) may all be accessed from Web + clients, WWW serves as an important integrating tool for + information services. It is important that distributed + multimedia applications, which require extensive support in + the client software, should be based on a technology "close + to" such integrated clients. + + o Penetration and growth. Although Gopher far surpasses WWW + in the number of servers available, the rate of growth in + WWW usage is greater than that of Gopher. There is an + increasing realisation in the community that Gopher is over- + simplistic for many purposes, and a corresponding increase + in interest in WWW. + + o Attention to QOS issues. There is already an awareness in + the WWW community of the need for achieving an appropriate + QOS, and a mechanism has already been proposed in HTTP2 to + alleviate the problem. + + o Standardisation. The WWW team is taking standardisation of + the existing WWW system components seriously. The URL + format has already been published as an Internet draft (and + + + +Adie [Page 72] + +RFC 1614 Network Access to Multimedia Information May 1994 + + + has been adopted as an important component of the proposed + Internet integrated information infrastructure), and the + current version of HTML is about to follow suit. The use of + SGML as the basis of HTML complies with the perceived + importance of SGML for hypermedia in general (and also fits + in with RARE's approach of adopting appropriate open + standards). + + o Software status. CERN has recently placed the WWW code + developed by it into the public domain. This is unlike all + the other candidate technologies, which all have + restrictions on who can do what with the code. In the case + of Gopher, these restrictions are already causing some + commercial users to look at other options. + + WWW has two significant disadvantages, both of which are being + alleviated: + + o Restricted choice of client software. At present, Apple + Macintosh and PC/MS Windows clients are available in beta + form only. By contrast, there are more than one well-tested + Gopher clients available for these platforms. + + However, other WWW clients for the Mac and MS Windows are in + the pipeline. + + o There is a perception in the community that making + information available over HTTP is difficult, and that it + must be put into HTML. + + However, it is possible to put plain-text, non-HTML + documents onto the Web. Such documents of course cannot + contain links. + + Furthermore, WYSIWYG HTML text editors are available, to + ease the pain of writing HTML. + + The main disadvantages of the other systems are: + + o Gopher is designed for simplicity, and therefore lacks the + flexibility of WWW. In particular its structure is too + inflexibly hierarchical and it does not have hyperlinks. + Its main advantage is its very heavy penetration. However, + because of the WWW approach to accessing data using other + protocols, all of gopherspace is part of the Web. Any Web + client should be able to be a gopher client too. + + + + + +Adie [Page 73] + +RFC 1614 Network Access to Multimedia Information May 1994 + + + It is neither envisaged that Gopher will go away, nor that + it won't be used for multimedia data. However, Gopher is + unlikely to be used for more sophisticated multimedia + applications such as academic publishing, interactive + multimedia databases and CAL, because of the above-mentioned + limitations. + + o WAIS is a specialised tool, and will certainly form part of + the overall solution, particularly for database-type + applications. It is not a general solution for distributed + hypermedia applications. + + o AthenaMuse 2 is commercially-oriented: it is clear that + academic and research users will have to pay to use the + software. Its level of use is thus very unlikely to be as + great as publiclyavailable systems such as WWW. Moreover, + it does not support all the required platforms. + + o Microcosm network support is still in early stages, limited + at present to the PC/Windows platform. If it can be shown + to perform adequately over a network, if it is capable of + scaling to global levels, and if the advantages of + maintaining link information separately from documents are + found clearly to outweigh the consequent difficulties, it + may become important in the future. Microcosm's authors need + to ensure that the commercialisation of Microcosm does not + hinder its adoption by the academic community. + + o Hyper-G is more difficult to dismiss. It is still in a + relatively early stage of development, but appears to have + many of the necessary features. Its main disadvantages are: + (a) the lack of penetration outside the University of Graz - + the author is aware of only one other site using it; and (b) + it is currently limited to UNIX only. The author believes + that, given WWW's head start in terms of deployment, and the + current progress in adding multimedia facilities to it, WWW + stands a much better chance than Hyper-G of being accepted + as the de facto standard for distributed multimedia + applications on the Internet. + + Directions for RARE + + Earlier in this report, it was noted that the most important areas + where effort was needed were (a) provision of facilities for the + integrated presentation of multimedia data (including synchronisation + issues); and (b) ensuring adequate responsiveness. + + + + + +Adie [Page 74] + +RFC 1614 Network Access to Multimedia Information May 1994 + + + Bearing this in mind, it is recommended that RARE should invite + proposals and (subject to funding being available) subsequently + commission work to: + + 1. Develop conversion tools from commercial authoring packages + to WWW, and establish authoring guidelines for authors who + wish to use the conversion tools. This is a significant and + high-profile development aimed at enabling sophisticated + multimedia applications to run over the network. (Authoring + guidelines will be necessary to enable authors to fit in + with the Web's way of doing things, and to document features + of the authoring package which should be avoided because of + conversion difficulties.) + + 2. Implement and evaluate the most promising ways of overcoming + the QOS problem. This is an essential task without which + interactive distributed multimedia applications cannot + become a reality. Some possibilities have already been + outlined in the preceding chapter. + + 3. Implement a specific user project using these tools, in + order to validate that the facilities being developed are + truly relevant to actual user requirements. It may be that + partner funding from the selected user project would be + appropriate. + + 4. Use the experience gained from 1, 2 and 3 to inform and + influence the further development of HTML2 and HTTP2 to + ensure that they provide the required facilities. + + 5. Contribute to the development of the WWW clients + (particularly the Apple Macintosh and PC/MS Windows clients) + in terms of their multimedia data handling facilities. + + Although it is strictly speaking outside the remit of this report + (since it is not specifically concerned with multimedia data), it is + noted that the rapid growth of WWW may in the future lead to problems + through the implementation of multiple, uncoordinated and mutually + incompatible add-on features. To guard against this trend, it may be + appropriate for RARE, in coordination with CERN and other interested + parties such as NCSA, to: + + 6. Encourage the formation of a consortium to coordinate WWW + technical development (protocol enhancements, etc). + + + + + + + +Adie [Page 75] + +RFC 1614 Network Access to Multimedia Information May 1994 + + +7. References + + [1] "A Survey of Distributed Multimedia Research, + Standards and Products", ed. C. Adie, January 1993 + (RARE Technical Report 5). + URL=ftp://ftp.ed.ac.uk/pub/mmsurvey/ + + [2] "The Dexter Hypertext Reference Model", F. Halasz and + M. Schwartz, NIST Hypertext Standardisation Workshop, + January 1990. + + [3] "Response Time and Display Rate in Human Performance + with Computers", B. Shneiderman, Comp. Surveys 16, + 1984. + + [4] "Gopher+: Proposed Enhancements to the Internet + Gopher Protocol", B. Alberti, F. Anklesaria, P. Linder, + M. McCahill, D. Torrey, Summer 1992. + URL=gopher://boombox.micro.umn.edu:70/11/gopher/gop + her_protocol/Gopher%2b + + [5] "WAIS Interface Protocol", F. Davies, B. Kahle, H. + Morris, J. Salem, T. Shen, R. Wang, J. Sui and M. + Grinbaum, April 1990. + URL=ftp://quake.think.com/wais/doc/protspec.txt + + [6] "Uniform Resource Locators", T. Berners-Lee, March + 1993. URL=ftp://info.cern.ch/pub/ietf/url4.ps + + [7] "The HTTP Protocol as Implemented in W3", T. Berners- + Lee, January 1992. + URL=ftp://info.cern.ch/pub/www/doc/http.txt + + [8] "Protocol for the Retrieval and Manipulation of + Textual and Hypermedia Information", T. Berners-Lee, + 1993. URL=ftp://info.cern.ch/pub/www/doc/httpspec.ps + + [9] "Hypertext Markup Language (HTML)", T Berners-Lee, + March 1993. URL=ftp://info.cern.ch/pub/www/doc/html- + spec.ps + + [10] "Hyper-G: A Universal Hypermedia System", F. Kappe and + N. Sherbakov, March 1992. URL=ftp://iicm.tu- + graz.ac.at/pub/HyperG/doc/report333.txt.Z + + + + + + + +Adie [Page 76] + +RFC 1614 Network Access to Multimedia Information May 1994 + + + [11] "Towards an Integrated Information Environment with + Open Hypermedia Systems", H. Davis, W. Hall, I. Heath, + G. Hill, Proceedings of the ACM Conference on + Hypertext, Milan 1992, p181-190. + + [12] "The AthenaMuse 2 Functional Specification", L. + Bolduc, J. Culbert T. Harada, J. Harward, E. + Schlusselberg, May 1992. + URL=ftp://ceci.mit.edu/pub/AM2/funcspec.txt.Z + + [13] "Research and Technology Development in Advanced + Communications Technologies in Europe: RACE '92", + CEC, March 1992. Available from: + raco@postman.dg13.cec.be + + [14] "Esprit Programme Synopses", CEC, October 1992. In + seven volumes. Available from + esprit_order_mailbox@eurokom.ie + + [15] "CMIFed: A Presentation Environment for Portable + Hypermedia Documents", G. van Rossum, J. Jansen, K. S. + Mullender, D. C. A. Bulterman, Amsterdam 1993 (also + presented at ACM Multimedia 93 conference). + URL=ftp://ftp.cwi.nl/pub/CWIreports/CST/CSR9305.ps.Z + + [16] "The Amsterdam Hypermedia Model: extending hypertext + to support real multimedia", L. Hardman, D. C. A. + Bulterman, G. van Rossum, Amsterdam 1993 + URL=ftp://ftp.cwi.nl/pub/CWIreports/CST/CSR9306.ps.Z + + [17] "Deja-Vu Distributed Hypermedia Application + Framework", A. Eliens. + URL=ftp://ftp.cs.vu.nl/eliens/Deja-Vu-proposal.ps + + [18] "Bento Specification", J. Harris and I. Ruben, Apple + Computer Inc, August 1992. + URL=ftp://ftp.apple.com/apple/standards/Bento_1.0d4.1 + + [19] "Davenport Advisory Standard for Hypermedia (DASH), + Module I: Standard Open Formal Architecture for + Browsable Hypermedia Documents (SOFABED)", ed S. R. + Newcomb and V. T. Newcomb. + URL=ftp://sgml1.ex.ac.uk/davenport/sofabed.0.9.6.ps.Z + + [20] Article in comp.text.sgml newsgroup, 24 May 1993, by + Eliot Kimber (drmacro@vnet.ibm.com). + URL=ftp://ftp.ifi.uio.no/SGML/comp.text.sgml/by.msg + id/19930524.152345.29@almaden.ibm.com + + + +Adie [Page 77] + +RFC 1614 Network Access to Multimedia Information May 1994 + + + + [21] "Emerging Hypermedia Standards" B. Markey, Multimedia + for Now and the Future (Usenix Conference + Proceedings), June 1991. + + [22] "Initial Draft PREMO (Presentation Environment for + Multimedia Objects", ISO/IEC JTC1/SC24 N847, November + 1992. + + [23] "Recommended Practices for Multimedia Portability", + Release 1.1 October 1990, Interactive Multimedia + Association, 3 Church Circle, Suite 800, Annapolis, + MD 21401-1993, USA. + + [24] "Recommended Practices for Enhancing Digital Audio + Compatability in Multimedia Systems", Release 3.00 + 1992, Interactive Multimedia Association, 3 Church + Circle, Suite 800, Annapolis, MD 21401-1993, USA. + + [25] "RIFF Tagged File Format", Microsoft Inc, 1992. + + [26] "A Vision of an Integrated Internet Information + Service", C. Weider and P. Deutsch, March 1993, + Work in Progress. + + [27] "Delivering Interactive Multimedia Documents over + Networks", S. Loeb, IEEE Communications Magazine, May + 1992. + + [28] "A Status Report on Networked Information Retrieval: + Tools and Groups", ed. J. Foster, G. Brett and P. + Deutsch, March 1993. + URL=ftp://mailbase.ac.uk/pub/nir/nir.status.report + + + + + + + + + + + + + + + + + + +Adie [Page 78] + +RFC 1614 Network Access to Multimedia Information May 1994 + + +8. Security Considerations + + Security issues are not discussed in this memo. + +9. Author's Address + + Chris Adie + Edinburgh University Computing Service + University Library + George Square + Edinburgh EH8 9LJ + United Kingdom + + Phone: +44 31 650 3363 + Fax: +44 31 662 4809 + EMail: C.J.Adie@edinburgh.ac.uk + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + +Adie [Page 79] + |