1 files changed, 675 insertions, 0 deletions
diff --git a/doc/rfc/rfc1187.txt b/doc/rfc/rfc1187.txt
new file mode 100644
index 0000000..aa30ac8
--- /dev/null
+++ b/doc/rfc/rfc1187.txt
@@ -0,0 +1,675 @@
+
+
+
+
+
+
+Network Working Group                                            M. Rose
+Request for Comments: 1187       Performance Systems International, Inc.
+                                                           K. McCloghrie
+                                                      Hughes LAN Systems
+                                                                J. Davin
+                                     MIT Laboratory for Computer Science
+                                                            October 1990
+
+
+                   Bulk Table Retrieval with the SNMP
+
+1.  Status of this Memo
+
+   This memo reports an interesting family of algorithms for bulk table
+   retrieval using the Simple Network Management Protocol (SNMP).  This
+   memo describes an Experimental Protocol for the Internet community,
+   and requests discussion and suggestions for improvements.  This memo
+   does not specify a standard for the Internet community.  Please refer
+   to the current edition of the "IAB Official Protocol Standards" for
+   the standardization state and status of this protocol.  Distribution
+   of this memo is unlimited.
+
+Table of Contents
+
+   1. Status of this Memo ..................................    1
+   2. Abstract .............................................    1
+   3. Bulk Table Retrieval with the SNMP ...................    2
+   4. The Pipelined Algorithm ..............................    3
+   4.1 The Maximum Number of Active Threads ................    4
+   4.2 Retransmissions .....................................    4
+   4.3 Some Definitions ....................................    4
+   4.4 Top-Level ...........................................    5
+   4.5 Wait for Events .....................................    6
+   4.6 Finding the Median between two OIDs .................    8
+   4.7 Experience with the Pipelined Algorithm .............   10
+   4.8 Dynamic Range of Timeout Values .....................   10
+   4.9 Incorrect Agent Implementations .....................   10
+   5. The Parallel Algorithm ...............................   11
+   5.1 Experience with the Parallel Algorithm ..............   11
+   6. Acknowledgements .....................................   11
+   7. References ...........................................   12
+   Security Considerations..................................   12
+   Authors' Addresses.......................................   12
+
+2.  Abstract
+
+   This memo reports an interesting family of algorithms for bulk table
+   retrieval using the Simple Network Management Protocol (RFC 1157) [1].
+
+
+
+Rose, McCloghrie & Davin                                        [Page 1]
+
+RFC 1187           Bulk Table Retrieval with the SNMP       October 1990
+
+
+   The reader is expected to be familiar with both the Simple Network
+   Management Protocol and SNMP's powerful get-next operator.  Please
+   send comments to: Marshall T. Rose <mrose@psi.com>.
+
+3.  Bulk Table Retrieval with the SNMP
+
+   Empirical evidence has shown that SNMP's powerful get-next operator is
+   effective for table traversal, particularly when the management
+   station is interested in well-defined subsets of a particular table.
+   There has been some concern that bulk table retrieval can not be
+   efficiently accomplished using the powerful get-next operator.  Recent
+   experience suggests otherwise.
+
+   In the simplest case, using the powerful get-next operator, one can
+   traverse an entire table by retrieving one object at a time.  For
+   example, to traverse the entire ipRoutingTable, the management station
+   starts with:
+
+                  get-next (ipRouteDest)
+
+   which might return
+
+                  ipRouteDest.0.0.0.0
+
+   The management station then continues invoking the powerful get-next
+   operator, using the value provided by the previous response, e.g.,
+
+                  get-next (ipRouteDest.0.0.0.0)
+
+   As this sequence continues, each column of the ipRoutingTable can be
+   retrieved, e.g.,
+
+                  get-next (ipRouteDest.192.33.4.0)
+
+   which might return
+
+                  ipRouteIfIndex.0.0.0.0
+
+   Eventually, a response is returned which is outside the table, e.g.,
+
+                  get-next (ipRouteMask.192.33.4.0)
+
+   which might return
+
+                  ipNetToMediaIfIndex.192.33.4.1
+
+   So, using this scheme, O(rows x columns) management operations are
+   required to retrieve the entire table.
+
+
+
+Rose, McCloghrie & Davin                                        [Page 2]
+
+RFC 1187           Bulk Table Retrieval with the SNMP       October 1990
+
+
+   This approach is obviously sub-optimal as the powerful get-next
+   operator can be given several operands.  Thus, the first step is to
+   retrieve an entire row of the table with each operation, e.g.,
+
+              get-next (ipRouteDest, ipRouteIfIndex, ..., ipRouteMask)
+
+   which might return
+
+                  ipRouteDest.0.0.0.0
+                  ipRouteIfIndex.0.0.0.0
+                  ipRouteMask.0.0.0.0
+
+   The management station can then continue invoking the powerful get-
+   next operator, using the results of the previous operation as the
+   operands to the next operation.  Using this scheme O(rows) management
+   operations are required to retrieve the entire table.
+
+   Some have suggested that this is a weakness of the SNMP, in that
+   O(rows) serial operations is time-expensive.  The problem with such
+   arguments however is that implicit emphasis on the word "serial".  In
+   fact, there is nothing to prevent a clever management station from
+   invoking the powerful get-next operation several times, each with
+   different operands, in order to achieve parallelism and pipelining in
+   the network.  Note that this approach requires no changes in the
+   SNMP, nor does it add any significant burden to the agent.
+
+4.  The Pipelined Algorithm
+
+   Let us now consider an algorithm for bulk table retrieval with the
+   SNMP.  In the interests of brevity, the "pipelined algorithm" will
+   retrieve only a single column from the table; without loss of
+   generality, the pipelined algorithm can be easily extended to
+   retrieve all columns.
+
+   The algorithm operates by adopting a multi-threaded approach: each
+   thread generates its own stream of get-next requests and processes
+   the resulting stream of responses.  For a given thread, a request
+   will correspond to a different row in the table.
+
+   Overall retrieval efficiency is improved by being able to keep
+   several transactions in transit, and by having the agent and
+   management station process transactions simultaneously.
+
+   The algorithm will adapt itself to varying network conditions and
+   topologies as well as varying loads on the agent.  It does this both
+   by varying the number of threads that are active (i.e., the number of
+   transactions that are being processed and in transit) and by varying
+   the retransmission timeout.  These parameters are varied based on the
+
+
+
+Rose, McCloghrie & Davin                                        [Page 3]
+
+RFC 1187           Bulk Table Retrieval with the SNMP       October 1990
+
+
+   transaction round-trip-time and on the loss/timeout of transactions.
+
+4.1.  The Maximum Number of Active Threads
+
+   One part of the pipelined algorithm which must be dynamic to get best
+   results is the determination of how many threads to have active at a
+   time.  With only one thread active, the pipelined algorithm
+   degenerates to the serial algorithm mentioned earlier.  With more
+   threads than necessary, there is a danger of overrunning the agent,
+   whose only recourse is to drop requests, which is wasteful.  The
+   ideal number is just enough to have the next request arrive at the
+   agent, just as it finishes processing the previous request.  This
+   obviously depends on the round-trip time, which not only varies
+   dynamically depending on network topology and traffic-load, but can
+   also be different for different tables in the same agent.
+
+   With too few threads active, the round-trip time barely increases
+   with each increase in the number of active threads; with too many,
+   the round-trip time increases by the amount of time taken by the
+   agent to process one request.  The number is dynamically estimated by
+   calculating the round-trip-time divided by the number of active
+   threads; whenever this value takes on a new minimum value, the limit
+   on the number of threads is adjusted to be the number of threads
+   active at the time the corresponding request was sent (plus one to
+   allow for loss of requests).
+
+4.2.  Retransmissions
+
+   When there are no gateways between the manager and agent, the
+   likelihood of in-order arrival of requests and responses is quite
+   high.  At present, the decision to retransmit is based solely on the
+   timeout.  One possible optimization is for the manager to remember
+   the order in which requests are sent, and correlate this to incoming
+   responses.  If one thread receives a response before another thread
+   which sent an earlier request, then lossage could be assumed, and a
+   retransmission made immediately.
+
+4.3.  Some Definitions
+
+   To begin, let us define a "thread" as some state information kept in
+   the management station which corresponds to a portion of the table to
+   be retrieved.  A thread has several bits of information associated
+   with it:
+
+      (1)  the range of SNMP request-ids which this thread can use,
+           along with the last request-id used;
+
+      (2)  last SNMP message sent, the number of times it has been
+
+
+
+Rose, McCloghrie & Davin                                        [Page 4]
+
+RFC 1187           Bulk Table Retrieval with the SNMP       October 1990
+
+
+           (re)sent, the time it was (re)sent;
+
+      (3)  the inclusive lower-bound and exclusive upper-bound of
+           the object-instance for the portion of the table that
+           this thread will retrieve, along with the current
+           object-instance being used;
+
+      (4)  the number of threads which were active at the time it
+           was last sent;
+
+   When a thread is created, it automatically sends a get-next message
+   using its inclusive lower-bound OID.  Further, it is placed at the
+   end of the "thread queue".
+
+   Let us also define an OID as a concrete representation of an object
+   identifier which contains two parts:
+
+      (1)  the number of sub-identifiers present, "nelem";
+
+      (2)  the sub-identifiers themselves in an array, "elems",
+           indexed from 1 up to (and including) "nelem".
+
+4.4.  Top-Level
+
+   The top-level consists of starting three threads, and then entering a
+   loop.  As long as there are existing threads, the top-level waits for
+   events (described next), and then acts upon the incoming messages.
+   For each thread which received a response, a check is made to see if
+   the OID of the response is greater than or equal to the exclusive
+   upper-bound of the thread.  If so, the portion of the table
+   corresponding to the thread has been completely retrieved, so the
+   thread is destroyed.
+
+   Otherwise, the variable bindings in the response are stored.
+   Following this, if a new thread should be created, then the portion
+   of the table corresponding to the thread is split accordingly.
+   Regardless, another powerful get-next operator is issued on behalf of
+   the thread.
+
+   The initial starting positions (column, column.127, and column.192),
+   were selected to form optimal partitions for tables which are indexed
+   by IP addresses.  The algorithm could easily be modified to choose
+   other partitions; however, it must be stressed that the current
+   choices work for any tabular object.
+
+      pipelined_algorithm (column)
+      OID  column;
+      {
+
+
+
+Rose, McCloghrie & Davin                                        [Page 5]
+
+RFC 1187           Bulk Table Retrieval with the SNMP       October 1990
+
+
+          timeout ::= some initial value;
+
+          start new thread for [column, column.127);
+          start new thread for [column.127, column.192);
+          start new thread for [column.192, column+1);
+
+          while (threads exist) {
+             wait for events;
+             foreach (thread that has an incoming message,
+                      examined in order from the thread queue) {
+                 OID     a;
+
+                 if (message's OID >= thread's upper-bound) {
+                     destroy thread;
+                     continue;
+                 }
+
+                 store variable-bindings from message;
+
+                 if (number of simultaneous threads does NOT
+                             exceed a maximum number
+                          && NOT backoff
+                          && (a ::= oid_median (message's OID,
+                                                thread's
+                                                    upper-bound))) {
+                      start new thread for [a, thread's upper-bound);
+                      thread's upper-bound ::= a;
+                      place thread at end of thread queue;
+                      backoff ::= TRUE;
+                  }
+                  do another get-next for thread;
+              }
+          }
+      }
+
+
+4.5.  Wait for Events
+
+   Waiting for events consists of waiting a small amount of time or
+   until at least one message is received.
+
+   Any messages encountered are then collated with the appropriate
+   thread.  In addition, the largest round-trip time for
+   request/responses is measured, and the maximum number of active
+   threads is calculated.
+
+   Next, the timeout is adjusted: if no responses were received, then
+   the timeout is doubled; otherwise, a timeout-adjustment is calculated
+
+
+
+Rose, McCloghrie & Davin                                        [Page 6]
+
+RFC 1187           Bulk Table Retrieval with the SNMP       October 1990
+
+
+   as 1.5 times the largest observed round-trip time.  If the timeout-
+   adjustment is greater than the current timeout, the current timeout
+   is set to the timeout-adjustment.  Otherwise, the current timeout is
+   averaged with the timeout-adjustment.
+
+   Finally, if at least one thread did not receive a response, then the
+   thread is identified which has waited the longest.  If the elapsed
+   time (with noise factor) since the last request (or retransmission)
+   is greater than the current timeout value, another retransmission is
+   attempted.
+
+   wait for events ()
+   {
+       backoff ::= TRUE, maxrtt ::= 0;
+       find the thread which has been waiting the longest
+           for a response;
+       timedelta = timeout
+                       - time since request was sent for thread;
+       wait up to timedelta seconds or until some messages arrive;
+
+       if (least one message arrived) {
+           discard any messages which aren't responses;
+           foreach (response which corresponds to a thread) {
+               if (the response is a duplicate)
+                   drop it and continue;
+
+               if (this response is for a message that was
+                       not retransmitted) {
+                  if (the round-trip time is larger than maxrtt)
+                       set maxrtt to the new round-trip time;
+                   if (round-trip time / number of active threads
+                         < minimum previous round-trip time / number
+                              of active threads) {
+                       set new minimum round-trip time per number of
+                           active threads
+                       set new maximum number of threads
+                  }
+                   backoff ::= FALSE;
+               }
+           }
+       }
+       if (backoff)
+           double timeout;
+       elsif (maxrtt > 0) {
+          timeadjust ::= maxrtt * 3 / 2;
+           if (timeadjust > timeout)
+               timeout ::= timeadjust; backoff ::= TRUE;
+           else
+
+
+
+Rose, McCloghrie & Davin                                        [Page 7]
+
+RFC 1187           Bulk Table Retrieval with the SNMP       October 1990
+
+
+               timeout ::= (timeout + timeadjust) / 2;
+       }
+       if (timeout exceeds some threshold)
+          set timeout to that threshold;
+      elsif (timeout is smaller than some threshold)
+           set timeout to that threshold;
+
+       if (at least one thread didn't receive a response) {
+           find the thread which has been waiting the longest
+               for a response,
+               and determine the elapsed time since a message
+               was sent;
+           if (the elapsed time with noise is greater than timeout) {
+               if (the number of retransmissions for this thread
+                       exceeds a threshold)
+                   abort the algorithm;
+               retransmit the request;
+               backoff ::= TRUE;
+           }
+       }
+  }
+
+4.6.  Finding the Median between two OIDs
+
+   The object identifier space is neither uniform nor continuous.  As
+   such, it is not always possible to choose an object identifier which
+   is lexicographically-between two arbitrary object identifiers.  In
+   view of this, the pipelined algorithm makes a best-effort attempt.
+
+   Starting from the beginning, each sub-identifier of the two OIDs is
+   scanned until a difference is encountered.  At this point there are
+   several possible conditions:
+
+      (1)  The upper OID has run out of sub-identifiers.  In this
+           case, either the two OIDs are are identical or the lower
+           OID is greater than the upper OID (an interface error),
+           so no OID is returned.
+
+      (2)  The lower OID has run out of sub-identifiers.  In this
+           case, the first subsequent non-zero sub-identifier from
+           the upper OID is located.  If no such sub-identifier is
+           found, then no OID exists between the lower and upper
+           OIDs, and no OID is returned.  Otherwise, a copy of the
+           upper OID is made, but truncated at this non-zero
+           sub-identifier, which is subsequently halved, and the
+           resulting OID is returned.
+
+      (3)  Otherwise, a copy of the lower OID is made, but truncated
+
+
+
+Rose, McCloghrie & Davin                                        [Page 8]
+
+RFC 1187           Bulk Table Retrieval with the SNMP       October 1990
+
+
+           at the point of difference.  This last sub-identifier is
+           then set to the arithmetic mean of the difference.  In
+           the case where the difference is only 1 (so the last
+           sub-identifier remains the same) then a new sub-
+           identifier is added, taking care to be larger than a
+           possible sub-identifier present in the lower OID.
+           Regardless, the resulting OID is returned.
+
+       oid_median (lower, upper)
+       OID     lower,
+               upper;
+       {
+           for (i ::= 1; i < upper:nelem; i++) {
+               if (i > lower:nelem) {
+                   while (upper:elems[i] == 0)
+                       if (++i > upper:nelem)
+                           return NULL;
+                   median ::= copy of upper;
+                   median:nelem ::= i;
+                   median:elems[i] ::= upper:elems[i] / 2;
+
+                   return median;
+              }
+
+              if (lower:elems[i] == upper:elems[i])
+                  continue;
+
+               median ::= copy of lower;
+               median:nelem ::= i;
+               median:elems[i] ::= (lower:elems[i]+upper:elems[i])/2;
+               if (median:elems[i] == lower:elems[i]) {
+                   median:nelem ::= (i + 1);
+                  if (lower:nelem < i)
+                      median:elems[median:nelem] ::= 127;
+                   elsif ((x ::= lower:elems[i + 1]) >= 16383)
+                      median:elems[median:nelem] ::= x + 16383;
+                   elsif (x >= 4095)
+                      median:elems[median:nelem] ::= x + 4095;
+                   elsif (x >= 1023)
+                       median:elems[median:nelem] ::= x + 1023;
+                   elsif (x >= 255)
+                       median:elems[median:nelem] ::= x + 255;
+                   else median:elems[median:nelem] ::=
+                                                (x / 2) + 128;
+               }
+
+                return median;
+           }
+
+
+
+Rose, McCloghrie & Davin                                        [Page 9]
+
+RFC 1187           Bulk Table Retrieval with the SNMP       October 1990
+
+
+           return NULL;
+       }
+
+4.7.  Experience with the Pipelined Algorithm
+
+   This pipelined algorithm has been implemented and some
+   experimentation has been performed.  It would be premature to provide
+   extensive performance figures at this time, as the pipelined
+   algorithm is still being tuned, and is implemented only in a
+   prototype setting.  However, on tables of size O(2500), performance
+   of 121 entries/second has been observed.  In contrast, the serial
+   algorithm has performance of roughly 56 entries/second for the same
+   table.
+
+4.8.  Dynamic Range of Timeout Values
+
+   It should be noted that the pipelined algorithm takes a simplistic
+   approach with the timeout value: it does not maintain a history of
+   the value and act accordingly.
+
+   For example, if the timeout reaches the maximum timeout limit, and
+   then latches for some period of time, this indicates a resource
+   (either the network or the agent) is saturated.  Unfortunately, a
+   solution is difficult: an elegant approach would be to combine two
+   threads (but it is quite possible that no two consecutive threads
+   exist when this determination is made).  Another approach might be to
+   delay the transmission for threads which are ready to issue a new
+   get-next operation.
+
+   Similarly, if the timeout drops to the minimum value and subsequently
+   latches, more threads should be started.
+
+4.9.  Incorrect Agent Implementations
+
+   An interesting result is that many agents do not properly implement
+   the powerful get-next operator.  In particular, when a get-next
+   request contains an operand with an arbitrarily-generated suffix,
+   some agent implementations will handle this improperly, and
+   ultimately return a result which is lexicographically less than the
+   operand!
+
+   A typical cause of this is when the instance-identifier for a
+   columnar object is formed by a MAC or IP address, so each octet of
+   the address forms a sub-identifier of the instance-identifier.  In
+   such circumstances, the incorrect agent implementations compare
+   against only the least significant octet of the sub-identifiers in
+   the operand, instead of the full value of the sub-identifiers.
+
+
+
+
+Rose, McCloghrie & Davin                                       [Page 10]
+
+RFC 1187           Bulk Table Retrieval with the SNMP       October 1990
+
+
+   Upon encountering such an interaction, the pipelined algorithm
+   implementation declares the thread dead (noting a possible gap in the
+   table), and continues.
+
+5.  The Parallel Algorithm
+
+   One interesting optimization is to view the problem in two steps: in
+   the first step, one column of the table is traversed to determine the
+   full range of instances identifiers meaningful in the table.
+   (Indeed, although as described above, the pipelined algorithm
+   retrieves a single column, the prototype implementation can retrieve
+   multiple columns).  In the second step, additional columns can be
+   retrieved using the SNMP get operation, since the instance
+   identifiers are already known.  Further, the manager can dynamically
+   determine how many variables can be placed in a single SNMP get
+   operation in order to minimize the number of requests.  Of course,
+   since the agent's execution of the get operation is often less
+   expensive than execution of the powerful get-next operation, when
+   multiple columns are request, this two-step process requires less
+   execution time on the agent.
+
+   A second algorithm can be developed, the "parallel algorithm".  At
+   present, each thread is mapped onto a single SNMP operation.  A
+   powerful insight is to suggest mapping several threads onto a single
+   SNMP operation: the manager must dynamically determine how many
+   variables can be placed in a single powerful get-next operation.
+   This has the advantage of reducing traffic, at the expense of
+   requiring the agent to utilize more resources for each request.
+
+   Earlier it was noted that the serial retrieval of objects could be
+   viewed as a degenerate case of the pipelined algorithm, in which the
+   number of active threads was one.  Similarly, the pipelined algorithm
+   is a special case of the parallel algorithm, in which the number of
+   threads per SNMP operation is one.
+
+5.1.  Experience with the Parallel Algorithm
+
+   The parallel algorithm has been implemented and some experimentation
+   has been performed.  It would be premature to provide extensive
+   performance figures at this time, as the algorithm is still being
+   tuned, and is implemented only in a prototype setting.  However, on
+   tables of size O(2500), performance of 320 entries/second has been
+   observed, a performance improvement of 571% over the serial
+   algorithm.
+
+6.  Acknowledgements
+
+   A lot of the ideas on pipelining are motivated by Van Jacobson's work
+
+
+
+Rose, McCloghrie & Davin                                       [Page 11]
+
+RFC 1187           Bulk Table Retrieval with the SNMP       October 1990
+
+
+   on adaptive timers in TCP.  The parallelization modifications were
+   originally suggested by Jeffrey D. Case.
+
+   Finally, the comments of the following individual is acknowledged:
+
+      Frank Kastenholz, Racal-Interlan
+
+7.  References
+
+   [1] Case, J., Fedor, M., Schoffstall, M., and J. Davin, Simple
+       Network Management Protocol (SNMP), RFC 1157, SNMP Research,
+       Performance Systems International, Performance Systems
+       International, MIT Laboratory for Computer Science, May 1990.
+
+Security Considerations
+
+   Security issues are not discussed in this memo.
+
+Authors' Addresses
+
+   Marshall T. Rose
+   PSI, Inc.
+   PSI California Office
+   P.O. Box 391776
+   Mountain View, CA 94039
+
+   Phone: (415) 961-3380
+   EMail: mrose@PSI.COM
+
+
+   Keith McCloghrie
+   Hughes LAN Systems
+   1225 Charleston Road
+   Mountain View, CA 94043
+
+   Phone: (415) 966-7934
+   EMail: KZM@HLS.COM
+
+
+   James R. Davin
+   MIT Laboratory for Computer Science, NE43-507
+   545 Technology Square
+   Cambridge, MA 02139
+
+   Phone:  (617) 253-6020
+   EMail:  jrd@ptt.lcs.mit.edu
+
+
+
+
+
+Rose, McCloghrie & Davin                                       [Page 12]
+
+\ No newline at end of file