File:  [CSRG BSD Unix] / 43BSDReno / contrib / isode-beta / doc / ufn / ufn.tex
Revision 1.1.1.1 (vendor branch): download - view: text, annotated - select for diffs
Tue Apr 24 16:12:56 2018 UTC (8 years, 1 month ago) by root
Branches: MAIN, BSD
CVS tags: HEAD, BSD43reno
BSD 4.3reno

%\documentstyle [11pt,ucl-rn-pic,tgrind,a4] {article}
\documentstyle [11pt,ucl-rn,tgrind] {article}
\author {S.E. Kille}
\date {June 1990}
\rnnumber{RN/90/29}
\title {Using the OSI Directory to achieve \\  User Friendly Naming}
\begin {document}
\maketitle
\begin {abstract}
The OSI Directory has user friendly naming as a goal.  A simple minded usage
of the directory does not achieve this.  Two aspects not achieved are:

\begin{itemize}
\item A user oriented notation
\item Guessability
\end{itemize}

This proposal sets out some conventions for representing names in a friendly
manner, and shows how this can be used to achieve really friendly naming.
\end {abstract}

\tableofcontents
\listoffigures
\listoftables
\pagebreak

\section {Why a notation is needed}

Many OSI Applications make use of Distinguished Names (DN) as defined in the
OSI Directory \cite{CCITT.Directory}.
The main reason for having a notation for
is for interacting with a user
interface.  This proposal is coming dangerously close to the sin of
standardising interfaces.  However, there are aspects of presentation which
it is desirable to standardise.

In very many cases, a user will be required to
input a name.  This notation is designed to allow this to happen in a
uniform manner across many user interfaces.    The intention is that
the name can just be typed in.  There should not be any need to engage in
form filling or complex dialogue.

A secondary goal, is to have a common
format to be able to unambiguously refer to names.
This notation should {\em not} be needed to write on business cards or in
the minutes of meetings.  If this is the case, the directory has failed to a
large extent.  It should be possible to take the ``human'' description given
at the meeting, and use it directly.  The means in which this happens will
become clear later.  In practice, it will often be useful to explicitly write
down directory names, even though it is not strictly needed.

\section {A notation for Distinguished Name}

\subsection {Goals}

The following goals are laid out:

\begin {itemize}
\item  It should be intuitive for the majority of names encountered
\item It should be fully general
\item It should be possible to lay it out in a number of ways
\end {itemize}

\subsection {Informal definition}

This notation is designed to be convenient for common forms of name.
Some examples are given. 
My directory (distinguished) name would be written:

\begin{verbatim}
Steve Kille, Computer Science, University College London, GB  
\end{verbatim}

This may be folded, perhaps to display in multi-column format.
For example:

\begin {verbatim}
Steve Kille, 
Computer Science, 
University College London, 
GB  
\end{verbatim}

or 

\begin {verbatim}
Steve Kille 
Computer Science 
University College London, GB  
\end{verbatim}

Another name might be:


\begin {verbatim}
Christian Huitema, INRIA, FR
\end{verbatim}

An example, showing how different attribute types are handled:


\begin {verbatim}
James Hacker 
locality=Basingstoke 
Widget Inc 
GB 
\end{verbatim}

An example showing quoting of a comma in an Organisation name:

\begin {verbatim}
L. Eagle, "Sue, Grabbit and Runn", GB
\end{verbatim}


\subsection {Formal definition}


The structure is specified in a BNF grammar in Figure~\ref{BNF}.

\begin{figure}
\begin {verbatim}
<name> ::= <name-component>
       | <name-component> <spaced-separator> <name>

<spaced-separator> ::= <optional-space>  
                <separator>
                <optional-space>

<separator> ::= "," ( <CR> )
              | <CR> 

<optional-space> ::= *( " " ) 

<name-component> ::= <attribute> 
        | <attribute> <optional-space> "+" ( <CR> ) 
          <optional-space> <name-component>

<attribute> ::= <string>
        | <key> <optional-space> "=" <optional-space> <string>

<key> ::= 1*( <keychar> )
<keychar> ::= letters, numbers, and space

<string> ::= *( <stringchar> | <pair> )
         | '"' *( <stringchar> | "," | "+" | "=" | <CR> ) '"'
         
<special> ::= "," | "=" | '"' | <CR> | "\" | "+"
<pair> ::= "\" <special> 
<stringchar> ::= any char except <special>

\end{verbatim}
\caption {BNF Grammar for Distinguished and Purported Name}
\label {BNF}
\end{figure}


The quoting mechanism is used for the following cases:

\begin {itemize}
\item Strings containing ``,'', ``+'', ``=''or ``\verb|"|''  or \verb|<CR>|
\item Strings with leading or trailing spaces
\item Strings containing consecutive spaces
\end {itemize}

A list of valid keywords for well known attribute types used in naming is
given in Table~\ref{keywords}.  This is a list of keywords which must be
supported.  These are chosen because they appear in common forms of name,
and can do so in a place which does not correspond to the default schema
used.
If other attributes are used for naming, this can always be
extended locally.

\begin{table}
\begin{center}
\begin {tabular}{l l}
Key & Attribute (X.520 keys) \\
\hline
Locality & LocalityName \\
Area & LocalityName \\
Organisation & OrganizationName \\
Organization & OrganizationName \\
\end {tabular}
\end{center}
\caption {Standardised Keywords}
\label {keywords}
\end{table}


Only string type attributes are
considered, but other attribute syntaxes could be supported locally.
It is assumed that the interface will translate from the supplied string
into PrintableString or T.61.

The \verb|"+"| notation is used to specify multi-component RDNs.  In this
case, the types for attributes in the RDN must be explicit.

The name is presented/input in a little-endian order.  If no type are given,
a type hierarchy (schema) is assumed as:

\begin {verbatim}
Common Name, (((Organisational Unit)*,  Organisation,) Country)
\end{verbatim}

Explicitly typed RDNs may be inserted into this hierarchy.  Two types of
object are named.  They are distinguished by context in the application:

\begin{description}
\item[Common Name Object] Typically a leaf object and a person.  In this
case, the LHS RDN is assumed to be common name, and other RDNs follow the
hierarchy.

\item[Other Object] Typically a non-leaf object.  
\end{description}


\section {Purported names}


A purported name is what a user supplies to an interface for resolution
into  one or more distinguished names.
The same
notation is used as for distinguished name.  
A purported name may differ from a distinguished name in a number of ways
which are itemised below.

Typically, an
interface will display a distinguished name, using the distinguished name
notation.  However, it may display a purported name, as this will be more
pleasing to the user.  The most common example of this is where the higher
components of the distinguished name are not displayed (abbreviation).

The ways in which a purported name may vary from a distinguished name are
now described:

\begin {description}
\item[Abbreviation] Some of the more significant components of the DN will
be omitted, and then defaulted in some way (e.g., relative to a local
context).  For example:

\begin {verbatim}
Steve Kille
\end{verbatim}

\item[Component Omission]
An intermediate component of the name may be omitted.  Typically this will
be an organisational unit.   For example:

\begin {verbatim}
Steve Kille, University College London, GB
\end{verbatim}


In some cases, this can be combined with abbreviation.  For example:

\begin {verbatim}
Steve Kille, University College London
\end{verbatim}

\item[Approximation] Approximate renditions or alternate values
of one or more of the components
will be supplied. For example:

\begin {verbatim}
Stephen Kille, CS, UCL, GB   
\end{verbatim}

or

\begin{verbatim}
Steve Keill, Comp Sci, Univarstiy College London, GB
\end{verbatim}


\item[Friendly Country] A ``friendly country name'' can be used instead of
the ISO 3166 two letter code.  For example: UK; USA; France; Deutchland.  

\item[Type Changing] A type may be omitted, even where it does not follow
the hierarchy.   Type variants can be explored.
The previous distinguished name:
\begin {verbatim}
James Hacker 
locality=Basingstoke 
Widget Inc 
GB 
\end{verbatim}

Could be represented by the purported name:

\begin {verbatim}
James Hacker 
Basingstoke 
Widget Inc 
GB 
\end{verbatim}


\end {description}


\section {Matching a purported name}

The major utility of the purported name is to provide the important ``user
friendly'' characteristic of guessability.  A user will supply a purported
name to a user interface, and this will be resolved onto a distinguished
name.   
When a user supplies a purported name there is a need to derive the DN.  In
most cases, it should be possible to derive a single name from the purported
name.  In some cases, ambiguities will arise and the user will be prompted
to select from a multiple matches.  This should also be the case where a
component of the name did not ``match very well''.   

There is an assumption that the user will simply enter the name correctly.
The purported name variants are designed to make this happen!  There is no
need for fancy window based interfaces or form filling for many applications
of the directory.  Note that the fancy interfaces still have a role for
browsing, and for more complex matching.  This type of naming is to deal
with cases where information on a known user is desired and keyed on the
user's name.

\subsection {Environment}




All matches occur in the context of a local environment.
The local environment defines a 
sequence of name of a non-leaf objects in the DIT.   
This environment effectively defines a list of acceptable name abbreviations
where the DUA is employed.   It also defines an order in which to operate.


This list is defined in the context of the number of name components
supplied.  This allows varying heuristics, depending on the environment, to
make the approach have the ``right'' behaviour.


In most cases, the environment will start at a local point in the DIT, and
move upwards.   Examples are given in Tables~\ref{e1} and~\ref{e2}.
Table~\ref{e1} shows an example for a typical local DUA, which has the
following characteristics:

\begin{description}
\item[One component] Assumed first to be a user in the department, then a
user or department within the university, the a national organisation, and
finally a country.
\item[Two components] Most significant component is first assumed to be a
national organisation, then a department (this might be reversed in some
organisations), and finally a country.
\item[Three or more components] The most significant component is first
assumed to be a country, then a national organisation, and finally a
department. 
\end{description}



\begin{table}
\begin{center}
\begin{tabular}{l l}
Number of & Environment \\
Components & \\
\hline
1 & Physics, University College London, GB  \\
& University College London, GB \\
& GB  \\
& -- \\
\hline
2 & GB  \\
& University College London, GB \\
& -- \\
\hline
3+  & -- \\
& GB  \\
& University College London, GB \\

\end{tabular}
\end{center}
\caption {Local environment for private DUA}
\label{e1}
\end{table}

\begin{table}
\begin{center}
\begin{tabular}{l l}
Number of & Environment \\
Components & \\
\hline
1,2  & US  \\
& CS \\
& -- \\
\hline
3+ & -- \\
& US \\
& CA \\
\end{tabular}
\end{center}
\caption {Local environment for US Public DUA}
\label{e2}
\end{table}


\subsection {Matching}


A purported name will be supplied, usually with a small number of components.
This will be matched in the context of an environment.

Where there are multiple components to be matched, these should be matched
sequentially.  If an unambiguous DN is determined, the match continues as if
the full DN had been supplied.  
For example if 

\begin{quote}
\begin{verbatim}
Stephen Kille, UCL
\end{verbatim}
\end{quote}

is being matched in the context of environment \verb|GB|, first \verb|UCL|
is resolved to the distinguished name:

\begin{quote}
\begin{verbatim}
University College London, GB
\end{verbatim}
\end{quote}

Then the next component of the purported name is taken to determine the
final name. 
If there is an ambiguity (e.g., if \verb|UCL| had made two matches,
both paths are explored to see if the ambiguity can
be resolved.  Eventually a set of names will be passed back to the user.


Each component of the environment is taken in turn.
If the purported name has more
components than the maximum depth, the environment element is skipped.  
The advantage of this will be seen in the example given later.






A match of a name is considered to have three levels:

\begin{description}
\item[Exact]  A DN is specified exactly
\item[Good]
Initially, a match should be considered good if
it is unambiguous, and exactly matches an attribute value in the entry.
For human names, a looser metric is probably desirable (e.g., \verb|S Kille|
should be a good match of \verb|S. Kille|, \verb|S.E. Kille| or \verb|Steve
Kille| even if these are not explicit alternate values).
\item[Poor] Any other substring or approximate match
\end{description}

Following a match, the reference can be followed, or the user prompted.
If there are multiple matches, more than one path may be followed.
There is also a shift/reduce type of choice: should any partial matches be
followed or should the next element of the environment be tried.
The following heuristics are suggested, which may be modified in the light
of experience.  The overall aim is to resolve cleanly specified names with a
minimum of fuss, but give sufficient user control to prevent undue searching
and delay.   

\begin{enumerate}
\item Always follow an exact match. 
\item Follow all good matches if there are no exact matches.
\item If there are only poor matches, prompt the user.  If the user accepts
one or more match, they can be considered as good.  If all are rejected,
this can be treated as no matches.
\item Automatically move to the next element of the environment if no
matches are found.   
\end{enumerate}

When the final component is matched, a set of names will be identified.
If none are identified, proceed to the next environment element.
If the user rejects all of the names, processing of the next environment
element should be confirmed.

The exact approach to matching will depend on the level of the tree at which
matching is being done.  
We can now consider how attributes are matched at various levels of the DIT.


There is an issue of approximate matching.  Sometimes it helps, and
sometimes just returns many spurious matches.  When a search is requested,
all relevant attributes should be returned, so that distinguished and
non-distinguished values can be looked at.  This will allow a distinction to
be made between good and poor matches.  It is important that where, for
example, an acronym exactly matches an organisation, that the user is not
prompted about other organisations where it matches as a substring.

\subsection {Top Level}

In this case, a match is being done at the root of the DIT.   
Three approaches are suggested, dependent on the length of supplied name.
All lead to a single level search of the top level of the DIT. 

\begin{description}
\item[Exactly 2] This is assumed to be a 3166 two letter country code, or an exact
match on a friendly country or organisation (e.g., UK or UN).  
Do exact match on country and friendly country.
\item[Greater than 2] Make 
an approximate and substring match on friendly country and organisation.

\end{description}






\subsection {Intermediate Level}

Once the root level has been dealt with, intermediate levels will be looking
for organisational components (Organisation, Locality, Org Unit). 
In some cases, private schema control will allow the system to determine
which is at the next level.  In general this will not be possible.
In each case, make  a substring  and approximate match search of one level.
The choice depends on the base object used in the search.

\begin {enumerate}
\item If DN has no Organisation or Locality, filter on Organisation and
Locality. 
\item If DN has Org Unit, filter on Org Unit.
\item If DN has Organisation, filter on Locality and Org Unit.
\item If DN has Locality, filter on Organisation.
\end {enumerate}

These allow some optimisation, based on legal choices of schema.  Keeping
filters short is usually desirable to improve performance.   

A few examples of this, where a base object has been determined (either by
being the environment or by partial resolution of a purported name), and the
next element of a purported name is being considered.  This will generate a
single level search.  What varies is the types being filtered against.
If the DN is:

\begin{verbatim}
University College London, GB
\end{verbatim}

The search should be for Org Unit or Locality.  If the DN is:

\begin{verbatim}
Organisation=UN
\end{verbatim}

the search should be for Org Unit or Locality.

There may be some improvements with respect to very short keys.  Not making
approximate or substring matches in these cases seems sensible\footnote{It
might be desirable to allow ``\verb|*|'' as a part of the purported name
notation}.

\subsection {Bottom level}

The ``Bottom Level'' is to deal with leaf entries in the DIT.  This will
often be a person, but may also be a role, an application entity or
something else.  

The last component of a purported name may either reference a leaf or
non-leaf.  For this reason, {\em both} should be tested for.
As a heuristic, if the base object for the search  has two or more
components it should be tested first as a bottom level name and then
intermediate.  Reverse this for shorter names.  This optimises for the
(normal) case
of non-leaves high up the tree and leaves low down the tree.

For bottom level names, make an
approximate and substring match against Common Name, Surname, and User ID.
Where common name is looked for, 
A full subtree search will be used when at
the second level of the DIT or lower, otherwise a single level search.

For example, if I have resolved a purported name to the distinguished name

\begin{verbatim}
University College London, GB
\end{verbatim}

and have a single component \verb|Bloggs|, this will generate a subtree
search.

\section{Examples}


This is all somewhat confusing, and a few examples are given.  These are all
in the context of the environment shown in Table~\ref{e1} on
Page~\pageref{e1}.

If ``Joe Bloggs'' is supplied, a subtree search of 

\begin{verbatim}
Physics, University College London, GB
\end{verbatim}

will be made, and the user prompted for ``Joseph Z. Bloggs'' as the only
possible match.

If ``Computer Science'' is supplied, first 

\begin{verbatim}
Physics, University College London, GB
\end{verbatim}

will be searched, and the user will reject the approximate match of ``Colin
Skin''.   Then a subtree search of

\begin{verbatim}
University College London, GB
\end{verbatim}

will be made, looking for a person.  Then a single level search will be made
looking for Org Unit, and 

\begin{verbatim}
Computer Science, University College London, GB
\end{verbatim}

will be returned without prompting (exact match).   


Supplying ``Steve Kille'' will lead to a failed subtree search of 

\begin{verbatim}
Physics, University College London, GB
\end{verbatim}

and lead straight to a subtree search of

\begin{verbatim}
University College London, GB
\end{verbatim}

This will lead to an exact value match, and so a single entry returned
without prompting.


If ``Andrew Findlay, Brunel'' is supplied, the first element of the
environment will be skipped, single level search of ``Brunel'' under ``GB'
will find:

\begin{verbatim}
Brunel University, GB
\end{verbatim}

and a subtree search for ``A Findlay'' initiated.   This will yield

\begin{verbatim}
Andrew Findlay, Computing and Media Services, Brunel University, GB

Dr A J Findlay, Manufacturing and Engineering Systems, 
Brunel University, GB
\end{verbatim}


and the user will be prompted with a choice.


This approach shows how a simple format of this nature will ``do the right
thing'' in many cases.   

\section {Support required from the standard}

Fortunately, all that is needed is there!  It would be useful to have
``friendly country name'' as a standard attribute.


\section{Support of OSI Services}

The major focus of this work has been to provide a mechanism for identifying
Organisations and Users.   A related function is to identify applications.
Where the Application is identified by an AET (Application Entity Title)
with an RDN of Common Name, this proposal leads to a natural usage.
For example, if a filestore in named ``gannet'', then this could easily be
identified by the name:

\begin{verbatim}
Gannet, Computer Laboratory, Cambridge University, GB   
\end{verbatim}

In normal usage, this might lead to access (using a purported name) of:

\begin{verbatim}
FTAM gannet,cambridge
\end{verbatim}

A second type of access is where the user identifies an Organisation
(Organisational Unit), and expects to obtain a default service.   
The service is implied by the application, and should not require any
additional naming as far as the user is concerned.  It is proposed that this
is supported by User Friendly Naming in the following way.  

\begin{enumerate}
\item Determine that the purported name identifies a non-leaf object, which is
of object class Organisation or Organistaional Unit or Locality.

\item Perform a {\em single level} search for Application Entities which
support the required application contexts.  This assumes that all services
which are supporting default access for the organisation are registred at
one level below (possibly by the use of aliaes), and that other services
(specific machines or parts of the organisation) are represented further
down the tree.  This seems to be a reasonable layout, and its utility can be
evaluated by experiment.

\end{enumerate}

\section {Experience}

An experimental implementation of this has been written by Colin Robbins.
The example in Figure~\ref{usage} shows that it can be very effective at
locating known individuals with a minimum of effort.

\begin{figure}
\begin{small}
\begin{verbatim}
-> t hales, csiro, australia
Found good match(es) for 'australia'
Found exact match(es) for 'csiro'
Please select from the following:
   Trevor Hales, OC, HPCC, DIT, IICT, CSIRO, AU [y/n] ? y
The following were matched...
   Trevor Hales, OC, HPCC, DIT, IICT, CSIRO, AU

-> g michaelson, queensland, au
Found exact match(es) for 'au'
Please select from the following:
   University of Queensland, AU [y/n] ? y
   Axolotl, AU [y/n] ? n
Please select from the following:
   George Michaelson, Prentice Computer Centre, University of Queensland, AU
[y/n] ? y
   Manager, University of Queensland, AU [y/n] ? n
The following were matched...
   George Michaelson, Prentice Computer Centre, University of Queensland, AU

-> r needham, cambridge
Found good match(es) for 'cambridge'
Please select from the following:
   Roger Needham, Computer Lab, Cambridge University [y/n] ? y
The following were matched...
   Roger Needham, Computer Lab, Cambridge University

-> kirstein
Found good match(es) for 'kirstein'
The following were matched...
   Peter Kirstein


\end{verbatim}
\end{small}
\caption {Example usage of User Friendly Naming}
\label{usage}
\end{figure}


\section {Relationship to other work}

Colin Robbin's work on the interface ``Tom'' and implementation of a
distribution list interface strongly influenced this proposal.

Some of the ideas used here originally came from a UK Proposal to the
ISO/CCITT Directory Group on ``New Name Forms'' \cite{NNF}.  This defined,
and showed how to implement, four different types of names:

\begin{description}
\item[Typed and Ordered] The current Distinguished Name is a restricted
example of this type of name.

\item[Untyped and Ordered] This is the type of name proposed here (with some
extensions to allow optional typing).  It is seen as meeting the key user
requirement of disliking typed names, and is efficient to implement.

\item[Typed and Unordered] This sort of name is proposed by others as the
key basis for user friendly naming.  Neufeld shows how X.500 can be used to
provide this \cite{Neufeld.Descr}, and Peterson proposes the Profile system
to provide this \cite{Profile.2}.  The author contends that whilst typed
naming is interesting for some types of searching (e.g., yellow page
searching), it is less desirable for
naming objects.  This is born out by operational experience with OSI
Directories \cite{THORN.RARE.LSPX}.

\item[Untyped and Unordered] Surprisingly this form of name can be supported
quite easily.  However, a considerably gain in efficiency can be achieved by
requiring ordering.  In practice, users can supply this easily.  Therefore,
this type of name is not proposed.   
\end{description}


\section {Proposal}

This approach should be tested out experimentally, with the QUIPU
implementation.  This will involve:

\begin{itemize}
\item An interface to generate strings in this form from DNs, with control over
\begin{itemize}
\item Folding (line, column, or max width)
\item Presence of commas at EOL
\item A DN to be abbreviated against
\end{itemize}

\item A routine to resolve a purported name in the context of an environment.
\end{itemize}

The Distribution List interface seems to be an ideal system to experiment
with.

A specification should then be progressed, and made widely available.
Perhaps as a RARE Document or RFC.

\bibliographystyle{alpha}
\bibliography{../../bib/sek}

\pagebreak
\appendix

\section{Pseudo-code for the matching algorithm}

The following pseudo-code is intended to clarify the matching algorithm.
The language uses ASN.1 data types, with flow control `C'-like, but with
keywords upper--cased.

\tagrindfile{algo}{Matching Algorithm}{algo}



\end {document}


unix.superglobalmegacorp.com

This archive runs on limited infrastructure. Preserving old code on modern bandwidth. Automated agents are requested to crawl responsibly.