IPP Mail Archive: Re: IPP>MOD add another issue [encoding of CompoundValue]

Mon Jan 12 16:18:42 EST 1998

This is a multi-part message in MIME format.
--------------03F1D2F48D1D31E99F75DE1F
Content-Type: text/plain; charset=us-ascii
Content-Transfer-Encoding: 7bit

> Although compoundValue can be made to work, its complexity will lead to
> bugs. Also its type is determined by looking at all of the tags of
> values that it contains. This suggests that we should look for a
> simpler-to-implement option.
>
> The most obvious solution is to add two new types text-language and
> name-language which are the langauge constrained versions of text and
> name.
>
Since we still have 'compound-attribute' in the protocol, couldn't we represent
'textWithLanguage' and 'nameWithLanguage' using that mechanism?

(The 'textWithLanguage' attribute syntax is a compound attribute syntax,  represented as an
OCTET_STRING consisting of 4 fields:  a) a SIGNED-SHORT which is the number of octets in the
following field  b) a value of type natural-language,  c) a SIGNED-SHORT which is the number
of octets in the following field,  d) a value of type text.  The length of  a
textWithLanguage value SHALL be 4 + the value of field a + the value of field c.).

I'm finding all these embedded lengths to be a pain to implement, and it seems redundant with
the logic we already have for compound-attribute.

-Carl Kugler

======================Original message=========================

> Re: IPP>MOD add another issue [encoding of CompoundValue]
>
> Robert Herriot (Robert.Herriot at Eng.Sun.COM)
> Tue, 25 Nov 1997 18:54:45 -0800
>
>    * Messages sorted by: [ date ][ thread ][ subject ][ author ]
>    * Next message: Carl-Uno Manros: "Re: IPP>IPP Last Call - need advance list of issues"
>    * Previous message: Marcia Beaulieu: "IPP> Re: IPP conflict with TLS in DC"
>
> I will try to explain the issue by giving more detail.
>
> The compoundValue has an integer value which specifies the number of
> following values that compose the compound value. There are two
> obvious ways to implement compoundValue in a general way:
>
> 1) recurse looking for additional values until the correct number
> is found or until a non-null attribute name is found or a delimiter
> tag is found. The latter two conditions are errors. This method
> works, but is tricky the "nested" values are really at the same
> level as other values in the protocol.
>
> 2) continue picking up values, but make a note that a compoundValue
> is being built. In this case, there must be a check when a
> non-null name is encountered, and when a delimiter tag is found
> for the error of a compoundValue is still being built.
> At first glance, this seems simpler, but it is easy to forget the
> checks mentioned above.
>
> Although compoundValue can be made to work, its complexity will lead to
> bugs. Also its type is determined by looking at all of the tags of
> values that it contains. This suggests that we should look for a
> simpler-to-implement option.
>
> The most obvious solution is to add two new types text-language and
> name-language which are the langauge constrained versions of text and
> name. Attributes with text and name values could also have a value of
> type text-language or name-language. Tom and others have suggested
> that language and text/name be separated by a single-quote character.
> It would work, but is not in the spirit of the current protocol which
> uses lengths instead of delimiting characters. So I suggest the value
> be <language length> <language string> <text/name string>. The length
> of the text/name string is length of the value minus ( language-length
> + 2).
>
> This solution is easier to parse because the components are contained
> with a single value.
>
> If we believe that tags are in short supply and that we don't want to
> allocate two values for such obscure types, we could create a tag type
> of "typed-octets" where the first byte of the value contains the
> sub-tag value which in our case would be either text-language or
> name-language. We could also have 2 bytes for the sub-tag rather than
> 1.
>
> > From hastings at cp10.es.xerox.com Mon Nov 24 10:46:48 1997
> >
> > As long as you've re-opened this issue, I'd like to add several
> > other alternatives into the mix. (A committee is better able to
> > pick between alternatives, than to design one on the fly).
> >
> > On the other hand, it may be better to live with the current scheme
> > than to try to pick a new one.
> >
> > At 19:48 11/21/1997 PST, Robert Herriot wrote:
> > >
> > >As I am implementing the CompoundValue, I am finding problems that make
> > >me think it should be changed. It requires too much special-casing and
> > >in its current form it will introduce bugs where the value of the
> > >CompoundValue exceeds the number of remaining attributes for the
> > >attribute name or attribute group. To avoid those bugs, checks have to
> > >be made in several places.
> >
> > Please explain this problem more.
> >
> > >
> > >I suggest we reexamine the other possible solutions, one simple with
> > >no room for extension, the other with room for extension.
> > >
> > > a) add two new value types: text-language and name-language each of which
> > > is a single value in the protocol but which consists of 4 subfields:
> > > a text/name length field, a text/name field, a language length field,
> > > and a language field, .
> > >
> > > b) add a single new type: compound-value which consists of a single value
> > > in the protocol but which consists of a value-tag field followed by
> > > any number of groups-of-three subfields. Each group-of-three
> > > consists of a value tag, a value length and a value. The text-language
> > > solution of a) is represented by a text-language tag, a text tag, a
> > > text length, a text value, a natural-language-tag, a natural-language
> > > length and a natural-language value.
> > >
> > >I prefer b) because it offers room for extension and an implementation can
> > >determine if it supports the compound value by examining the initial
> > >tag in the compoundValue.
> >
> > Here are my additional alternatives:
> >
> >
> > c) Amplify the 'text' and 'name' attribute syntaxes to allow a second
> > natural language override value to precede the actual value which indicates
> > the language of the immediately following value. The attribute syntax of
> > the first value, when present, is: 'naturalLanguage' as defined in the
> > current spec.
> >
> > Advantages: simple
> >
> > Disadvantages: a single-valued attribute sometimes has two values, making
> > the validation of single-valued attributes more adhoc. Also counting
> > attribute values is more adhoc.
> >
> >
> > d) have two data types for each of 'text' and 'name':
> > 'text' (same as current) and 'taggedText'
> > 'name' (same as current) and 'taggedName'
> >
> > The 'taggedText' and 'taggedName' data types use the RFC 2184 tagging
> > in the beginning of the data (but for language only, not charset)
> > to indicate natural language override:
> >
> > en'...
> > en-us'...
> >
> > to indicate English and U.S. English
> >
> > Those attributes which currently have 'text' and 'name' would
> > be changed to require support of both 'text' and 'taggedText'
> > and 'name' and 'taggedName'
> >
> > For example:
> >
> > job-name (name | taggedName)
> >
> > Advantages: most request/response instances would not need to use the
> > taggedText and taggedName in most interchanges.
> >
> > Disadvantages: clients and IPP objects would still have to support both
> > forms.
> >
> >
> > e) Change the spec for 'text' and 'name' to always require the RFC 2184
> > natural language prefix (but not charset).
> >
> > Advantages: simple, natural language tag is always stored with the data.
> > Only one protocol value for each attribute value.
> >
> > Disadvantages: tag has to be skipped over when processing or displaying
> > the data.
> >
> >
> > f) Same as e) except include the charset tag as well, so in full compliance
> > with RFC 2184 (same as we had in the Model document after the Atlanta
> > meeting). Example:
> >
> > us-ascii'en'...
> > utf-8'en-us'...
> >
> > Advantages: simple, charset and natural language tag is always stored
> > with the data. Only one protocol value for each attribute value.
> > IPP object doesn't need to charset convert data to a single charset.
> >
> > Disadvantage: tags have to be skipped over when processing or displaying
> > the data.
> >
> >
> > g) Add the dictionary attribute syntax that we postponed.
> >
> > Advantages: It is even more general than your alternative b) and is
> > something we have agreed is something we want. I'd hate to see us
> > put in something that is half a dictionary. I think that the dictionary
> > also fixes the length checking problem that the current CompoundValue
> > has, correct?
> >
> > Disadvantages: None.
> >
> > Tom
> >
> >
> >
> >
>
>    * Next message: Carl-Uno Manros: "Re: IPP>IPP Last Call - need advance list of issues"
>    * Previous message: Marcia Beaulieu: "IPP> Re: IPP conflict with TLS in DC"

http://www.pwg.org/hypermail/ipp/2831.html

--------------03F1D2F48D1D31E99F75DE1F
Content-Type: text/html; charset=us-ascii; name="2831.html"
Content-Transfer-Encoding: 7bit
Content-Disposition: inline; filename="2831.html"
Content-Base: "http://www.pwg.org/hypermail/ipp/2831.
	html"

<!-- received="Tue Nov 25 21:55:45 1997 EST" -->
<!-- sent="Tue, 25 Nov 1997 18:54:45 -0800" -->
<!-- name="Robert Herriot" -->
<!-- email="Robert.Herriot at Eng.Sun.COM" -->
<!-- subject="Re: IPP&gt;MOD add another issue [encoding of CompoundValue]" -->
<!-- id="199711260254.SAA28805 at woden.eng.sun.com" -->
<!-- inreplyto="IPP&gt;MOD add another issue [encoding of CompoundValue]" -->
<title>IPP Mail Archive: Re: IPP&gt;MOD add another issue [encoding of CompoundValue]</title>
<h1>Re: IPP&gt;MOD add another issue [encoding of CompoundValue]</h1>
Robert Herriot (<i>Robert.Herriot at Eng.Sun.COM</i>)<br>
Tue, 25 Nov 1997 18:54:45 -0800
<p>
<ul>
<li> <b>Messages sorted by:</b> <a href="date.html#2831">[ date ]</a><a href="index.html#2831">[ thread ]</a><a href="subject.html#2831">[ subject ]</a><a href="author.html#2831">[ author ]</a>
<!-- next="start" -->
<li> <b>Next message:</b> <a href="2832.html">Carl-Uno Manros: "Re: IPP&gt;IPP Last Call - need advance list of issues"</a>
<li> <b>Previous message:</b> <a href="2830.html">Marcia Beaulieu: "IPP&gt; Re: IPP conflict with TLS in DC"</a>
<!-- nextthread="start" -->
</ul>
<!-- body="start" -->
I will try to explain the issue by giving more detail.<br>
<p>
The compoundValue has an integer value which specifies the number of<br>
following values that compose the compound value.  There are two<br>
obvious ways to implement compoundValue in a general way:<br>
<p>
   1) recurse looking for additional values until the correct number<br>
      is found or until a non-null attribute name is found or a delimiter<br>
      tag is found. The latter two conditions are errors. This method<br>
      works, but is tricky the "nested" values are really at the same<br>
      level as other values in the protocol.<br>
<p>
   2) continue picking up values, but make a note that a compoundValue<br>
      is being built.  In this case, there must be a check when a<br>
      non-null name is encountered, and when a delimiter tag is found<br>
      for the error of a compoundValue is still being built.<br>
      At first glance, this seems simpler, but it is easy to forget the<br>
      checks mentioned above. <br>
<p>
Although compoundValue can be made to work, its complexity will lead to<br>
bugs.  Also its type is determined by looking at all of the tags of<br>
values that it contains.  This suggests that we should look for a<br>
simpler-to-implement option.<br>
<p>
The most obvious solution is to add two new types text-language and<br>
name-language which are the langauge constrained versions of text and<br>
name. Attributes with text and name values could also have a value of<br>
type text-language or name-language.  Tom and others have suggested<br>
that language and text/name be separated by a single-quote character.<br>
It would work, but is not in the spirit of the current protocol which<br>
uses lengths instead of delimiting characters. So I suggest the value<br>
be &lt;language length&gt; &lt;language string&gt; &lt;text/name string&gt;.  The length<br>
of the text/name string is length of the value minus ( language-length<br>
+ 2).  <br>
<p>
This solution is easier to parse because the components are contained<br>
with a single value.<br>
<p>
If we believe that tags are in short supply and that we don't want to<br>
allocate two values for such obscure types, we could create a tag type<br>
of "typed-octets" where the first byte of the value contains the<br>
sub-tag value which in our case would be either text-language or<br>
name-language. We could also have 2 bytes for the sub-tag rather than<br>
1.<br>
<p>
<i>&gt; From hastings at cp10.es.xerox.com Mon Nov 24 10:46:48 1997</i><br>
<i>&gt; </i><br>
<i>&gt; As long as you've re-opened this issue, I'd like to add several</i><br>
<i>&gt; other alternatives into the mix.  (A committee is better able to</i><br>
<i>&gt; pick between alternatives, than to design one on the fly).</i><br>
<i>&gt; </i><br>
<i>&gt; On the other hand, it may be better to live with the current scheme</i><br>
<i>&gt; than to try to pick a new one.</i><br>
<i>&gt; </i><br>
<i>&gt; At 19:48 11/21/1997 PST, Robert Herriot wrote:</i><br>
<i>&gt; &gt;</i><br>
<i>&gt; &gt;As I am implementing the CompoundValue, I am finding problems that make</i><br>
<i>&gt; &gt;me think it should be changed. It requires too much special-casing and</i><br>
<i>&gt; &gt;in its current form it will introduce bugs where the value of the</i><br>
<i>&gt; &gt;CompoundValue exceeds the number of remaining attributes for the</i><br>
<i>&gt; &gt;attribute name or attribute group.  To avoid those bugs, checks have to</i><br>
<i>&gt; &gt;be made in several places.</i><br>
<i>&gt; </i><br>
<i>&gt; Please explain this problem more.</i><br>
<i>&gt; </i><br>
<i>&gt; &gt;</i><br>
<i>&gt; &gt;I suggest we reexamine the other possible solutions, one simple with </i><br>
<i>&gt; &gt;no room for extension, the other with room for extension.</i><br>
<i>&gt; &gt;</i><br>
<i>&gt; &gt;  a) add two new value types: text-language and name-language each of which</i><br>
<i>&gt; &gt;     is a single value in the protocol but which consists of 4 subfields:</i><br>
<i>&gt; &gt;     a text/name length field, a text/name field, a language length field, </i><br>
<i>&gt; &gt;     and a language field, .</i><br>
<i>&gt; &gt;</i><br>
<i>&gt; &gt;  b) add a single new type: compound-value which consists of a single value</i><br>
<i>&gt; &gt;     in the protocol but which consists of a value-tag field followed by </i><br>
<i>&gt; &gt;     any number of groups-of-three subfields. Each group-of-three </i><br>
<i>&gt; &gt;     consists of a value tag, a value length and a value. The text-language </i><br>
<i>&gt; &gt;     solution of a) is represented by a text-language tag, a text tag, a </i><br>
<i>&gt; &gt;     text length, a text value, a natural-language-tag, a natural-language</i><br>
<i>&gt; &gt;     length and a natural-language value.</i><br>
<i>&gt; &gt;</i><br>
<i>&gt; &gt;I prefer b) because it offers room for extension and an implementation can</i><br>
<i>&gt; &gt;determine if it supports the compound value by examining the initial</i><br>
<i>&gt; &gt;tag in the compoundValue.</i><br>
<i>&gt; </i><br>
<i>&gt; Here are my additional alternatives:</i><br>
<i>&gt; </i><br>
<i>&gt;  </i><br>
<i>&gt; c) Amplify the 'text' and 'name' attribute syntaxes to allow a second</i><br>
<i>&gt; natural language override value to precede the actual value which indicates </i><br>
<i>&gt; the language of the immediately following value.  The attribute syntax of </i><br>
<i>&gt; the first value, when present, is: 'naturalLanguage' as defined in the</i><br>
<i>&gt; current spec.</i><br>
<i>&gt; </i><br>
<i>&gt; Advantages:  simple</i><br>
<i>&gt; </i><br>
<i>&gt; Disadvantages:  a single-valued attribute sometimes has two values, making</i><br>
<i>&gt; the validation of single-valued attributes more adhoc.  Also counting</i><br>
<i>&gt; attribute values is more adhoc. </i><br>
<i>&gt; </i><br>
<i>&gt; </i><br>
<i>&gt; d) have two data types for each of 'text' and 'name': </i><br>
<i>&gt;    'text' (same as current) and 'taggedText'</i><br>
<i>&gt;    'name' (same as current) and 'taggedName'</i><br>
<i>&gt; </i><br>
<i>&gt; The 'taggedText' and 'taggedName' data types use the RFC 2184 tagging</i><br>
<i>&gt; in the beginning of the data (but for language only, not charset)</i><br>
<i>&gt; to indicate natural language override:</i><br>
<i>&gt; </i><br>
<i>&gt;    en'...</i><br>
<i>&gt;    en-us'...</i><br>
<i>&gt; </i><br>
<i>&gt; to indicate English and U.S. English</i><br>
<i>&gt; </i><br>
<i>&gt; Those attributes which currently have 'text' and 'name' would</i><br>
<i>&gt; be changed to require support of both 'text' and 'taggedText'</i><br>
<i>&gt; and 'name' and 'taggedName'</i><br>
<i>&gt; </i><br>
<i>&gt; For example:</i><br>
<i>&gt; </i><br>
<i>&gt;   job-name (name | taggedName)</i><br>
<i>&gt; </i><br>
<i>&gt; Advantages:  most request/response instances would not need to use the </i><br>
<i>&gt; taggedText and taggedName in most interchanges.</i><br>
<i>&gt; </i><br>
<i>&gt; Disadvantages:  clients and IPP objects would still have to support both</i><br>
<i>&gt; forms.</i><br>
<i>&gt; </i><br>
<i>&gt; </i><br>
<i>&gt; e) Change the spec for 'text' and 'name' to always require the RFC 2184</i><br>
<i>&gt; natural language prefix (but not charset).</i><br>
<i>&gt; </i><br>
<i>&gt; Advantages:  simple, natural language tag is always stored with the data.</i><br>
<i>&gt; Only one protocol value for each attribute value.</i><br>
<i>&gt; </i><br>
<i>&gt; Disadvantages:  tag has to be skipped over when processing or displaying</i><br>
<i>&gt; the data.</i><br>
<i>&gt; </i><br>
<i>&gt; </i><br>
<i>&gt; f) Same as e) except include the charset tag as well, so in full compliance</i><br>
<i>&gt; with RFC 2184 (same as we had in the Model document after the Atlanta </i><br>
<i>&gt; meeting).  Example:</i><br>
<i>&gt; </i><br>
<i>&gt;   us-ascii'en'...</i><br>
<i>&gt;   utf-8'en-us'...</i><br>
<i>&gt; </i><br>
<i>&gt; Advantages:  simple, charset and natural language tag is always stored </i><br>
<i>&gt; with the data.  Only one protocol value for each attribute value.</i><br>
<i>&gt; IPP object doesn't need to charset convert data to a single charset.</i><br>
<i>&gt; </i><br>
<i>&gt; Disadvantage:  tags have to be skipped over when processing or displaying</i><br>
<i>&gt; the data.</i><br>
<i>&gt; </i><br>
<i>&gt; </i><br>
<i>&gt; g) Add the dictionary attribute syntax that we postponed.</i><br>
<i>&gt; </i><br>
<i>&gt; Advantages:  It is even more general than your alternative b) and is</i><br>
<i>&gt; something we have agreed is something we want.  I'd hate to see us</i><br>
<i>&gt; put in something that is half a dictionary.  I think that the dictionary</i><br>
<i>&gt; also fixes the length checking problem that the current CompoundValue</i><br>
<i>&gt; has, correct?</i><br>
<i>&gt; </i><br>
<i>&gt; Disadvantages:  None.</i><br>
<i>&gt; </i><br>
<i>&gt; Tom</i><br>
<i>&gt; </i><br>
<i>&gt; </i><br>
<i>&gt; </i><br>
<i>&gt; </i><br>
<!-- body="end" -->
<p>
<ul>
<!-- next="start" -->
<li> <b>Next message:</b> <a href="2832.html">Carl-Uno Manros: "Re: IPP&gt;IPP Last Call - need advance list of issues"</a>
<li> <b>Previous message:</b> <a href="2830.html">Marcia Beaulieu: "IPP&gt; Re: IPP conflict with TLS in DC"</a>
<!-- nextthread="start" -->
</ul>

--------------03F1D2F48D1D31E99F75DE1F--