[DFDL-WG] Agenda for OGF DFDL WG call 24 February 2010- 13:00 UK (8:00 ET)
Alan Powell
alan_powell at uk.ibm.com
Wed Feb 24 04:38:26 CST 2010
1. Remaining 037 review issues
A:
16.2 scannablility with lengthKind pattern:
Confirm that this is what we agreed
In summary, you can use a data pattern on any element (complex, simple
text, simple binary) as long as the bytes are legal in the stated
encoding, which where binary data is involved in practice means an 8-bit
ASCII encoding.
By 8-bit ASCII we really mean an encoding where all the codepoints from
0-255 map to the equivalent value. Subsequent investigation indicates that
'all' 8-bit ASCII encodings have gaps so there isn't a valid character.
Mike has suggested
1) for all ascii-based character sets, we say that bytes 0x00 to 0xFF all
map to exactly those codepoints in ISO 10646 for the infoset, and vice
versa.
2) define dfdl:encoding="bytes" as a special character set name which has
the above property.
Briefly discussed but no conclusion.
B:
Glossary
Variable-Occurrence Item - Optional elements have a variable number of
occurrences (0 or 1) and arrays also can have a variable number of
occurrences (when minOccurs < maxOccurs). So when we say an item with a
variable number of occurrences, this can mean either an optional element,
or an array where minOccurs < maxOccurs. In either array or optional
elements, we have the additional constraint that the DFDL representation
properties do not preclude a variable number of occurrences. When
dfdl:occursCountKind='explicit' and dfdl:occursCount has a literal
constant as its value, or an expression that statically evaluates to a
constant, then the DFDL properties are specifying exactly the number of
occurrences for all instances and so are said to preclude a variable
number of occurrences. If dfdl:occursCount has a formula as its expressed
value, then the DFDL properties do not preclude a variable number of
occurrences.
MikeB Comment:
This idea that you can have minOccurs < maxOccurs, but dfdl:occurs is
equal to a constant and dfdl:occursKind="explicit" is causing us a bunch
of grief in these definitions.
Can we be conservative and just say it is a schema definition error if
minOccurs < maxOccurs but the length is static, i.e., an explicit
constant-valued expression?
C:
DFDL Schema Component Model
What needs to be changed in the UML diagram?
D:
Sequence Groups
Mike B:
TBD: rewrite these property descriptions in terms of the grammar for
sequences
Specifically, this is where the FinalUnused Region must be described.
E: Check other comments in document.
2. Go through Actions
Current Actions:
No
Action
049
20/05 AP Built-in specification description and schemas
03/06: not discussed
24/06: No Progress
24/06: No Progress (hope to get these from test cases)
15/07: No progress. Once available, the examples in the spec should use
the dfdl:defineFormat annotations they provide.
...
14/10: no progress
21/10: Discussed the real need for this being in the specification. It
seemed that the main value is it define a schema location for downloading
'known' defaults from the web.
28/10: no progress
04/11: no progress
11/11: no update
18/11: no update
25/11: Agreed to try to produce for CSV and fixed formats
04/12: no update
09/12: no update
16/12: no update
23/12: no update
06/01: no progress. If there is no resource to complete this action it can
be deferred
13/01:no progress
20/01: no progress
27/01: no progress
29/01: No progress. The predefined formats do not need to be available
when the spec is published.
Suman said that he had been mapping COBOL structures to DFDL and it didn't
look as though the way text numbers are define is very usable. He will
document for next call
03/02: No progress
10/02: No progress
17/03: No progress
066
Investigate format for defining test cases
25/11:IBM to see if it is possible to publish its test case format.
04/12: no update
09/12: no update
16/12: reminded dent to project manager
23/12: SH will send another reminder.
06/01: Another reminder will be sent
13/01: no update
20/01: no update
27/01: no progress
29/01: no progress
03/02: IBM is still investigating
10/02: IBM is still investigating
17/02: IBM is willing in principle to publish the test case format and
some of the test cases. May need some time to build a 'compliance suite'
079
MB:Encoding for binary fields when lenghtkind is pattern
17/02: Discussed but no conclusion
080
AP:Clarify semantics of fn:poisition and fn:count
17/02: no progress
081
AP: Inf and Nan
The description is the way ICU behaves but need clarification. It isn't
clear how inf and Nan are represented in the infoset. Need to investigate
if XML allows these values
17/02: XML allows Nan and inf for float and double Dfdl will do the same.
Requires more investigation of ICU.
3 DFDL v1 Specification completion.
Draft 039 will be publish today.
WG review and Comments by 3 March
Draft 40 with updates for OGF submission - available 5 March
Regards
Alan Powell
Development - MQSeries, Message Broker, ESB
IBM Software Group, Application and Integration Middleware Software
-------------------------------------------------------------------------------------------------------------------------------------------
IBM
MP211, Hursley Park
Hursley, SO21 2JN
United Kingdom
Phone: +44-1962-815073
e-mail: alan_powell at uk.ibm.com
Unless stated otherwise above:
IBM United Kingdom Limited - Registered in England and Wales with number
741598.
Registered office: PO Box 41, North Harbour, Portsmouth, Hampshire PO6 3AU
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://www.ogf.org/pipermail/dfdl-wg/attachments/20100224/76c0b7f7/attachment-0001.html
-------------- next part --------------
A non-text attachment was scrubbed...
Name: not available
Type: image/gif
Size: 25343 bytes
Desc: not available
Url : http://www.ogf.org/pipermail/dfdl-wg/attachments/20100224/76c0b7f7/attachment-0001.gif
More information about the dfdl-wg
mailing list