Add straightforward endianness implementation; accommodate all allowable string encodings #26

cgobat · 2024-03-18T17:52:05Z

This PR ended up turning into a kind of rollup of several of the changes I've made in my fork to make the module usable. I am opening this as a draft for discussion purposes and as a sort of already-implemented feature(s) request. The major changes here are:

~~Addition of a BooleanParameterType class~~
Implementation of byte ordering/endianness checking
- While in an ideal world, spacecraft subsystems, etc. would all stick to big endian data types, in practice that is not the case. Hardcoding big endian types for all parameters is not a safe assumption, and the ability to handle both MSB- and LSB-first fields is a pretty fundamental/core functionality
- I noticed several notes in the code along the lines of TODO: implement ByteOrderList to inform endianness. I'm not sure what "ByteOrderList" is a reference to (a name idea for a new/future class?), but the way I've done it here is pretty clean (IMO) and seems to be fully transparent and backwards compatible in my limited testing. Happy to discuss if you had something else in mind for this though.
Addition of the complete set of XTCE-allowable character encoding values for StringDataEncodings, plus integration with the aforementioned endianness parsing for the cases where byte order is described by the string encoding

Checklist

Changes are fully implemented without dangling issues or TODO items
Deprecated/superseded code is removed or marked with deprecation warning
Current dependencies have been properly specified and old dependencies removed
- Note: I still think it is worth it to add an lxml dependency, but I will leave that for another PR
New code/functionality has accompanying tests and any old tests have been updated to match any new assumptions
The changelog.md has been updated

medley56 · 2024-03-18T18:50:32Z

@cgobat This is great work and I want to get these changes in. I think I will do the same thing as your last PR and take these changes and incorporate them into a different changeset that includes tests and omits a few pieces (e.g. I'm still torn on incorporating the lxml dependency and a lot of tests will need to be reworked for that I think). If you're concerned about claiming "credit" for the changes, I can try to teach myself how to continue work on your branch. Is it as simple as me making commits to your fork?

The BooleanParameterType was part of PR #24, along with the time parameter types so I think we'll pull those changes out of this PR.

On the subject of the ByteOrderList element, the XTCE green book element description is a little unclear on it. I think in XTCE 1.1 there was a ByteOrderList element that literally spelled out the byte significant for every byte in a data encoding. However, it looks like in 1.2 this may have been superseded by the mostSignificantByteFirst and leastSignificantByteFirst attributes that you are using. The explicit element-form of ByteOrderList appears to be still allowed in XTCE 1.2 but seems like an extremely niche use-case now that we have the attribute forms instead. In short, I think your implementation is sufficient.

See https://issues.omg.org/issues/XTCE12-400 for a discussion of the byte ordering changes between XTCE 1.1 and 1.2.

cgobat · 2024-03-19T01:17:04Z

If you're concerned about claiming "credit" for the changes, I can try to teach myself how to continue work on your branch. Is it as simple as me making commits to your fork?

It's not a huge deal to me. It might be nice, but only if it's not too inconvenient for you and your workflow. That said, GitHub does make it pretty easy to continue working on this branch—just git fetch origin pull/26/head:pr26 (assuming your remote is called origin; you can change pr26 to be whatever you want to call your own branch), then check out that new branch (e.g. git switch pr26). Since I enabled maintainer edits on this PR, you should be able to push.

The BooleanParameterType was part of PR #24

Ah, I had missed that, thanks. I removed the duplicate definition here.

See https://issues.omg.org/issues/XTCE12-400 for a discussion of the byte ordering changes between XTCE 1.1 and 1.2.

Thanks for the link. That provides some good insight.

Cheers!

medley56 · 2024-03-19T19:44:40Z

Awesome! Thanks for the tip on checking out PR branches from forks. I've always struggled with that.

The last piece here is unit test coverage for these new features. Do you want to tackle that? If not, I can start working on it, adding commits to this branch.

cgobat · 2024-03-22T03:40:55Z

I am pretty tight for time these days. I think it will definitely be ready sooner if you go ahead get started on the tests.

medley56 · 2024-03-24T16:57:52Z

Ok. I'll take over this PR branch and start working on tests.

Remove type annotation holdovers from lxml Remove duplicate definition of `BooleanParameterType` class Already added in medley56#24

greglucas · 2024-09-02T13:04:54Z

space_packet_parser/xtcedef.py

-            raise ValueError(
-                f"Got encoding={encoding}. Encoding must be one of utf-8, utf-16-le, or utf-16-be (note that"
-                f"endianness must be specified for utf-16 encoding.")
+        if encoding.upper() not in self._supported_encodings:


This will fail for the Windows-1252 comparison because it isn't uppercased in the _supported_encodings.

The flipping between incoming/upper/lower is a bit confusing through here. Can we not do the upper/lower conversions and rely on the user matching the spec? Or store this as upper-case on the self.encoding attribute and then special-case Windows later in the lookup.

I just went down a rabbit hole of defining canonicalized enums for all these string parameters, allowing aliases but coercing to standard representations, and just decided it's more complicated than it's worth. I'm OK with demanding strict adherence to the spec.

How do you feel about this when it comes to integer encodings? For example signed is not a valid encoding according to XTCE. It must be twosComplement.

space_packet_parser/xtcedef.py

cgobat mentioned this pull request Mar 18, 2024

Add more *ParameterType classes #20

Closed

medley56 mentioned this pull request Mar 19, 2024

Fix incorrect parsing of StringDataEncoding element #30

Open

medley56 force-pushed the implement-byteorder branch from 4e697f7 to 236eeba Compare September 1, 2024 16:05

Fix unit tests for lxml

d74bdcd

Remove type annotation holdovers from lxml Remove duplicate definition of `BooleanParameterType` class Already added in medley56#24

medley56 force-pushed the implement-byteorder branch from 2a1d9d2 to a79fe10 Compare September 2, 2024 04:58

medley56 requested a review from greglucas September 2, 2024 05:01

medley56 marked this pull request as ready for review September 2, 2024 05:01

medley56 self-requested a review as a code owner September 2, 2024 05:01

medley56 self-assigned this Sep 2, 2024

medley56 added this to the Version 5.0.0 Release milestone Sep 2, 2024

medley56 added the enhancement New feature or request label Sep 2, 2024

greglucas reviewed Sep 2, 2024

View reviewed changes

Add little endian signed and unsigned integer parsing and add tests

763cd2f

medley56 force-pushed the implement-byteorder branch from a79fe10 to 763cd2f Compare September 3, 2024 21:06

medley56 merged commit 3148a44 into medley56:main Sep 3, 2024
8 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add straightforward endianness implementation; accommodate all allowable string encodings #26

Add straightforward endianness implementation; accommodate all allowable string encodings #26

cgobat commented Mar 18, 2024 •

edited

Loading

medley56 commented Mar 18, 2024

cgobat commented Mar 19, 2024

medley56 commented Mar 19, 2024

cgobat commented Mar 22, 2024

medley56 commented Mar 24, 2024

greglucas Sep 2, 2024

medley56 Sep 3, 2024

Add straightforward endianness implementation; accommodate all allowable string encodings #26

Add straightforward endianness implementation; accommodate all allowable string encodings #26

Conversation

cgobat commented Mar 18, 2024 • edited Loading

Checklist

medley56 commented Mar 18, 2024

cgobat commented Mar 19, 2024

medley56 commented Mar 19, 2024

cgobat commented Mar 22, 2024

medley56 commented Mar 24, 2024

greglucas Sep 2, 2024

Choose a reason for hiding this comment

medley56 Sep 3, 2024

Choose a reason for hiding this comment

cgobat commented Mar 18, 2024 •

edited

Loading