Wireshark · Wireshark-dev: Re: [Wireshark-dev] Calling a dissector: Type for data parameter

Wireshark-dev: Re: [Wireshark-dev] Calling a dissector: Type for data parameter

From: João Valverde <joao.valverde@xxxxxxxxxxxxxxxxxx>

Date: Tue, 22 Jun 2021 02:28:09 +0100



On 22/06/21 01:26, John Thacker wrote:

On Mon, Jun 21, 2021 at 2:21 PM João Valverde via Wireshark-dev<wireshark-dev@xxxxxxxxxxxxx <mailto:wireshark-dev@xxxxxxxxxxxxx>> wrote:
    On 16/06/21 15:36, David Perry wrote:
     > Sorry to drag up an old topic, but I've been thinking about this:
     >
     >> Message: 5
     >> Date: Sat, 29 May 2021 09:32:29 +0200
     >> From: Anders Broman <a.broman58@xxxxxxxxx
    <mailto:a.broman58@xxxxxxxxx>>
     >> To: Developer support list for Wireshark
    <wireshark-dev@xxxxxxxxxxxxx <mailto:wireshark-dev@xxxxxxxxxxxxx>>
     >> Subject: Re: [Wireshark-dev] Calling a dissector: Type for data
     >>     parameter
     >> Message-ID:
     >>
         <CAOpyz=zDycm33PXUwtBCTew7gTTEcSLiJ-f8SHW0L-863Q517A@xxxxxxxxxxxxxx <mailto:zDycm33PXUwtBCTew7gTTEcSLiJ-f8SHW0L-863Q517A@xxxxxxxxxxxxxx>>
     >> Content-Type: text/plain; charset="utf-8"
     >>
     >> Hi,
     >> Yes the method is fragile. At the time of development I think it was
     >> proposed to pass a struct containing a string and the void
    pointer where
     >> the string could be used as a identifier. But that was voted down.
     >> Regards
     >> Anders
     >
     > I wasn't around for that discussion so I don't know the reasons, but
     > how does this sound as a refined approach?:
     >
     > * Define a `dissector_data_t` that has a `guint32` identifier field,
     > and a `void *` data field.
     >
     > * Replace the `void *data` parameter to dissectors with a pointer
    to a
     > `dissector_data_t`.
     >
     > * Either:
     >
     >     * Easy way: maintain a static list of identifiers that map to
     > expected data types, or
     >
     >     * Have dissector X request an identifier in its registration
     > function for the type of data it expects, and have dissector Y
    (which
     > will call X) request, in its handoff function, the identifier of the
     > type of data it needs to pass to X.
     >
     > * Dissectors check for the right identifier in their
     > `dissector_data_t` parameter and don't try to use it if it's wrong.
     >
     > Thoughts?
     >

    I think what you suggest would be the most straightforward fix.

    To avoid breaking backward compatibility and changing thousands of
    dissectors at the same time, both of which are highly problematic, it
    can be done by adding a new dissector type (like it was done with
    "dissector_cb_t", only using a different signature).[1]

    Also a giant static list of dissector_data_t type identifiers would be
    pretty clunky. I think we could recycle the protocol registration
    number
    for that.
Perhaps I don't quite understand, but what would be the point if theprotocol registration number were used? Presumably that is the numberfor the called protocol, based on what David outlined (the calledprotocol registering what data it expects.) But the calling dissectorwould always have that number (via dissector_handle_get_protocol_index()) and pass it in, which wouldn't provide any guarantee that the datapassed in was the correct type than what is being done now.
The only way that I see it would make sense to pass in an identifier isif a protocol registers multiple data types it might expect to be passedin when called from different types (whether in one dissect_proto()function or multiple ones), in which case the protocol registrationnumber couldn't be used, or if the identifier is instead related to thecalling protocol and controlled by it (which is perhaps for this methodof calling the wrong dependency direction, unlike with dissector tableswhere the calling protocol does control the passed data type, e.g.packet-ip always passes a ws_ip4* to the "ip proto" table or itsheuristic subdissector table.)
That doesn't sound like what's being proposed, though, so I am confused.

The way I proposed it the identifier is related to the calling protocoland controlled by it. It refers to the first suggestion above: "Easyway: maintain a static list of identifiers that map to expected data types"

Instead of having the biggest enum ever of static data identifiers wecan use the protocol registration number. There would probably be otherfields too, like version and flags, TBD).

Dissectors registering on the "ip.proto" table can receive a ws_ip4 or aws_ip6, the first byte is the type, so I don't follow your exampleeither. It's not really true that there is always a ws_ip4 pointer.

John Thacker

___________________________________________________________________________
Sent via:    Wireshark-dev mailing list <wireshark-dev@xxxxxxxxxxxxx>
Archives:    https://www.wireshark.org/lists/wireshark-dev
Unsubscribe: https://www.wireshark.org/mailman/options/wireshark-dev
              mailto:wireshark-dev-request@xxxxxxxxxxxxx?subject=unsubscribe

Follow-Ups:
- Re: [Wireshark-dev] Calling a dissector: Type for data parameter
  - From: John Thacker

References:
- Re: [Wireshark-dev] Calling a dissector: Type for data parameter
  - From: David Perry
- Re: [Wireshark-dev] Calling a dissector: Type for data parameter
  - From: João Valverde
- Re: [Wireshark-dev] Calling a dissector: Type for data parameter
  - From: John Thacker

Prev by Date: Re: [Wireshark-dev] Calling a dissector: Type for data parameter
Next by Date: Re: [Wireshark-dev] Calling a dissector: Type for data parameter
Previous by thread: Re: [Wireshark-dev] Calling a dissector: Type for data parameter
Next by thread: Re: [Wireshark-dev] Calling a dissector: Type for data parameter
Index(es):
- Date
- Thread