Wireshark · Wireshark-dev: Re: [Wireshark-dev] On building better statistics

Wireshark-dev: Re: [Wireshark-dev] On building better statistics

From: João Valverde <j@xxxxxx>

Date: Tue, 15 Feb 2022 12:20:52 +0000



On 14/02/22 15:34, João Valverde wrote:

Hi Jaap,

If I understand correctly I think the numbers are correct by design.
When viewing packet details the analysis is almost always on theprotocol header. In this case that's what the size represents andthat's what I would expect.
I don't typically use Protocol Hierarchy statistics but I thinkcounting total protocol size (header + payload) is a lot moreinteresting and useful. That matches my understanding of PDU size,which is presumably what I'm trying to look at with this statistic.
I think there is still a bug lurking with fixed-size headers vsvariable size headers that needs to be fixed, to match the currentbehavior.

Fixed-length headers are under-counting (as I understand the statisticdescribed in the User's Guide) but the fact that variable lengthprotocols work seems accidental. I think they are over-counting if thereis a trailer present.

And also, it's not really correct to include IP inside ICMP as IP bytes,but that's another issue entirely.

On 13/02/22 23:18, Jaap Keuter wrote:
This discussion was brought on by issue 17877 titled “Non-visiblephoto items cannot change length after construction”. In there it iscorrectly stated that calls to set proto_item length (eitherproto_item_set_len() or proto_item_set_end()) are not effective whenproto_items are not visible. When looking at these functions it canbe seen that these are part of the group of functions which areoptimised for faking proto_items when not visible.
Without going into the faking details (see the macroTRY_TO_FAKE_THIS_ITEM_OR_FREE in epan/proto.c for that) it comes downto just short cutting the proto_item creation process by returningthe tree intended to attach the newly to be constructed proto_itemto. In effect these all return the root node of the dissection tree.
A special case exception is put in place for proto_items of typeFT_PROTOCOL. EPAN can be setup so that proto_items of this type infact are allowed to be created, even if they otherwise would havebeen faked. This feature is used for protocol level statistics. Theprotocol level statistics ’tally the numbers’ by running thedissection, with a hidden tree, but with the special case exceptionset. This results in a very compact dissection tree, consisting ofthe root node and the proto_items of type FT_PROTOCOL. The length ofthese items is then used to determine the numbers to add to thevarious protocol layers in the statistics tree.
This is where the ambiguity comes in. Some protocols claim just thereshare of the octets in the frame (discussing Ethernet packetdissection here, to keep it simple). Others create their proto_itemuntil the end of the TVB handed to them (usually to the end of theframe), and adjust the length after dissection of their fields havetaken place and the variable number of bytes in the protocol layerhas been determined. However, the functions to set these lengthsdon’t work when faking items is in effect.
As a result these protocols take up way more of the frame in thestatistics than they in fact do. Overall more the 100% of the frameis allotted to the protocols contained in them. The User's Guide goesinto this fact with the explanation that these protocol 'contain’their payload, so that is why the payload is added to the protocol.That is one interpretation, but not really consistent because fixedsize dissectors, which create their proto_items of type FT_PROTOCOLwith fixed size, do not exhibit this behaviour.
The simplest step to take would be to allow the functionsproto_item_set_len() and proto_item_set_end() to operate onproto_items of type FT_PROTOCOL if the afore mentioned special caseexception was in effect. However, since faking of other types ofproto_items is still in effect, all these other proto_items are nowusing the proto_items of type FT_PROTOCOL as proto_item, rather thanthe root node. This means that code setting the length of a field, isnow also no longer blocked, and in fact setting the length of theproto_item of type FT_PROTOCOL rather than his own (which is faked).
A simple experiment with the file (code_mac_tagged.pcap) attached toissue 17877 makes this clear. Changing proto_items_set_len() to allowproto_items with type FT_PROTOCOL to set their length if the specialcase exception is set, shows a protocol hierarchy statistics pagewith all protocols matching their length in the dissection detailspane. Except for COSE, the dissection details say 309 while thestatistics say 246. This last length is the length set for the finalpart of the PDU, which ends up becoming the length of the protocol inthe protocol hierarchy statistics.
The question now becomes how to proceed with this. Faking proto_itemsmakes legitimate calls to set the length of proto_items of typeFT_PROTOCOL indistinguishable from those calls meant for fieldswithin those protocols. Another approach of faking proto_items byalways returning the root node instead of the tree creates it’s ownset of side effects. Now all ’non-protocol’ proto_items are processedfor statistics too, and exposing things like expert info in thestatistics also. This defeats the purpose of faking proto_items inthe first place.
A ‘quick and dirty’ solution is to run dissection with a visibletree. This gives the desired results, but at the cost of doing a lotmore dissection work than strictly necessary, voiding the wholepurpose of the special case exception in not faking proto_items withtype FT_PROTOCOL.
I’m not seeing a simple way out of this. Do we want to modify thestatistics in this way is the fist question to answer. They will bedifferent than in 3.6, 3.4 and earlier. This could be the right timeto do it though. Then the question becomes how to achieve thiswithout taking a significant performance hit by sidestepping thefaking of proto_items.
Thanks,
Jaap
(not rereading this draft, it’s way too late for that. Sorry for anymistyping or confusing statements)
___________________________________________________________________________
Sent via:    Wireshark-dev mailing list <wireshark-dev@xxxxxxxxxxxxx>
Archives:    https://www.wireshark.org/lists/wireshark-dev
Unsubscribe: https://www.wireshark.org/mailman/options/wireshark-dev
mailto:wireshark-dev-request@xxxxxxxxxxxxx?subject=unsubscribe
___________________________________________________________________________
Sent via:    Wireshark-dev mailing list <wireshark-dev@xxxxxxxxxxxxx>
Archives:    https://www.wireshark.org/lists/wireshark-dev
Unsubscribe: https://www.wireshark.org/mailman/options/wireshark-dev
mailto:wireshark-dev-request@xxxxxxxxxxxxx?subject=unsubscribe

Follow-Ups:
- Re: [Wireshark-dev] On building better statistics
  - From: Jaap Keuter

References:
- [Wireshark-dev] On building better statistics
  - From: Jaap Keuter
- Re: [Wireshark-dev] On building better statistics
  - From: João Valverde

Prev by Date: Re: [Wireshark-dev] On building better statistics
Next by Date: Re: [Wireshark-dev] On building better statistics
Previous by thread: Re: [Wireshark-dev] On building better statistics
Next by thread: Re: [Wireshark-dev] On building better statistics
Index(es):
- Date
- Thread