igb: Optimize TX path
Reduce the number of TX ring status reports: at most 16 reports are
requested for every TX-descriptor-count descriptors transmitted. It is
unnecessary to report status for every TX descriptor. This could
greatly reduce bus traffic.
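The report throttling described above can be sketched roughly as
follows. This is only an illustration, not the driver code: the struct
and function names (tx_ring_sketch, tx_ring_sketch_want_report) are
hypothetical, and the only value taken from this commit is the limit of
16 reports per ring's worth of descriptors.

```c
#include <stdbool.h>
#include <stdint.h>

/* Limit stated in the commit message: 16 reports per ring of descriptors. */
#define TX_REPORTS_PER_RING	16

/* Hypothetical bookkeeping; not the driver's real TX ring structure. */
struct tx_ring_sketch {
	uint32_t ndesc;			/* TX descriptor count */
	uint32_t report_interval;	/* descriptors between two reports */
	uint32_t pending;		/* descriptors queued since last report */
};

static void
tx_ring_sketch_init(struct tx_ring_sketch *r, uint32_t ndesc)
{
	r->ndesc = ndesc;
	r->report_interval = ndesc / TX_REPORTS_PER_RING;
	if (r->report_interval == 0)
		r->report_interval = 1;	/* very small ring: report always */
	r->pending = 0;
}

/*
 * Called once per packet with the number of descriptors it uses.
 * Returns true if the packet's last descriptor should ask the
 * hardware for a status report.
 */
static bool
tx_ring_sketch_want_report(struct tx_ring_sketch *r, uint32_t nsegs)
{
	r->pending += nsegs;
	if (r->pending >= r->report_interval) {
		r->pending = 0;
		return true;
	}
	return false;
}
```

Since a report is requested only after at least report_interval
descriptors have accumulated, a ring of 1024 descriptors filled with
single-descriptor packets asks for a report on every 64th packet.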
Use "Transmit Completions Head Write Back" as mentioned in the datasheet.
In this model, TX descriptors are no longer written by hardware, thus
cache thrashing is avoided. This also greatly reduces the complexity of
igb_txeof.
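With head write-back, completion processing reduces to reading one
index from host memory and reclaiming everything up to it. A minimal
sketch under stated assumptions: the names below (tx_reap_sketch,
hdr_wb, next_to_clean) are hypothetical and this is not the actual
igb_txeof; buffer freeing is only indicated by a comment.

```c
#include <stdint.h>

/* Hypothetical state; field names are illustrative only. */
struct tx_reap_sketch {
	uint32_t ndesc;			/* TX descriptor count */
	uint32_t next_to_clean;		/* first descriptor not yet reclaimed */
	uint32_t avail;			/* free descriptors */
	volatile uint32_t *hdr_wb;	/* head index written back by hardware */
};

/* Reclaim descriptors up to the written-back head; returns the count. */
static uint32_t
tx_reap_sketch(struct tx_reap_sketch *r)
{
	uint32_t head = *r->hdr_wb;	/* single read, no descriptor access */
	uint32_t reclaimed = 0;

	if (head >= r->ndesc)
		return 0;		/* ignore an out-of-range value */

	while (r->next_to_clean != head) {
		/* A real driver would free the transmit buffer here. */
		if (++r->next_to_clean == r->ndesc)
			r->next_to_clean = 0;	/* wrap around the ring */
		r->avail++;
		reclaimed++;
	}
	return reclaimed;
}
```

The loop never reads a descriptor, which is the reason the descriptor
cache lines are no longer bounced between the device and the CPU.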
Implementation notes for "Transmit Completions Head Write Back":
- HWBTHRESH is not used, since:
  o 82575 does not support it
  o The number of status reports is already greatly reduced
- WB_on_EITR is not used, since:
  o 82575 does not support it
  o It will cause unnecessary head write-back
Performance is almost the same as with the previous code:
- 1.48Mpps for 18-byte UDP datagrams
- Line rate for 1472-byte UDP datagrams and TCP streams