Skip to content

Commit

Permalink
WiFiClient::abort() (#8738)
Browse files Browse the repository at this point in the history
Api for saving heap when Client class is used by a Server (WiFiServer class): Client = Server.available().

Suppose the local end is the server and the remote end is the client, we will deal with heap memory at the local end.

When the local application (server) decides to close an active connection with a remote end it issues an Client.stop.
The stop() function calls the close() function of ClientContext class which in turn calls tcp_close.
The connexion is closed by tcp_close and the protocol control block (pcb) can be put in the following states depending on the requests sent by the remote: CLOSING, FIN_WAIT_1 and FIN_WAIT_2. In theses states pcbs are not freed, then consume some memory heap.
If an acknowledgment from the remote end is received, the pcb enter in TIME_WAIT state for some minutes but pcbs in TIME_WAIT state are not freed. Then consume some heap memory.
TIME_WAIT pcbs are automatically freed after some minutes or can be freed for instance issuing an tcp_kill_timewait()
in the local application which will free the oldest pcb in TIME_WAIT state.

If the connection is first closed from the remote end (the client), the local end (server) receive a connection termination request. It then acknowledge it and enter in CLOSE_WAIT state waiting for a connection termination request from the local application.
It then send a termination request and enter in LAST_ACK state until it receive an acknowledgment from the remote end.
After receiving the acknowledgment it enter in ClOSED state and the local pcb is freed leaving some room in the heap memory.

To summarize, when a connexion termination request is send by one end (remote or local), the local pcb is not freed immediatly.
This pcb can be in the following states: FIN_WAIT_1, FIN_WAIT_2, CLOSING, TIME_WAIT, CLOSE_WAIT, LAST_ACK.
As a consequence, some old pcbs from old closed connections are still consuming heap memory.

The local application can call tcp_kill_timewait hoping it will free some TIME_WAIT state pcbs. But if the server receive frequent connections requests and close them after sending whatever it has to send, there may be zero pcbs in TIME_WAIT state among all previously closed connections.

In case of insufficient memory to accept a new connection, lwip has developped a strategy: it successively tries to kill the oldest pcb in TIME_WAIT state, or in LAST_ACK state or in CLOSING state or the oldest active connection with lower priority than the new one.

As a matter of fact this "urgent" strategy is deployed only when very few heap memory remain available (less than some kb). In case of success, Client.available returns a valid Client but the local application will crash when sending or receiving data from the client (Client.read ou readuntil or available) because this need more heap memory and just some kb were freed in lwip to allocate the new pcb structure ans start the new connection.

The propose API is intended to avoid this drawback by calling the abort function of ClientContext which in turn calls tcp_abort which calls tcp_abandon. The connection is aborted and notified to the client with a RESET and the pcb and ressources associated are immediately released increasing the available heap memory.
  • Loading branch information
RobertGnz committed Dec 6, 2022
1 parent 821ccde commit 3c6db4e
Show file tree
Hide file tree
Showing 3 changed files with 45 additions and 1 deletion.
29 changes: 29 additions & 0 deletions doc/esp8266wifi/client-class.rst
Original file line number Diff line number Diff line change
Expand Up @@ -29,6 +29,35 @@ Default input value 0 means that effective value is left at the discretion of th

``stop()`` returns ``false`` in case of an issue when closing the client (for instance a timed-out ``flush``). Depending on implementation, its parameter can be passed to ``flush()``.

abort
~~~~~

.. code:: cpp
void abort();
Originally proposed in `#8738 <https://github.com/esp8266/Arduino/pull/8738>`__
Unlike ``stop()``, immediately shuts down internal connection object.

Under usual circumstances, we either enter ``CLOSE_WAIT`` or ``TIME_WAIT`` state. But, the connection object is not freed right away, and requires us to either
* wait until ``malloc()`` returns ``NULL`` when our TCP stack tries to allocate memory for a new connection
* manually call ``tcp_kill_timewait()`` to forcibly stop the 'oldest' connection

This API frees up resources used by the connection. Consider using it instead of ``stop()`` if your application handles a lot of clients and frequently runs out of available heap memory.

*Example:*

.. code:: cpp
# define MIN_HEAP_FREE 20000 // or whatever min available heap memory convienent for your application
auto client = server.accept();
// ... do something with the client object ...
if (ESP.getFreeHeap() >= MIN_HEAP_FREE) {
client.stop();
} else {
client.abort();
}
setNoDelay
~~~~~~~~~~

Expand Down
11 changes: 11 additions & 0 deletions libraries/ESP8266WiFi/src/WiFiClient.cpp
Original file line number Diff line number Diff line change
Expand Up @@ -376,6 +376,17 @@ uint16_t WiFiClient::localPort()
return _client->getLocalPort();
}

// Api for heap saving. Optional use instead of WiFiClient::stop to systematically retreive some heap memory
// and avoiding server crashes in case of frequent clients connections.
void WiFiClient::abort()
{
if (!_client)
return;

flush(0); // Flush output buffer. Don't make any use of return boolean.
_client->abort(); // Wich in turn calls tcp_abort which calls tcp_abandon().
}

void WiFiClient::stopAll()
{
for (WiFiClient* it = _s_first; it; it = it->_next) {
Expand Down
6 changes: 5 additions & 1 deletion libraries/ESP8266WiFi/src/WiFiClient.h
Original file line number Diff line number Diff line change
Expand Up @@ -103,7 +103,7 @@ class WiFiClient : public Client, public SList<WiFiClient> {
friend class WiFiServer;

using Print::write;

static void stopAll();
static void stopAllExcept(WiFiClient * c);

Expand Down Expand Up @@ -148,6 +148,10 @@ class WiFiClient : public Client, public SList<WiFiClient> {
virtual bool outputCanTimeout () override { return connected(); }
virtual bool inputCanTimeout () override { return connected(); }

// Immediately stops this client instance.
// Unlike stop(), does not wait to gracefuly shutdown the connection.
void abort();

protected:

static int8_t _s_connected(void* arg, void* tpcb, int8_t err);
Expand Down

0 comments on commit 3c6db4e

Please sign in to comment.