WiFi Mesh unstable when parent offline (IDFGH-13875) #14720

michaelsimp · 2024-10-14T01:23:56Z

Answers checklist.

I have read the documentation ESP-IDF Programming Guide and the issue is not addressed there.
I have updated my IDF branch (master or release) to the latest version and checked that the issue is present there.
I have searched the issue tracker for a similar issue and not found a similar issue.

IDF version.

v5.3.0

Espressif SoC revision.

Chip is ESP32-S3 (QFN56) (revision v0.2)

Operating System used.

Windows

How did you build your project?

VS Code IDE

If you are using Windows, please specify command line type.

PowerShell

Development Kit.

ESP32-S3-WROOM-1

Power Supply used.

USB

What is the expected behavior?

I expect the ESP32 to continue to run the application without crashing when the WIFI Mesh parent disappears.
If the MESH_ROOT was powered off, I expect a MESH_NODE to assume the role of MESH_ROOT
If the WIFI Router is powered off, when I restore it, I expect the mesh network to establish itself

What is the actual behavior?

Sometimes these tests work perfectly. The Mesh network goes down and the nodes start scanning. If I restore the WiFi router, the Mesh network is reestablished
Sometimes the Mesh network goes down, and can't recover. It doesn't crash but it doesn't scan properly and reestablish the Mesh network.
Very regularly, if I power off the WiFi router the MESH_ROOT intermittently crashes OR if I power off the MESH_ROOT a MESH_NODE intermittently crash

Steps to reproduce.

Power on system comprising 2 x ESP32-S3 dev boards and a Wifi router
Connect a serial terminal (I am using PUTTY) to each serial port for monitoring
Let the Mesh network get established and verify MESH_ROOT and MESH_NODE connected.
Power off the WiFi Router
In the example (logs below) the MESH_ROOT crashed

Debug Logs.

I (00:58:03.336) aWifiMesh: <MESH_EVENT_MESH_STARTED>ID:77:77:77:77:77:76
I (136546) mesh: <MESH_NWK_LOOK_FOR_NETWORK>need_scan:0x3, need_scan_router:0x0, look_for_nwk_count:1
I (00:58:03.336) aWifiMesh: This node MAC:48:ca:43:9b:53:d8
I (00:58:03.354) aWifiMesh: WiFi Mesh started successfully, heap:141084, root not fixed
WIN> I (140766) mesh: [S6]VONETS, 00:17:13:20:bd:74, channel:8, rssi:-12
I (140776) mesh: find router:[ssid_len:6]VONETS, rssi:-12, 00:17:13:20:bd:74(encrypted), new channel:8, old channel:0
I (140786) mesh: [FIND][ch:0]AP:11, otherID:0, MAP:1, idle:1, candidate:0, root:0[00:17:13:20:bd:74]router found
I (140796) mesh: [FIND:1]find a network, channel:8, cfg<channel:0, router:VONETS, 00:00:00:00:00:00>

I (00:58:07.590) aWifiMesh: <MESH_EVENT_FIND_NETWORK>new channel:8, router BSSID:00:00:00:00:00:00
W (140796) wifi:<MESH AP>adjust channel:1, secondary channel offset:1(40U)
W (140816) wifi:<MESH AP>adjust channel:8, secondary channel offset:1(40U)
I (141126) mesh: [SCAN][ch:8]AP:1, other(ID:0, RD:0), MAP:0, idle:0, candidate:1, root:0, topMAP:0[c:0,i:0][00:17:13:20:bd:74]router found<>
I (141126) mesh: 1330[SCAN]init rc[48:ca:43:9b:53:d9,-9], mine:0, voter:0
I (141136) mesh: 1368, vote myself, router rssi:-9 > voted rc_rssi:-120
I (141146) mesh: [SCAN:1/10]rc[128][48:ca:43:9b:53:d9,-9], self[48:ca:43:9b:53:d8,-9,reason:0,votes:1,idle][mine:1,voter:1(1.00)percent:1.00][128,1,48:ca:43:9b:53:d9]

I (141456) mesh: [SCAN][ch:8]AP:2, other(ID:0, RD:0), MAP:1, idle:1, candidate:1, root:0, topMAP:0[c:0,i:1][00:17:13:20:bd:74]router found<>
I (141466) mesh: [SCAN:2/10]rc[128][48:ca:43:9b:53:d9,-8], self[48:ca:43:9b:53:d8,-8,reason:0,votes:1,idle][mine:1,voter:2(0.50)percent:1.00][128,1,48:ca:43:9b:53:d9]

I (141776) mesh: [SCAN][ch:8]AP:2, other(ID:0, RD:0), MAP:1, idle:0, candidate:1, root:1, topMAP:0[c:0,i:0][00:17:13:20:bd:74]router found<>
I (141776) mesh: 7391[selection]try rssi_threshold:-78, backoff times:0, max:5<-78,-82,-85>
I (141796) mesh: [DONE]connect to parent:ESPM_3372B8, channel:8, rssi:-15, 30:30:f9:33:72:b9[layer:1, assoc:0], my_vote_num:0/voter_num:0, rc[00:00:00:00:00:00/-8/0]
I (141806) mesh: set router bssid:00:17:13:20:bd:74
I (142596) mesh: <MESH_NWK_MIE_CHANGE><><><><ROOT ADDR><><><>
I (142596) mesh: <MESH_NWK_ROOT_ADDR>from assoc, layer:2, root_addr:30:30:f9:33:72:b9, root_cap:1
I (142616) mesh: <MESH_NWK_ROOT_ADDR>idle, layer:2, root_addr:30:30:f9:33:72:b9, conflict_roots.num:0<>
I (00:58:09.409) aWifiMesh: <MESH_EVENT_ROOT_ADDRESS>root address:30:30:f9:33:72:b9
I (142616) mesh: [scan]new scanning time:600ms, beacon interval:300ms
I (142636) mesh: 2012<arm>parent monitor, my layer:2(cap:6)(node), interval:7286ms, retries:1<normal connected>
I (00:58:09.436) aWifiMesh: <MESH_EVENT_PARENT_CONNECTED>layer:1-->2, parent:30:30:f9:33:72:b9<layer2>, ID:77:77:77:77:77:76
I (00:58:09.451) mesh_netif: It was a wifi station removing stuff
Guru Meditation Error: Core  0 panic'ed (LoadProhibited). Exception was unhandled.

Core  0 register dump:
PC      : 0x4212753c  PS      : 0x00060030  A0      : 0x82127613  A1      : 0x3fcc1660
A2      : 0xffffffff  A3      : 0x00000000  A4      : 0xff000000  A5      : 0x00000001
A6      : 0x3fcc0a64  A7      : 0xff000000  A8      : 0x3c1505e4  A9      : 0x00000000
A10     : 0x3fcc0a64  A11     : 0x00000000  A12     : 0x00000101  A13     : 0x3c1505e4
A14     : 0x00000007  A15     : 0x3fcd8024  SAR     : 0x00000004  EXCCAUSE: 0x0000001c
EXCVADDR: 0xff00000c  LBEG    : 0x40056f5c  LEND    : 0x40056f72  LCOUNT  : 0xffffffff


Backtrace: 0x42127539:0x3fcc1660 0x42127610:0x3fcc16b0 0x4037e0aa:0x3fcc16d0

More Information.

My application integrates a number of IDF example programs including ip_internal_network
I went back to the example project ip_internal_network and built it unmodified, and can reproduce the same problems quite readily.

Also, for when the ESP32 nodes don't completely crash, I would like to know how to restart the Mesh network in software.
I have tried stopping the Mesh network with:
ESP_ERROR_CHECK(esp_mesh_stop());
ESP_ERROR_CHECK(esp_mesh_deinit());
ESP_ERROR_CHECK(mesh_netifs_destroy()); // I have tried with and without this line. Without it, the logs continually report:
I (135746) mesh: mesh is not started
E (00:58:02.547) mesh_netif: Received with err code 16388 ESP_ERR_MESH_NOT_START

I then try to restart the Mesh network with:
/* mesh initialization /
ESP_ERROR_CHECK(esp_mesh_init());
ESP_ERROR_CHECK(esp_mesh_set_max_layer(CONFIG_MESH_MAX_LAYER));
ESP_ERROR_CHECK(esp_mesh_set_vote_percentage(1));
ESP_ERROR_CHECK(esp_mesh_set_ap_assoc_expire(10));
/ set blocking time of esp_mesh_send() to 30s, to prevent the esp_mesh_send() from permanently for some reason /
ESP_ERROR_CHECK(esp_mesh_send_block_time(5000)); // was 30 seconds
mesh_cfg_t cfg = MESH_INIT_CONFIG_DEFAULT();
cfg.crypto_funcs = NULL;
/ mesh ID */
memcpy((uint8_t ) &cfg.mesh_id, MESH_ID, MAC_SIZE);
/ router */
cfg.channel = CONFIG_MESH_CHANNEL;

cfg.router.ssid_len = strlen(meshProvisionData.ssid);
memcpy((uint8_t *) &cfg.router.ssid, meshProvisionData.ssid, cfg.router.ssid_len);
memcpy((uint8_t *) &cfg.router.password, meshProvisionData.password, strlen(meshProvisionData.password));

ESP_ERROR_CHECK(esp_mesh_set_ap_authmode((wifi_auth_mode_t) CONFIG_MESH_AP_AUTHMODE));
cfg.mesh_ap.max_connection = CONFIG_MESH_AP_CONNECTIONS;
cfg.mesh_ap.nonmesh_max_connection = CONFIG_MESH_NON_MESH_AP_CONNECTIONS;
memcpy((uint8_t *) &cfg.mesh_ap.password, CONFIG_MESH_AP_PASSWD, strlen(CONFIG_MESH_AP_PASSWD));
ESP_ERROR_CHECK(esp_mesh_set_config(&cfg));
ESP_ERROR_CHECK(esp_mesh_start());

Doing the above when the system is running normally, often causes the ESP32's to crash with various errors
eg after start, MESH_NODE does a scan and then crashes with Guru Meditation Error: Core 0 panic'ed
I (00:22:02.752) aWifiMesh: <MESH_EVENT_FIND_NETWORK>new channel:8, router BSSID:00:00:00:00:00:00
W (1323864) wifi:adjust channel:1, secondary channel offset:1(40U)
W (1323874) wifi:adjust channel:8, secondary channel offset:1(40U)
I (1324184) mesh: [SCAN][ch:8]AP:2, other(ID:0, RD:0), MAP:1, idle:0, candidate:1, root:1, topMAP:0[c:0,i:0][00:17:13:20:bd:74]router found<>
I (1324184) mesh: 7391[selection]try rssi_threshold:-78, backoff times:0, max:5<-78,-82,-85>
I (1324204) mesh: [DONE]connect to parent:ESPM_3372B8, channel:8, rssi:-14, 30:30:f9:33:72:b9[layer:1, assoc:0], my_vote_num:0/voter_num:0, rc[00:00:00:00:00:00/-120/0]
I (1324214) mesh: set router bssid:00:17:13:20:bd:74
I (1324834) mesh: <MESH_NWK_MIE_CHANGE><><><><><><>
I (1324834) mesh: <MESH_NWK_ROOT_ADDR>from assoc, layer:2, root_addr:30:30:f9:33:72:b9, root_cap:1
I (1324844) mesh: <MESH_NWK_ROOT_ADDR>idle, layer:2, root_addr:30:30:f9:33:72:b9, conflict_roots.num:0<>
I (1324854) mesh: [scan]new scanning time:600ms, beacon interval:300ms
I (00:22:03.744) aWifiMesh: <MESH_EVENT_ROOT_ADDRESS>root address:30:30:f9:33:72:b9
I (1324854) mesh: 2012parent monitor, my layer:2(cap:6)(node), interval:4526ms, retries:1
I (00:22:03.771) aWifiMesh: <MESH_EVENT_PARENT_CONNECTED>layer:2-->2, parent:30:30:f9:33:72:b9, ID:77:77:77:77:77:76
I (00:22:03.785) mesh_netif: It was a wifi station removing stuff
Guru Meditation Error: Core 0 panic'ed (LoadProhibited). Exception was unhandled.

Core 0 register dump:
PC : 0x4212753c PS : 0x00060830 A0 : 0x82127613 A1 : 0x3fcc15d0
A2 : 0xffffffff A3 : 0x00000000 A4 : 0x00000278 A5 : 0x00000001
A6 : 0x3fcc09d0 A7 : 0x00000278 A8 : 0x3c1505e4 A9 : 0x3fcd778c
A10 : 0x3fcc09d0 A11 : 0x00000000 A12 : 0x00000101 A13 : 0x3c1505e4
A14 : 0x00000007 A15 : 0x3fcaa7f4 SAR : 0x00000004 EXCCAUSE: 0x0000001c
EXCVADDR: 0x00000284 LBEG : 0x40056f5c LEND : 0x40056f72 LCOUNT : 0xffffffff

Backtrace: 0x42127539:0x3fcc15d0 0x42127610:0x3fcc1620 0x4037e0aa:0x3fcc1640

I sometimes get MTX task stack overflows too when I try this, same as #13882

The text was updated successfully, but these errors were encountered:

zhangyanjiaoesp · 2024-10-22T09:56:57Z

@michaelsimp
I have tested using the ip_internal_network example, but I didn't reproduce your problem. Can you provide the .elf file when the crash issue happen? Or can you provide the core dump decode file?

michaelsimp · 2024-10-24T04:15:21Z

Hi Thanks for the response. I have a deadline for a project demo this week but I will go back and reinstall from scratch and rebuild and test and send you the files requested. One thing I note is that I originally installed IDF into vscode when 5.2.2 was the current build. I have since upgraded to ver 5.3.0 and selected this and did a clean build on my project. Now I have gone back to "show examples" to recreate the ip_internal_network project from scratch, it asks which version of the IDF I want to use but only lists ver 5.2.2. Only once I create the project I can change the version of IDF to 5.3.0 Using winmerge I compared the directory structure and files from my target folder containing ip_internal_network from ver 3.2.2, with the ...\esp\v5.3\esp-idf\examples\mesh\ip_internal_network folder and the only change was in the partition.csv which now must be aligned. I also note there is now a version 5.3.1 marked as stable, and 5.3.0 has gone? I also found migration notes for going from 5.3 to 5.4 although I can't find this in the installer. Is 5.4 the master (development branch). Which version would you recommend I use? I have found everything in IDF to be stable and bug free except for the WiFi Mesh. Could you advise how I update IDF properly so it will enable me to select latest version of IDF when I create project from "Show Examples". Last time I followed the instructions from Visual Studio at: https://marketplace.visualstudio.com/items?itemName=espressif.esp-idf-extension#:~:text=In%20Visual%20Studio%20Code%2C%20select,not%20supported%20inside%20configured%20paths Regards Michael On 22/10/2024 10:57 pm, ZYJ wrote: @michaelsimp I have tested using the ip_internal_network example, but I didn't reproduce your problem. Can you provide the .elf file when the crash issue happen? Or can you provide the core dump decode file? — Reply to this email directly, view it on GitHub, or unsubscribe. You are receiving this because you were mentioned.Message ID: ***@***.***> [ { ***@***.***": "http://schema.org", ***@***.***": "EmailMessage", "potentialAction": { ***@***.***": "ViewAction", "target": "#14720 (comment)", "url": "#14720 (comment)", "name": "View Issue" }, "description": "View this Issue on GitHub", "publisher": { ***@***.***": "Organization", "name": "GitHub", "url": "https://github.com" } } ]

…

--------------UEln60op0cSX066h04nQooA4 Content-Type: image/png; name="HUQ061jBK81kEjpj.png" Content-Disposition: inline; filename="HUQ061jBK81kEjpj.png" Content-Id: Content-Transfer-Encoding: base64 iVBORw0KGgoAAAANSUhEUgAAARoAAAEvCAYAAACaBA6kAAAgAElEQVR4Xu19DbQd1XXefk8/ j8fPQ1TYQMBGgvXA6ymoxEIgEI717Dh1rNXIiaWsYqzmx4XlZSe1bJ6cChGXunZoo2db7XKo I+y4toxBIFrTmLTguFL4keXISlTbosHCeQKLYAGqxeNHYCy9zpmZM3P+Zs6ZOzN3ztz73bW0 4N17fvb5zj7f7LPPmb0HRkcvmiF8gAAQAAI1IjAAoqkRXTQNBIBAiACIBooABIBA7QiAaGqH GB0AASAAooEOAAEgUDsCIJraIUYHQAAIgGigA0AACNSOAIimdojRARAAAiAa6AAQAAK1I9Ac 0bzlY/TFD15KJwdDPPiX76UNW2sfKzrwAIHl679Av//mYNZf2Eefu/5P6BEPZIII9SPQMdEk CiPJeJDuu+ZG+qqL3E0Tzdo/pjvetcAg6cu079Z/Rf/xoein9/2Hr9HK85ViySJZTn+45UN0 6WmGZp64j675N7dbkUjaFxaesc+4JZ2Uy8tgFbKyAqKsBXSlsv7RUFMIdEA0OYpNBZSnJqJJ CNC20K1Ecy3dcsdKMlFR+jTOwyKYUoendlGiiRRFxLlNREMEi6appd5sv8WJRligL//tn9L7 N3Hjlyn8Snrq+iYtGmHRORONbMHw6RAtNtmKCAhoy7n0jdDsN/eXWiTmtsUpzyUaiagUQknG V2DMzeoaeu9jBCokGh1FbXslLv4si0b4Xn96x31o1kiwoL/1Q7ro7ZHPR/zIZCj8krRRlGjE 1jMWuSCfzf/kTjSsX9PWoyDRZMiWzlVqLbnP331E72LWH8fSYA3Gc28abzgy7rtJ4JXnRZZv Py0SrE0RY2eLto8XfRNDL040pCuRaTFl+hk42ZiIJnM7k6P8IWpliEaBnVsRGuGZCKmbFk0k Z7rguDwFicZoheltFJm/FEEm0w6a90HDljOTaPK3n1y3xHG//MLJdLLkF+P6IepmgW18Eyuv z/rsgGgYQmb/RUI4JhJJvosVQCtjWjDpd2HbB9OTKhKto7Ufoz88+CeBA7fAossiNXG7opFN TGqJs9jio7Ft34Lmilk0FqJRlDfLmtL8JOpcCDjrc6rPn+SLsvjetPEaLSwB13g+SLB4uJVq 2t7CovGTwTokGj4YlXDip+yCrBMdYaGSeryd43xlNgvzB/1kZXxSlOX76IRo7H4UytzOZRON tMg1Ukuftk0QTToeZb44yWZalub50311ykmcQLjqeFMCkS0QdSs3lRCNUK6mAwU/l2q7pSpJ NKo5HxHCn9PvRnclMj6qdaJZK6Z6gbJ+7sh43G6XiSaWR9xORETiSGyVEY1AxqYjdgcLKp6x 5FiezdeO+R8Kj/BNloJpKozzp9yD0rdeEUEQvy4Qy78wuT4Aomk3leRLX5xo2KJZtF+6I6KZ sJRaNJnOUO1pJCyirAUjWhaVbZ2yncG/R18STtXEOzWd+kf0yXC3aGSLL7UiHMlO6TrdYhyk g+cvEBy5QUEXZ7arNaE43Z9aGd9LMlhPqa7kbZ3yLRpsnfykq86IxnjRjQ2QK0H2liJZIAZF NV8CDJq1XmZLyUJ9ktpPnfSJYQr/jbPjG6ymeStlTcgNdnSPRiLizohG2w7mHaULIufNX1jM 6NdK5zCxYFwuPTKNim+Nm07FxL40C7PInS4/12ZPSVWcaDIcwZJz1rDV4KhZFdXkH1AuvumE JJrdWU9+Zd5y/BB5W7lMf4TztqUM0ZhOUjokGumo3PwaiOnkyTp/JqIxPSiUOc3aavFb5m5E IzjLO5yPnlrdHg2mA6LxSHqIAgSAQCsQANG0YpogJBBoNwIgmnbPH6QHAq1AAETTimmCkECg 3QiAaNo9f5AeCLQCARBNK6YJQgKBdiMAomn3/EF6INAKBEA0rZgmCAkE2o0AiKbd8wfpgUAr EADRtGKaICQQaDcCIJp2zx+kBwKtQABE04ppgpBAoN0IdEQ0s4fm06w5p9Hg4ElEA+0GANID ASDQAQIzRCdOvELHX3uBfv7qEWsDhYhmYHAOzT35XBqcFRAMPkAACACBAIETx1+hn738FM2c eC0Tj0JEM3TqgpBkjo2cQ89c+BZ6Yf6FdCIgH3yAABDoLwQGA1I57ciP6PU/eoiGp58OyebV Fw+WJxq2XZpz0utCkvnR5b9DC//hXnr94T006/ir/YUwRgsEgAAdnzVEz5y1lKYuWEUX/s1/ DcnmtVeezdxGOVs03Jp54pd+i858/gd0zj8+DLiBABDocwSe/oWr6bnTf5HO/7u7cq0aZ6IZ HnlT6Pj9wa9soCu+fRMsmT5XMAwfCDAEmGXznSs/Sb/4V7cQBQ7iY9N/bwTGnWhOD4gm+Hzv Vz9OVz+4DigDASAABEIEHv7lzbT4gU+E/3/seRAN1AIIAIEaEADR1AAqmgQCQEBGAEQDjQAC QKB2BEA0tUOMDoAAEADRQAeAABCoHQEQTe0QowMgAARANNABIAAEakcARFM7xOgACAABEA10 AAgAgdoRaAnRLKHrP7WWxo7uoHWb7i0HyrsnaPMKoh3rJqlkS+Xk8KV2iMd8evT2DbRljy9C QY5eQ8Abolly3S20dtGwgu+xeAF0k2hW0cTmcTpPnemnZJJbtX4zjZ8rFjqUktfS6+mWa8dI Hg0fi02Fov5p5zqa/HpUVu8r+v7Y/q204ba9UoP5OBr6BtHYJgS/V4CAX0SzYIq2btxC8tKp YJRiE1aLRl/osgRm0lu1foJoU2wlhUSzkKYEKyEiAHKwHDKIhlRrjhOiQHCBoGE/3cCx4mlp RXNW3WnFKEIhnzvrMnp16Aw698lv5gr91BvfQUOvPEdnPvN3pQYHotHgsxBNSCIjtDtv62Ug moACwu3fwoO6FSKL4Eo0rJZOeiCaUushv3KPEM0rw2fSd6/6ZDjWhQfuySSbHy/4NXriwlVh uaWPbAgI56cdg9sSolEXafr37nlr0y3MS49qFpG87Qie/juJxlfk+WhciGaMjghbGw19I9HE WyDNMlFrFyGaoK6i/IWJRqkf4hXIuPXoMmErK1tNksQZY1XlCkavbEnVrST7fRlNK74iLk/k m+PzvoOmFozT2ClEh7LmIRyXsAFmW9+dZ2mWZjgWCYOYvIO2o08k5+EVOVtldWySHuoys1Yj uWVMMsfS8fI2V3xleH5ANJ9Kflzw+H+n8564Xyp8aME76eCF706+W/rIjQHR/L+OJWk10ciK Fk3afMFvESrpPIF8Et9JzsKJlUb0kajocj+IyUcS0b++darHotH7qoRoAt9TOjabf8xsqUkE EeNOok8pJoJ0cbkTzdgpFn+Xhn/Q9nqiyU2HjFalKKtMbGwrOkFL9k1GjnKjRZPxYEj0jhNX KnPqR1O/O9K1Q4pnz15Kjy16f6La5//DX9Abpu4L/37ygn9OTy5cmfz2ph/cRmceLufQ8Ito VGdw8mQwWzTqKZS0yJyftGaLQnUGa08bweGrEY6h78iyyiM4LkdBiyYkxtQSMDqDDZZeMmqT RSOSs/bENzzUWBtLpwVrUpZJXbzJSGPrKbJWChCN7fQxb4uTK6tle2toN8R73m75NFSaf1Ob +kORW3x5D7iOzYmMis+cs4x+OPY7ya9vnPoGzQzMIrZl4p/RR79MZz397dJd+0U0mU7M7K2T eOoiEY2mUDFW1n22zRmsYM4JR1zMplMnZbFrhJCcahUkGoXUKrFo1O2dE2bCtkfCPpq7kT3p KZpEcglBuRON3c+Vbn90q1PpR9WTZMtleCgYcMg6EeRbri17solGJpWCeld66UcNqGQjNlsV ybA2QTTahHUy4UqdLGvKSTkKEo2yUJohmsj/tOxo5Ohm/z96gBNLE0QjW4fMOhUJJ1vWdILS B4FAOBlEw8dtnl6/iSYkm7OvoB8u+l1J/Iv3/zm97id/46SxLoV6m2gMF9EiBcrbC3dCNIoy dY1odFmbIprQLxX4DrdvPEzjm0fpgHAq5751ku8P6adqrid3suprmOTIKtdU8M3aOuVeJ/Cf aNiYnz378sBn83vh8C/e/8WAZKq9vdm7RMOPfqliZ3CgpBOX7qVJ8aKceumtG0QTm/jq1qAx ognxXk108AiNzTtg8FmMke4Mlm8kq877xKpItpWORBNgM0GTyYVHHZMsWYPv1y+hvZv4XS4H S9Xk6Gb+ptD5LJ6UidcaTA+zTh5wLraEe5lXT5ofFh56xZ5V0r3VqGQPEw0bnnJUyfwke0Zo 7Qr78bZ2M5hiE9p461fZz9dBNNItZDY2s2O5OaKJLwtmXUrUcDPJLx/3MhINry8kPiN3opGO tg3OcPMFSvVoWz8+T30yeTfBxVOxdlg0RYmjaHlviKao4CjvHwKFSa7BIbRJ1gZhqqxrEE1l UPZ7Q0vouk++L7j5/FW68Qvl7lzUj2SbZK0fjW70UBPR/FGQ1+kj3ZAfffiCwKob6DNvDd4F /ein6X/4IlOWHG2S1XcsHeV7+Jc/G+R1+vdh6eryOr3jJrr6oY/mijAzE6Ssw6cHEPh1uuGz 7G33wC/xtRvptmoPKyrGp02yVjz0mpsbGAhS1OZ8Hn7LZ2jxN6P3qyogmovDhr73jj+i5bBo ap5aNA8E2oPAI8yi+Sa3aB4zCl4gJe7FxAyV7/8qiKY9KgBJgUD9CDCiuSTYOjHD59jzJYnm pJHIovl+sHVanrF1wpap/klFD0CgKQSytlCPBFunS+Kt0yvTJYiGEcjw6W+KiOZXNtLyh28I rBv4YZqacPQLBHxBgJHPI1d/mi75qyg0BfPRmAjJunXihBIRzUxINFc9dIN1nCAiK0QoAAS8 R8DmCGYD2PUWTjQDiTNYrZdLNCJZDJ8eb53evoGueji4gx1/QCje6woEBAKVIyASya6rN9El 37oltmjSrZNYJpNoVAIZPi1wBg/M0KPjE7T0O/+OBo+/qgkP0ql8PtEgEPAGAZN1c2LWEO25 4t/S2I5JGpgJLJoXZB8Nr2MkGhNhDJ1yPg0EjT75T3+T5j+/n86OA+aAXLzRAwgCBLqGACeQ n5xzJR05fRG98f/8N5oJjI9XX3pCk4GVdSaaWXP/Cc0J3gA9dtrZNPXm99L5QcSuM5/9W5p1 /GdwDHdtetEREGgeAUYcx2fNpede92Z6IggLunDv12j4hcP02qtH6PjP9NjDRqIxWygz4R2a uae8kQYDq+bYqWfRcwuuoBfPWEAngvCAzEkcFgg/7P/j/zaPCSQAAkCgFALB5ZjwYnB8Ozi8 JTxAgzM/p1N/+gSdeXA3Db/4DJ0IrJmfvfRk2JPTqZOJaPh3A4OzadWvr6TBwUFjY6XGg8pA AAi0DgHGDSdOnKCv3xsEPw/Ih39yT53yrJnQRgkafc97fpO+/e3yAY1bhygEBgJAwIjAlVde Sffcc09o6UgnTcI7UpKPJs+a4USzevV7aNeuXYAcCAABIBAicNVVV9Hdd29Ptk3JSZMr0YjE w/8fRAPtAgJAQESAEc1dd90dWjMiyUQ8E/l2ci0alWjY32vWrIZFAz0DAkAgQYARzbZtdyVE oxJOAaKJTp1ANNAuIAAEVAQqIxpu2dRDNLa0rAUm1poMrUBbvVBUzeDQC2PCGLxDICKabbFF E51I822TdjNYdQSb/DOdEo0xlWucYJ1n+1NT4HaEppVo1GT0cS9Jyo/obz1DYV5UfFbDki86 GUxGAjktC4KcFI1Xz8fRgBiIpiM1QqViCDCiufNOTjTBHZvB6K6NtIUaHb0oul6nhH2onGhy E3EVG1hmaUeiyc6BbLauVq2fINo0SSyTT5hM7dqFNHX7hig5fPAxp/cwSemaqZITopy2BNH9 K9ITUzNW3amx75Y3rRINJ5gOiSby03TiDO7aArEqiyWRV0giI7RbyMSo6YAxr5NjTqIw2b2c sTEr06OeyTEmtG4QdssVvyPxrbrTUat9UaklRKMu0vTvMMEY31YYEoXJW5zg6b+TaHyFPYFc pkUTJ0I7stOQtJ6rTEYCuWzCEHWtCNEE9RTlL0zYSn0u49ajy4LUwcOxYOZkdeGPWcnytEWp bkmVraSSQzzqWMWCz/sOmlowTmOn6AneOJLq1vYQm6+ndEszLC/JqiaRi+Q8vGJzqmdhpZyt srTN1mUOa4f6I2MSfdebvFMp0fCtVPUWjZloZEWLJm1+kN2QJZoP1TRINj8+r+KUuEG73A+i pqNNVKRrFo2+0CshmoC407HZHPFmS00iVVPa2Dilb7K4ChDN2Cn5/i4NgyRFrl1W9WGw5LoJ WrJvMtoCmywaba5VvDhxpTKnfjT1u7yc8O0mIL+IJnmCxqAmFkoG0RzdIeV4lhTM+UmrTqDZ Gaw9bYQUrxrhGPqOnrA5lkEiRkGLJnzqL6Pp2B9kdAYbLL2kO5NFI5Kz9sQ3KLxGErJMWZac 9H0RolHmXZtBhnWSRlf5NVdWy/bWQDRsDMuOinm1zRbSwoN67m3xoahbb+0mFlV6v4gm07eQ vXXi1gsbmEQ0RsXNeCpJqBRMts4JR1zMpvzc0mLPy/FckGgUUqvEolEXqdU3IRNL+ORfOk1b N26hvXH+85E9hm2BWK4A0ciL1rAgE/xNlk+erFw/WOZ1w0NBw0Gfx1QaXh+5txkmIBpNTwsS TbRJkx24WdaU00OqINEoC7QZoom2qfzJzv5/9AAnlmihdZVoOM7x9ky9WpAtazpBqWUoEE4G 0RjHljQFoul9olkxP3Tk8SPmxOpZlLcX7oRoFGXqGtHosjZFNKFT+N1E2zcepvHNo3RAOJVz 3jqtUJz0muPd9eROZHRDnRxZ5WeBgm/G1ilzmxY2BqLpbaKJJ3iMKnYGB0o6celemowdzqEu qZfeukE08dNa9Q81RjQh3quJDh6hsXkHJN9ZdDI1RiQ46jXMYqsw9Vuk25LUP+ZGNNKdJuNC z5I1+H79Etq7iW35HC1V1anNqkk6AqLpcaJJnybsdCr8MD/JnhFaqz45pcdYxs1gvmc3+V/U /XwdRKPdDDY7lpsjGsulRA23LB8I84+wD/Ov7KaRa8U7Ra5EIx9Fm04HzRco83xnkVTpsbm6 peJys1Li2EA0XhGNtNbxRysRKExyDY6yTbI2CFNlXXvjDK5sRGioIQTcrI2GhFO6bZOsfiBW VgoQTVkEUT9CwHoE7hFQbZLVI9jKiMKIZjp4qfJ/xoGvSr3rVOZmcJlBoG6TCHCfluvb6ZC1 SQSa6psRzeLXTdPGz/0vLfhV4TARIJqmphH9AgG/EQiJZvGZtHvjrfQNZtWsuZG+8M4LaPYr /5e+8OH/RCyVQRLK0xYmAkTj92RDOiDQFAIR0Sym575zE23+i6X0B5Pvp0vPmEOzZ79G+7/4 B/SZIJcBiKap2UG/QKBHEOBEc+JH99DHv7QPFk2PzCuGAQS8QkAjGsUpzISFRePVlEEYINA+ BEA07ZszSAwEWocAtk4+TFmpVxZ8GED1MmhBy6rvAi12EQHZGTxAq2/6L/RrC310Biev/Avo 5AR00gJAKVkM8jA2BjPKquAS2tM2oSAaDaFaiEZ9+dU2L/i9MgTac7ydFcgqA4pV111Ph26T 38I9L5ds5BfqMsN0Sv1VdJUdRFOZQtsawjtONoTq+b01F/bKKoitfvp7EBMuCNRkjeLG5sMl I4LLvIFoXFCqqEwn8YYq6rqPm2FEM/TYNvrKvjSXk/gaAoOm9lMnc2Ak2VoIiWDebjnOSYGJ sxFN2pS7laLLxOsaovSrYRJE68pINEq4CnWbqLZX8Hc5S0DO6wOusZddxrdzihauGKPhJIyC GpYhlcOkE9btMH+H6fZpWhbEv0lyOCjZBdyyURRQLhS1IuDHS5Wml9xMQbPFmCx5AbfVYRfy o7gSjSlEpR7xPhTFFilf+z0jnKcQNNwU3CnN5GlIUyIEdNIWWq7vwoSHLaJgjAMPIm6KqxzA kpdxQP1ND+4e9yEGNuN+PEE3InKSIyq6P3Ss6wcFHBHwg2gKJU1jIzMomTZg0SJwyT7AG3Al GiXIdVhdWWBxk9ZI+aYA46r1ZtleOWWAyNzu5Y9ZW5iKLG7jGyM5F1Z+n3q6Fjnzp5HAjYRp 2Crh7W1HeqiumCdEE0dnSxaXaRGrgy6y145Jx8kKKk80sn9Hj9qWjiQmQMPCTZLiScOWtzha /m9hfMlvqgPcdHoX95HtADekdEnmqvj4kiElsugPAnsqlpTYk+DgRgIx6ElVvrXq1mHPt+QN 0UiOVacTpiJEY9q+ZM1tPUSTGynfxUIQxeVbEYFEzNsBgQR4WSdsdWxSq4WUrAamLaRS38Ua C3N6pYQDoukt7vGHaIT8PwdGDUm5VNyLntQ4l6+aaHRfhKZCpq1TTv5sE6nk+h3EJ70zDoqU nKCClK2rldzjVueqU5/yg6PzrZOa7hhbJx8oyyOiCeBgyjwaPNcCp6+YriP0fUjR6XUfjXTB S4pCb/bpmJyE0YS4Ek22M1g7GrdFylcXYmyxSFkDmB9rPdHkpnv1aHaqs1Udv7SlMPu3ZOey STVZvdU0cpRo/tHtSdrhsGTR8XGcczIOODuDxayVjlsnOIO7Tz1+EU3sFNYv1hn8AIrvQb5J ashkoJQvTzQmSyWHpDTfiJokXnF2qsfFYVaANEeV5J8JfDM7Di6kcW4FaXVVH4iOp8sFRXPm gFhpi46PO855hgq2cRKOoU1WkuqT0mR2JBqrBdb9ddjzPXpGNC3DG6cXLZswJm5B314LR+ij yCCaUrPius0q1QkqV4gAtk0VglmgKRBNAbCMRQtdBizbGeqXQgAvVZaCr0xlEE0Z9FAXCAAB JwRANE4woRAQAAJlEADRlEEPdYEAEHBCAETjBBMKAQEgUAYBEE0Z9FAXCAABJwRANE4woRAQ AAJlEADRlEEPdYEAEHBCAETjBFPNhZxeOqxZBs+aryU4uWdj7Cdx2kM0BbMg8Bf90snMD35l DROZpRVVXNgD0Wjo1kI0uLDXGLe1i2iWTtPWjTyzQR5m6tve8QuQQihMtXbxrAmshYpeQQDR dG0B4BWErkEtddQaoimtIAUXs1N/VUVqKyhbM6rSK73ipcomZtILoulGFgQ9QHg+3C5EgywI QpApNTSFKcsDsiA0sca96JNnqhwaGiL2b+7cuTRnDstUOTv8Nzg4WH+6lcifokRGqzILAtvk sGj4OVHrpNlw8rsgC0IS4EuzyJAFwYvV7ZEQjGjuv//+kFDEfzy3ExO19rxOphgh+cGJXLIg CCibIsDp7kea2DxO54Xfu2RNQBaEqTgQF7IgeLSiPRXFE6KpLwtCFJUtJ0GacWJcsiZkEw2y IHBQzVkeEsiRBcFTWqheLEY0DzzwQNMWTTAw0bHqFKnf5tQraPWo2FodtMWIBlkQDHmZYszT qwXIglD9EvejRX+IptIsCOZEboUgr4xokAXBzRGPLAiF9LNlhT0imgC5CrMg3KKkBFHnRQpO 7pA1QZ9XZEFQt0BicHFGLhOX7qXJ2/YaUgKzmupdpxyiCUpnpsRFFoRWUI5fRFNVFoScbIwU H7vKWRDsWRNMs6k7rJEFIcXJkuUBWRBaQRBVCekZ0VQ1rC61gywIXQK6ym5svr0q+0JbHAEQ TSldqOgVhFIyoHIRBArdpyrSMMrmIgCiKasgTpf7ynaC+pUggJcqK4Gxk0ZANJ2ghjpAAAgU QgBEUwguFAYCQKATBEA0naCGOkAACBRCAERTCC4UBgJAoBMEQDSdoIY6QAAIFEIARFMILhQG AkCgEwRANJ2ghjpAAAgUQgBEUwiumgpbX+CsqV+Pm60lOLnH4+110dpDNEWzIAgzZwzMlDez 8SU82r+VNrCXAh3KHtm5jia/3qG6gGg04GohGlzY61BBy1drF9E4Z0FgwMShIk6JQDrmQhox ntGbwi51KnoFAURTXpMdW8ArCI5AVVysNURTVEHS8ntpyafWkhz1LgfFMOgWC+Z5Hs0/aLFo kAWhYnXsRnN4qbIbKKt9eEE09WZBKGJ18Kh524PYOHZyQhYEZEFoYtG2sU8viKbeLAiuRCOW I7reagUhCwKyILRxyTcjsx9EEwe8IsGhWl0WBDeikftzqYMsCMiC0MyibWOvnhBNfVkQXNLW 6v6fckSDLAh8KSALQhtJoQ6ZvSGa6rMgcLhspGEI4ykhnZXjCVkQouN80xZSUVXLqRqyINSx tP1q0x+iqTQLggiyjWhME+JSx5VokAUBWRD8WvRNSOMR0QTDryoLgoSkmTTk4OQq9C5EgywI CWqmTKDIgtDEeva2T7+IpqosCF0hGpOlgiwIKfTIguDtqm9AMM+IpgEEynSJLAhl0GuoLi7s NQE8iKYU6i5brFIdoHLFCBS9YV5x933bHIim7NQjC0JZBLtXHy9Vdg9rpScQTWPQo2Mg0D8I gGj6Z64xUiDQGAIgmsagR8dAoH8QANH0z1xjpECgMQRANI1Bj46BQP8gAKLpn7nGSIFAYwiA aBqDHh0Dgf5BgBHN4sWLaWhoKPw3d+5cmjNnDs2ePTv8Nzg4SAOjoxfNMEhmZsL/JB/xb/b/ /O81a1bTrl27+gfFsiNFzGANwVqCk5edJ9TvGAFGNHfeuY0GBgaM/1jDfhBNB1kQ0vADbBjH 6NHbN9CWPTpWcjnl96d20LpN92YDXMWFPRBNd4gGF/Y6JoqyFdtFNAWyIKhvZ+e/rW2A0Wnx V/QKglNfZaca9RkCeAWhGT1oDdEUUxDTi3PFSCE/lGg8WciC0IzWluoVL1WWgq/Dyl4QTeVZ EDLeqnYiDwako4WBLAjIgtDhuuu7al4QTdVZELKsH1eryI2QkAUBWRD6ji86HrAfRFNxFoRy RGMK0WnCF1kQkAWh43XXdxU9IZpqsyCUIRpXqyeIr0cTm5fRtHSSZfIDyal5ZQ0zZwngKXl1 bZRPzrRyLz1KWzduIZYtPPlNPTUznd7FHWWnDZbHKm8Zi48vGVciix4AXrIqw+yh08nYUlwU q9K4ZTb4ZKryrfUdXXQ+YG+IptIsCB37aEEMlv4AABcHSURBVIo4jIsRzciedRRlDbCfcLFF tuxoTjre+Eh9WCARM0EKJMDLZi7afCVKZYqS66XjQRaEzpdf/9T0h2iqzIJgdOaWXxCyWrgS DbIguDnXZctDsmiynPPq964WDUKwdp3hPCKaYOwVZkFQb5aqT3zTvRr3bRObJ2RBULdAh4RM o4xcJi7dS5O3BRu5LOJfv4T2boq2etFWdJx4tlLVIR9tBcUtVmytHRUuVDoSTbF57vqa7MkO /SKairMgSD4MwX8R0sR1t9DaRUdox7pJ4vd+3U6bUj3QyyMLQooOsiD0JGN0OCjPiKbDUTRV DSZ4U8iX6BcX9kqA13FVEE3H0KXbJznXdqkGUblmBLBtqhngjOZBNGVxr+KlyrIyoL4bAnip 0g2nGkqBaGoAFU0CASAgIwCigUYAASBQOwIgmtohRgdAAAiAaKADQAAI1I4AiKZ2iNEBEAAC FRLN5fShTb9NFw+9Rg8++CBiBkO3gAAQSBDICk7+2v4v0e//5++E5RxjBoNooFdAAAiYEcgn mgGauPWDfhNN0dcGoAgVIOD7jWiv5SsSIaCCufKkCUY0FwXpVk4PUq08u+tjtOnraTaEaz75 ZVq5wDndSjMWjc9Eo4f6zJp1c0wX6SVFqapePrus3GfhIO0mkb1eyIHAXstnIxrDy6KekEUZ MTKJZvlH6PMf+CU61T2vE4hGngjXSH2slkMIC6nxVXT9dYdoC3szmn3iAFJWsnGMhWxVKK8X ctuJhqHfe+9jZRLNNTfTV951QZEEciAaaYEWCjBVVrFsT8lIMncLy0I1IBorF2cXKDBXC6YM UQVLdN1g1SyiuXLdn9IHl5zWPaLhW6AdNB7EHQnSve3nEeaiRXgeB0kJ92DaOskhLvXEcWoI TNkSULclcn012ZzZihAtlCzFEsmliPVj0hYX5TVYTZwwdhKNrwgQNoUCDbtTMDQQjQ0X9Xc1 bGj+nOTrgIZIIt8BGhV0R+qTW3c7p2jhijEaJh7WQukr+T7uhbd9+zQtu5bViz66HqjtcAzT udo9b22o6+FH0Wu3YGENMkfBrr2xaLiiyROmP+nVgFbGAEmkBkOan2apFAMyMbCUF+3U9pZc N0FL9k2GGS61N3+zXqhUYtKa3hiWvuOhOYXJs26DhLJOfhdTnFwes1eJK6yRt/oyokI0dlyC eVxPNMkzfiq4afWD9idoMg59atcBM9GwR5MaE2eMiD/AOOZanKJ0vlm7WmpejplQT8M/bjvp K2gn1aP0QZbOcTTG+cnDNexZCgRWcF17V9wbH40p17LR1Ff8DHrIxxHaLQSz4v6P7FAO4oTm WQZmqyPLopJi/mq+EYsF4uJzEYOM29L2ckJVA3yb3mY2Bu5W5JWIxh2XVPvl9vIc+i46YCYa 4eESFzCR+xExKqBpearWm/EN8PzogHKzZmev/jBysVK945NMgbw5dbJvgcQxpKa8Fi2fbQEM H91sTs1eVjx5uiQLWInMb7A6km6khZ698BLycYjCXyhuSixzdhaD2HIzEo2QBC6x8CwYiouv EC7CFjjoK5E3acO+zU2nNjuXeuapkyb3QuIpY0SVUbd5kmVkDRdqc+ybCcQ039Yg9e3hGcoj moGBf0Gf3LqyO/donCwDA7BuaTnSitEWTVRSs4maKpuYEkW1lnSBMh2ugnOYWBjRebtpHd9K GMZViGhC0zxoM895aHJOmxaNixNbW7D5uHAs061CxtM6IXn5QZKbEcLFCuFlrEQT+1XE7ZTR olHIWdrmgGhMU5JPNANhFcebwTM0MzMTVlizZnXhVxBMRGNdPEFfTtHyk5GbSCVvL6w6bNPg 2eaHSZ6ScUtne+AXEtOVmFsqej/IipUr0bgcgWtbpzxcTKSSty2Qf7OOqwDRWHUli3hXCMRi tWhsWS9cLZre2zrdeec2GhhIL+qJ/98o0USed8GBFyqV7FiUF2S8/6U0kVpYY/0E0SYWhFyf PNkJHfxui8wvReIXnXwZ2xNhIbBFs3reEaJ507Q9TvQW/iw5P+N2gi1glrNQdFCH9Y04KSsw 0xmsPp1tGHL50noRhvJWU5RRd7CzwPDDydYpnR8mszJHDjqgcY3BYavdNTIRqvYdPzkSxuZA NKb5UJ3Bqs9QJ9Tecwb7SzTCIuLHiOpRa1YmgrFTUvXL888c2rmDaAV/IqtH2/qxpXoMS4J/ xrqnziIE0bEbiq36H5RTCa286XhVXX4GxTUumnSxZ2JoqJeHCz9B4Z6fY/t30NSCceKLTa2r +Zo0P1COfyYm7s3MCtkZH9vHUKjpYG65VvfRyP6ZgGDCNopZNCL5J3qbbMccLRoXy1JjWH+/ qPDt7XJbJ38hcpTMwcHr2FJtxYpux2oTBA1bEehoy2httbkCIJqKsG/FIu6xp2RFU+dhM721 bWIAg2g8VLM6RXK63FenAGjbgoD5nk3bYQPRtH0GIT8QaAECIJoWTBJEBAJtRwBE0/YZhPxA oAUIgGhaMEkQEQi0HQEQTdtnEPIDgRYgAKJpwSRBRCDQdgRANG2fQcgPBFqAQOuJphUX5Vqg CIVEzHx1oVAr9RX2Wr7eelnSdRJBNK5IdVDOPUav/o4V6y4vyl7+u0XZwlZyYc/rhRyM3Wv5 bESDC3tJGAiuxjwsBPub/X+ZMBEdrOOwir8WTZE4wLYYJgo6ajjSOB6KHArSgGhVryB4vZDb TjShVvdUGE82Ilg0nTKcrZ5LAKmkjfKK5fISnruFZRkciMY2+zm/2yyaqKrLfJYQoutVvSEa bpkgC0JnOmBXTIPVxAljJ7IgIAtCZ3rnWssvoglSTyALQjR1RbIg2AOwBw0iC4IcRA1ZEFw5 opJyfhHNPDkynksE/L7NgiBMvynCnaYdmaE8lWwByIIgQ2eMGaxmWEAWBBsb+UU0Yj4m5hIL Q0SahoAsCBEqhmDaWTPuGjPYEL2PN5lEvtOCfMsZJRIRtOwQyIKQZX32QxYEb0N5WkNjcjLi BOXggO2ZLAgucYJF0ilCNGpaFpW8kAWBdki5w/TcYCN71sUJ8FTwHEN5xrpdOAOEzbRo6Hev LRq7g7NPsyBkZcjMUyJXonE5AkcWhByisV25cCUat9OphnijcLdeEw2yIPCnohycvKNjamRB oM1iZglkQShMFmUqVEo0l1++lC677DJ65plnKsnrFA7MEgG/H7MgZPuu8k6rkAUBWRDKUEW5 uoxoFi9eTENDQ+G/uXPn0pw5c2j27Nnhv8HBQbcEcpdffjldffVyevzxx8MGdu3aVU6yttVG FoS2zZjX8rq4DbwegCIcI5r7778/JBTxH08ix4o7Zar88If/NR08eJAefPBBetvb3tZ3ROPv qxDCjLv4X9qkvT0ra/mb4r5BUxnR3HTTRrr11lvD9536kWh8m9gseSp5qbItg22lnL37UuUD DzxQ3qL5sz/7PG3YsCGc2vHx8b6zaFqp0xAaCHQJAWbRgGi6BDa6AQL9igCIpl9nHuMGAl1E AETTRbDRFRDoVwRANP068xg3EOgiAiCaLoKNroBAvyIAounXmce4gUAXEWg90bTiolwXJ7Qr XSGUZwmYe+tlSVcgQDSuSHVQzv3lx+JZEFJxigRBj2PRLjqivH1ccHAgmoKAicVtRIMLe7lZ EJq6sOevRVOEAApmQWB6K71smgYCy10BVb2CAKKpkWhY0735CkKrL+x5SzQOQbhkq2ScaGdW sCRVr4Wn4r4lQSzchTR1+wbasidf/90tLMs6AtHUTDS9mQXBC6JBFoRlNO1AFpqGO1spyIKQ hCJNLMKAoHdO0cIVLBTpoXg7GYdHTYDm38dfcJK9fZqWXZuGMNWDyavtcKszfUjsnrc2DVX7 khwvO7JY3R4gJViva1W98dHwGCvIghDNvXMWBFeFRBYEZEHoGq3oHflFNMiCEM1QHCDciWxc iQZZEOSkbK7hUJEFoRJ68otokAUhmVTnwEeliYbkEyhkQYgIadGwsMCE7ZPRP4Xg5DY28p5o bFHgJWewgwO2Z7Ig8Jmtg2iQBYG2btxCexPrUiBjEI2NU4y/e000Lk91PYFcngPNdGyYd5Qo /uZy5Jh3TM2Pu7cHW6O1lJ2OI5on59O0qonGpT1kQUAWhIJ04zXRIAuCOQuCNMcuxJCcsozQ bjEfkfHpHF8YI/kUZNX6CaJNk3Sv4SlvypS55LoJWrJvMjxyV0mTb034KZDUNikX2oz5qwLS Xk80uSmURv/w7Z94kqP6vZAFoSBVlCvuN9EkC0TMhChfTuvHLAgdEY3pEljmfRj9prJ0NGyo p2VmyMlSeWz/DppaME4LD26lDbft1TKSSn056IC2BLh8O4Noj0GKFf5BFoRyZFGmtjdEU2YQ XtRFFgQvpqFXhHBxG7RprCCaimbL2adSUX8dNeO6zeqocVSqDgEXf2B1vXWjJRBNN1D2qA9k QfBoMoyi4KVKL1+q9F1tIB8QAAJEsGigBUAACNSOAIimdojRARAAAiAa6AAQAAK1IwCiqR1i dAAEgACIBjoABIBA7QiAaGqHGB0AASDQeqJpxUW5XtMzhPIsMaO24OQlmva4Koimxslxj9Hb SRYEpY4aCjJjXJVc2APRlNAaG9Hgwp6XF/b8tWjqzIKgKqOjclb1CgKIpkaiYU3jFQQJ4JmZ meTvm27aSLfeemv49/j4OO3atavEZLhX9ZZoHIJwpaMsqFimhe5AIu4WlgV/EI27gmolbRZN VAEvVQrAiUSzbt2HaWpqih566KGOiAZZENyzIJjJ1UZWyIKALAgl+LFk1cp8NFdccTktX76c Hn/8cZozZ05hiwZZEMSYO3lZELKeiJYnJbIgIAtCSbIoU70yomHWzdKlS+myy5bQs88+2xnR IAtCNJe5WRA6JBpkQUAWhDJMUbIuI5rFixfT0NBQ+G/u3LmhQTJ79uzw3+DgIA2Mjl4UOmPE rZL6N/uN/75mzerOiAZZEJLpzN6jV000yIKgZvhEFoSSrGKozojmzju30cDAgPEfq9Io0SAL QhyNX5i8jnw0mRaNgWiQBQFZECrmGq+JxsXz3o9ZEIy42EKJuhKNw+lVtLXjBOXmhObxgeMz Fbr+U2uTmMGyTssWm4sOaGsi41TMqiumesYEcgo5K8fR+SehZotUH6fb6VTFfFBbc14TDbIg ZGVBiBb3/P1RcO/gMDRn4ca6k+kMVhcNsiBMJTnQef7sIgnkArwNmRvSrBCuRGMj8No4oZaG /SYaNuR40tK8gciCEGmCnEReyxygqYtBcZEFgVJSiW0tKUtlQDA7WSaFIgnkRGIXThKTm9uO RONiWdZCCfU06g3R1DO8LrZq27p0UZSsrry93OgBNr6J0NGW0bdBCPKAaCqanFYs4h57SlY0 dR4201vbJgYwiMZDNatTpEpeqqxTwL5v2/G9tZbhBKJp2YRBXCDQRgRANG2cNcgMBFqGQG1E s/X597UMCogLBIBAXQisPf2r9dwMBtHUNWVoFwi0DwEQTfvmDBIDgdYhAKJp3ZRBYCDQPgRA NO2bM0gMBFqHAIim8Sk7k+76zDm0+MdP05s++1xXpPn0zZfQNfRT+oWbD3WlP7866T7ehcb/ Ly+if7yK6I4P/JBuKFTR78Igmprn5wMfGaOPX/zzHMXpvuKDaLpL7IVUzEY04e+z6eG7H6Xf +lahlhst3NdEE5LAG47RJz46RZ+vYxrevpD+fs0wfS9XKUA0lUFvW6RhR93Hu9D4HMZQu94W EtitMIimRqIJFeL0acsWpfuK37MWjcMi7QWiITqP/vrzZxDt+j699StuC73pUt4QDVf+O+gM uubsCJbpx5jfgkIfxtUni98JvozQajiVRjiSL78oWyjG31+gtwlthlXFemqdnwj+DG6l7DpG i69i/b6asS2KCOSsfYoyKG1PP/ZT+t4bztB8NCEeMQ5ExxNT2UwSOlll1WdDNbYRLtKhVB9V HJNF/DJdFij5aFwymiM+H1yOaEx8zg6ECyJaHLxe9J2o/vLv0nwIMn/i+ZFgKzorrphiL4+X /Zw/L8wntpleb2wrbJyPN8gadA3DheORq2/pPNx3+jnp/KlYJu0b8Ob93v0arRT0WsWrbQ8L v4gmWFgJoILiy9/J+9NP33wR0c3ccRZN9NXTnBhU5g9+v/k0+t83R1slowmqbXeUNrmimZRH XDdhuTl0n+jUi+seFhYZXyDigtWUSNyXn2twFipP8tz6wb5e/T3yI5G074/kEhZrMh/Cd/F4 KCGbGKuTU2KM2mbEoH4n+q30J3TY/0j60NBxUudaIIdcRyqXkT/IIpLMHK/4kAnL5elb2nZK DNHYzhIIWcdb0EuOs6BfJj9f27ZPfhGNdBJiUCSH/bU0ARYfiWmymMKtfF45ARIXsoEsjGYp q3Ppa5J15WSNmAhKGvdJmtkstWut/5xCNFlmuPJ9hhNSxtC0DdQXmmr6G7eYytypxCNZHZxY Cmyd0ocRnz238apzbRq/2rZcxrLtMeJsqOM0VqNmNvJlpUQT5EkIMiEQsSwIRV9B0BehSWnN /gzNbBaeBslvypOJoa0TTfpE0mcjfpI7OXjjp6tENBlbKZU81S2MIAi3euSFyZRwhA5zh7ND fTsxRZ1KpJul2NL32UQj+xPkhaNve/igLVtGVSanxZflE1O+z2krW9/MbesPP8XSFZXN2K+B aIwPlEY4xKnTdhMN38YIJGI2KQUCyS2bRQYClt0gGsUS0mZSVDLVcjJYUmp9H4lGsyIVobP9 SsKdk7qJxqpvIJos1qmYaKLcT92yaEykkrt3VRQxa+uUe5mtJNFkmezJHt6p/ZQQv3uBstVz qC8v2iJbJ/0imdxWZxaNi7+hdqJRcTOQll3fHIjGdmLkatE4kaqTsdGVQq0mmuRkgO/RVUdt 8PdfL36B3spPRVxM7XjrIXn5xXYcFnI4cybT1tB2ppOT5NMz2QkZb80uCJzngQP9u5LzM7be cuo7O4MFZ2yEtXD6wsaojaczoomwOpVSpzJrPCDAm4neGt9ediIap7kxOWwNmJkWsvqddjDg QjTxll1yvqvOYJXQ9YeBCzl3hUEcO1GJZnBwIKiZJpNjzSQJ5NgfWdkq+ffdtGi4HyE5Bg58 M3f8eJiu4Xdj1KNI7chT2FKJp0ian0M9abFdwosXiumug9L2gV1P0+FL1Zuquq9IPkZO2x81 +J74XRF+vMxK555qBb+np0Ox5qjt8kXGj3vjYvKxa4dEkxCzcE1BOKVK5ll9bcJABqn/xHa8 LR/Bq8fp2kMsHq/kn1H1LeOwItMSSo7phflxtGjafLzNSGZgYDBElGeudCYa7gjulGgcibFV xdqmDLngtsxUb5WiFBa23Rf2wjzbYWrckF7C/y9ANJG1A6IRtMbJlC+sZc1UANE0g7uh17Zt m9gQ+NaJWzMgmorVyf5SZcUd1tUciKYuZIu1a7xnU6yJJkozotm2bVu4ZeLbpcSScbFomNCi f4b9vXr1ewrfo2li8OizAAIgmgJgoaiKQBbR8O2TdesUNRhd1OP//Y3feDd97cXfBtpAAAgA gRCB9576Zdq+/R7JmhEdwRrRiBaMiKFo1bzznf+MhoeHadYs/vIb0AYCQKBfETh+/DgdO3aM 7rvvL6slGk5G3DmcHolzy6dfIce4gUDvIyBuh0JLRThdSv0z4S/JiZPRonGxaiJyiYgFRNP7 yoURAgGOgEo0EdnIx9maI5j9Pjp6UeiBMW2VlG8lYlGdxFHZqKnIp4MPEAACvYBAfHAUWinq R3L4hoQT35tJK0VEZCIam1Uj/q7eJu4FYDEGIAAE3BDgxMJKZ5FM+FsW0RQhGzeRUAoIAIFe RMBENnycxpvBJhCyLBa7JYP9Uy8qFcbUbwjo2yUTAiLZpCQT2jLhn/8fbOZ0e5NBKGYAAAAA SUVORK5CYII=

--------------UEln60op0cSX066h04nQooA4--

michaelsimp · 2024-10-24T21:15:24Z

Hi
I did a quick set of tests today.

I created the ip_internal_network project from examples and configured as follows:
Set IDF version to 5.3.0
Set target device to ESP32-S3 with jtag integrated debugger
Set partition table Factory partition to 0x400000
Set device flash size to 16MB - matches my ESP32-S3
Set the Router SSID to "VONETS" and Password to "pass9999"
Set Panic Handler to "Print registers and halt"

Clean and Build project.
Load into 2 ESP32-S3 with serial terminals connected for monitoring.
One becomes MESH_ROOT and the other connects as MESH_NODE
Took turns at powering off the MESH_ROOT and watching the other become MESH_ROOT and then powering it back on and it connects as a MESH_NODE. This seemed to work ok today.

But what I did find easy to reproduce was:
Power off MESH_ROOT and power back on BEFORE other MESH_NODE becomes MESH_ROOT.
The original MESH_ROOT I power cycled, becomes MESH_ROOT again, but the MESH_NODE remains disconnected.

See attached files:
MESH_ROOT powered off at line 171
MESH_NODE loses connection around line 33 and never recovers

Also see .elf and .bin files in attachment. I don't know what or where the "Core dump decode file is", but these tests don't show a CPU crash. I cant run under the debugger due to the power cycle tests.

Please see attachment MESH-Testing.zip two comments down

michaelsimp · 2024-10-24T21:56:34Z

I did some more tests which can easily cause CPU crashes.

Power on both nodes, one becomes MESH_ROOT and one MESH_NODE.
Power off WIFI router.
MESH_ROOT crashes. See file attached MESH_ROOT "Router power off.txt" (crash at end)

Second test. Power up only one node, becomes MESH_ROOT
Power off Router
This time MESH_ROOT does not crash
Power on Router
MESH_ROOT does not crash
Power on a second node which connects to the first MESH_ROOT - see line 492
MESH_ROOT crashes.
See file attached "MESH_ROOT crash on MESH_NODE connect.txt"
MESH_ Testing 2.zip

michaelsimp · 2024-10-24T22:02:15Z

MESH-Testing.zip
This is the attachment for the first tests, 2 entries up. It did not upload properly last before

brianignacio5 · 2024-10-29T08:43:53Z

Hi @michaelsimp

The esp-idf vscode extension allows you to save settings in multiple places: User (Global settings for vscode), Workspace and Workspace folder (your project's .vscode/settings.json). The ESP-IDF: Show Examples command shows you the current esp-idf path used in the current vscode window. You can change where to save settings with the ESP-IDF: Select where to save configuration settings command. It sounds confusing but it does allow to use multiple projects each with different esp-idf versions (even at the same time! Using vscode workspace) More information in here

It seems the example you are trying to use have some components with specific behavior in each esp-idf version. So building a v5.2.2 example using esp-idf v5.3 might produce some compilation problems. How about creating an example using esp-idf v5.3 ?

Open a vscode window.
Select esp-idf v5.3 from status bar (recommended) or the ESP-IDF: Configure ESP-IDF extension.
Run the ESP-IDF: Doctor command. Check that esp-idf is indeed using v5.3
Run the ESP-IDF: Show examples. The esp-idf path shown should be v5.3 now
Create your project from esp-idf example and try to build.

We will try to update the Show examples command to show all available esp-idf versions from esp-idf vscode extension to make this easier.

zhangyanjiaoesp · 2024-10-29T08:58:18Z

@michaelsimp The backtrace of the crash issue is here:

xtensa-esp32s3-elf-addr2line -piaf 0x4208c922:0x3fca7e60 0x4201c951:0x3fca7ee0 0x4201f96f:0x3fca7f20 0x4202345e:0x3fca7f50 0x420167bd:0x3fca7f70 -e ip_internal_network.elf

0x4208c922: parse_msg at C:\Users\micha\Documents\Cybertek\Software\ip_internal_network\build/C:/Users/micha/esp/v5.3/esp-idf/components/lwip/apps/dhcpserver/dhcpserver.c:993
 (inlined by) handle_dhcp at C:\Users\micha\Documents\Cybertek\Software\ip_internal_network\build/C:/Users/micha/esp/v5.3/esp-idf/components/lwip/apps/dhcpserver/dhcpserver.c:1190
 (inlined by) handle_dhcp at C:\Users\micha\Documents\Cybertek\Software\ip_internal_network\build/C:/Users/micha/esp/v5.3/esp-idf/components/lwip/apps/dhcpserver/dhcpserver.c:1106
0x4201c951: udp_input at C:\Users\micha\Documents\Cybertek\Software\ip_internal_network\build/C:/Users/micha/esp/v5.3/esp-idf/components/lwip/lwip/src/core/udp.c:404
0x4201f96f: ip4_input at C:\Users\micha\Documents\Cybertek\Software\ip_internal_network\build/C:/Users/micha/esp/v5.3/esp-idf/components/lwip/lwip/src/core/ipv4/ip4.c:746
0x4202345e: ethernet_input at C:\Users\micha\Documents\Cybertek\Software\ip_internal_network\build/C:/Users/micha/esp/v5.3/esp-idf/components/lwip/lwip/src/netif/ethernet.c:186
0x420167bd: tcpip_thread_handle_msg at C:\Users\micha\Documents\Cybertek\Software\ip_internal_network\build/C:/Users/micha/esp/v5.3/esp-idf/components/lwip/lwip/src/api/tcpip.c:174
 (inlined by) tcpip_thread at C:\Users\micha\Documents\Cybertek\Software\ip_internal_network\build/C:/Users/micha/esp/v5.3/esp-idf/components/lwip/lwip/src/api/tcpip.c:148

I think this issue is caused by the mismatch between your IDF version and the version in the example. Please update your version according to Brain's suggestion and test it again.

michaelsimp · 2024-10-30T00:35:35Z

I installed IDF version 5.3.1
I had a problem at step 2 in your instructions to entries up states: "Select esp-idf v5.3 from status bar (recommended) or the ESP-IDF: Configure ESP-IDF extension."
When I try is reports, "Open a folder first."
So I open a folder of a the project I made in version IDF 3.0. Then on the status bar I could select Version 5.3.1
When I run Run the ESP-IDF: Doctor command, I get the following errors:

Extension configuration report has been copied to the clipboard with errors.
Cannot open file ../report.txt. Detail: FIles above 50MB cannot be synchnrozied with extensions.
I checked my report,txt and found it was over 181MB
I tried continuing:
When I select Show examples it only shows 5.3.1 which is good.
I created ip_internal_network, but when completed the status bar reports ESP-IDF v5.2.2 again.
So I tried deleting the large report.txt and trying again.
Same problem, it created a report.txt of 181MB again

brianignacio5 · 2024-10-30T01:10:40Z

Delete this file:

%USERPROFILE%\.vscode\extensions\espressif.esp-idf-extension-VERSION\esp_idf_vsc_ext.log

and try to run ESP-IDF: Doctor command again. Seems that your extension log have been logging a lot and vscode limit.

About the ESP-IDF v5.2.2 again, it is because the newly created project does not set settings when created. You can select the v5.3.0 from status bar again.

Again sorry for this issue, will work to make it easier to use in the next release of esp-idf extension.

michaelsimp · 2024-10-30T01:48:55Z

I need to make some real progress on this so I have completely uninstalled esp-idf and manually deleted all the ESP and espressif folders including all 3 IDF versions.
I have reinstalled ESP-IDF and only IDF version 5.3.1 to remove all doubt.
I will rebuild and test and report
Thanks

michaelsimp · 2024-10-30T02:17:28Z

Hi again
Its hard to tell, but seems as if it might be a little more robust, especially with the router power off and on test.
Attachments.zip
But it still crashes, see files attached including my .elf

"Fail 1.txt" is taken from the Mesh_Root
Line 1458 Mesh_Node disconnected
Line 1495 Mesh_Node reconnects
Line 1496 crash

"Fails 2.txt" is taken from the Mesh_Node
Line 1789 Mesh_Root disconnected
Line 1822 Reconnect
Line 1832 Crash divide by zero

zhangyanjiaoesp · 2024-10-30T03:55:45Z

It's weird, I have tested it multiple times as you said (the following two cases) and it can connect normally without any crashing issues.

Power on both nodes, one becomes MESH_ROOT and one MESH_NODE. Power off WIFI router. MESH_ROOT crashes. See file attached MESH_ROOT "Router power off.txt" (crash at end)

Second test. Power up only one node, becomes MESH_ROOT Power off Router This time MESH_ROOT does not crash Power on Router MESH_ROOT does not crash Power on a second node which connects to the first MESH_ROOT - see line 492 MESH_ROOT crashes.

I'm using the Github IDF, and I will try to test with the vscode extension

michaelsimp · 2024-10-30T05:06:50Z

FYI I use vscode for coding and building and sometimes JTAG debugging Most of the time in testing I am am monitoring serial com port using Putty terminals on com ports Thanks On 30/10/2024 4:56 pm, ZYJ wrote: It's weird, I have tested it multiple times as you said (the following two cases) and it can connect normally without any crashing issues. Power on both nodes, one becomes MESH_ROOT and one MESH_NODE. Power off WIFI router. MESH_ROOT crashes. See file attached MESH_ROOT "Router power off.txt" (crash at end) Second test. Power up only one node, becomes MESH_ROOT Power off Router This time MESH_ROOT does not crash Power on Router MESH_ROOT does not crash Power on a second node which connects to the first MESH_ROOT - see line 492 MESH_ROOT crashes. I'm using the Github IDF, and I will try to test with the vscode extension — Reply to this email directly, view it on GitHub, or unsubscribe. You are receiving this because you were mentioned.Message ID: ***@***.***> [ { ***@***.***": "http://schema.org", ***@***.***": "EmailMessage", "potentialAction": { ***@***.***": "ViewAction", "target": "#14720 (comment)", "url": "#14720 (comment)", "name": "View Issue" }, "description": "View this Issue on GitHub", "publisher": { ***@***.***": "Organization", "name": "GitHub", "url": "https://github.com" } } ]

zhangyanjiaoesp · 2024-10-30T08:28:38Z

@michaelsimp

Are you using the completely unmodified example code during your testing?

michaelsimp · 2024-10-30T19:38:58Z

Yes. I did not change anything except:

set target device to ESP32-S3 - Internal JTAG debug - What device are you testing with? Could this be a factor?
change partition table, factory partition size to 0x400000
Using vscode GUI menuconfig:
- change device flash size to 16MB
- set WiFi router SSID to VONETS and password to "pass9999"

See my source files attached where you can see most are untouched with the original install date 30/10/24 02:13pm NZ time.
Source.zip

michaelsimp · 2024-10-30T21:17:06Z

FYI I use vscode for coding and building and sometimes JTAG debugging
Most of the time in testing I am am monitoring serial com port using Putty terminals on com ports

In addition to the crashing, sometimes a MESH_NODE will not reconnect to the MESH network. As a work around for this, I want to stop and restart the wifi mesh network? No matter what I try, my nodes intermittently crash on restart? Is this perhaps related?

I am hoping you could please answer a few questions to help me.

What are the recommended steps to stop and then restart the WiFi Mesh network? I currently have :

    ESP_ERROR_CHECK(esp_mesh_stop());
    ESP_ERROR_CHECK(esp_mesh_deinit());

but this causes a lot of error logging which stops if I add ...
ESP_ERROR_CHECK(mesh_netifs_destroy());

My restart is as follow:

void wifiMeshStart() {
    ESP_LOGW(TAG, "Wifi Mesh switch on");
    /*  mesh initialization */
    ESP_ERROR_CHECK(esp_mesh_init());
    ESP_ERROR_CHECK(esp_mesh_set_max_layer(CONFIG_MESH_MAX_LAYER));
    ESP_ERROR_CHECK(esp_mesh_set_vote_percentage(1));
    ESP_ERROR_CHECK(esp_mesh_set_ap_assoc_expire(10));
    /* set blocking time of esp_mesh_send() to 30s, to prevent the esp_mesh_send() from permanently for some reason */
    ESP_ERROR_CHECK(esp_mesh_send_block_time(30000));
    mesh_cfg_t cfg = MESH_INIT_CONFIG_DEFAULT();
#if !MESH_IE_ENCRYPTED
    cfg.crypto_funcs = NULL;
#endif
    /* mesh ID */
    memcpy((uint8_t *) &cfg.mesh_id, MESH_ID, MAC_SIZE);
    /* router */
    cfg.channel = CONFIG_MESH_CHANNEL;

    cfg.router.ssid_len = strlen(meshProvisionData.ssid);
    memcpy((uint8_t *) &cfg.router.ssid, meshProvisionData.ssid, cfg.router.ssid_len);
    memcpy((uint8_t *) &cfg.router.password, meshProvisionData.password, strlen(meshProvisionData.password));
    
    ESP_ERROR_CHECK(esp_mesh_set_ap_authmode((wifi_auth_mode_t) CONFIG_MESH_AP_AUTHMODE));
    cfg.mesh_ap.max_connection = CONFIG_MESH_AP_CONNECTIONS;
    cfg.mesh_ap.nonmesh_max_connection = CONFIG_MESH_NON_MESH_AP_CONNECTIONS;
    memcpy((uint8_t *) &cfg.mesh_ap.password, CONFIG_MESH_AP_PASSWD, strlen(CONFIG_MESH_AP_PASSWD));
    ESP_ERROR_CHECK(esp_mesh_set_config(&cfg));
    /* mesh start */
    ESP_ERROR_CHECK(esp_mesh_start());
    ESP_LOGI(TAG, "WiFi Mesh started successfully");
}

I notice with my custom application:
I have 1 MESH_ROOT and 5 MESH_NODEs spread around the office. Mesh_Node seem to connect to parents not based on the RSSI as documented.

The IDF documentation states "To prevent nodes from forming a weak upstream connection, ESP-WIFI-MESH implements an RSSI threshold mechanism for beacon frames." Is this configurable and if so where? I cant find it in the API or in MenuConfig. What is the default RSSI threshhold value?

The IDF documentation states in Preferred Parent Node "The preferred parent node is determined based on the following criteria: Which layer the parent node candidate is situated on. The number of downstream connections (child nodes) the parent node candidate currently has".
Does this mean RSSI is not part of the parent selection process?

Is it recommend to use self-organized networking or for serious applications should I manually build the mesh network? I will only have a max of 10 mesh nodes altogether but they do a reasonable amount of MQTT5 communications to the cloud.

zhangyanjiaoesp · 2024-10-31T02:43:12Z

@michaelsimp The following are the answers for your questions:

What are the recommended steps to stop and then restart the WiFi Mesh network? I currently have :
```
    ESP_ERROR_CHECK(esp_mesh_stop());
    ESP_ERROR_CHECK(esp_mesh_deinit());
```
Call esp_mesh_stop() is enough.
but this causes a lot of error logging which stops if I add ...
ESP_ERROR_CHECK(mesh_netifs_destroy());
Where did you add the mesh_netifs_destory() function? What does the error log look like? Can you provide an example？
Where did you call the wifiMeshStart() function?
I have 1 MESH_ROOT and 5 MESH_NODEs spread around the office. Mesh_Node seem to connect to parents not based on
the RSSI as documented.

RSSI is not the only criterion for selecting the parent node, the layer and connections also need to be considered.
The IDF documentation states "To prevent nodes from forming a weak upstream connection, ESP-WIFI-MESH implements an RSSI threshold mechanism for beacon frames." Is this configurable and if so where? I cant find it in the API or in MenuConfig. What is the default RSSI threshhold value?

You can call this API:

esp-idf/components/esp_wifi/include/esp_mesh_internal.h

Line 216 in 9106c43

esp_err_t esp_mesh_set_rssi_threshold(const mesh_rssi_threshold_t *threshold);
The IDF documentation states in Preferred Parent Node "The preferred parent node is determined based on the following criteria: Which layer the parent node candidate is situated on. The number of downstream connections (child nodes) the parent node candidate currently has". Does this mean RSSI is not part of the parent selection process?

Same to the fourth point, selecting parent need to consider RSSI, layer and connections, the doc need to be updated.
Is it recommend to use self-organized networking or for serious applications should I manually build the mesh network? I will only have a max of 10 mesh nodes altogether but they do a reasonable amount of MQTT5 communications to the cloud.

You can use self-organized network.

zhangyanjiaoesp · 2024-10-31T03:21:39Z

@michaelsimp I can reproduce the crash using the vscode, I will check the difference between VSCode and standard IDF

michaelsimp · 2024-10-31T04:36:49Z

Hi Zhangyanjiaoesp
This excellent news for me. Hopefully it is just something simple you will find soon and be able to offer me a fix.
Thank you

michaelsimp · 2024-10-31T05:26:15Z

Hi Zhangyanjiaoesp

Thanks so much for taking the time to answer all my questions.

Please note THESE tests are with MY application (NOT with example program ip_internal_network) running on a network of 6 nodes - all ESP32-S3. My application has a CLI console integrated so I can trigger actions and see the responses on the COM port.

Q1 I will go back to just calling esp_mesh_stop() and see what happens.
To restart should I just be able to call ESP_ERROR_CHECK(esp_mesh_start());

Q2 Triggered from the CLI Console I was calling:

    ESP_ERROR_CHECK(esp_mesh_stop());
    ESP_ERROR_CHECK(esp_mesh_deinit());
    ESP_ERROR_CHECK(mesh_netifs_destroy());

Q3 wifiMeshStart() is also called from my CLI console

The CLI Console is started from my Mainline as is my WiFi Mesh application (built on top of the ip_internal_network source).
Triggering the Mesh Stop and Start would be called from the CLI Console thread. I am assuming this is ok and does not need any mutex protection. Please advise how I should call it if this is a problem.

Q4 I understand this

Q5 Thanks

Q6 Thanks for clarifying this but I am not finding this to be the case. I distributed some MESH_Nodes across the office with the aim of creating a multi-hop network between the far extremes. But it does not form as expected or at all well for healthy RSSI. I have nodes which are close to my Mesh_Root or 2nd layer Mesh_Nodes which are not at parent capacity numbers. When my other Mesh_Nodes do connect to these parents they provide an RSSI on the child to the parent of -35dBm. But they most frequently want connect to nodes a much longer distance away getting a RSSI of < -70dBm.

I read somewhere the default RSSI threshold is -120dBm, but I am finding nodes with RSSIs < -70dBm often lose their MQTT connection to the broker. I have an office environment and I have located the nodes approx 10 to 20 meters apart with a max of one wall between but they are not all line of site. I very much doubt I could even get a connection at RSSI less than -100dBm. I am thinking it may be a signal to noise ratio issue so I have scanned the office for WiFi channel usage and selected channel 1 on my WIFI Router as nothing else is using this channel and no other channels overlap. I know this is not an easy question to answer with precision and I appreciate the many influences, but realistic what is the ballpark min RSSI range at which I can expect a node to work reliably at what sort of distance range.

Q7 Because of my Q6 response above, I have started evaluating the example project "manual_networking" to make my MESH_NODES manually scan and select MESH_NODES with the healthiest RSSI. It sounds like you are saying the IDF framework should already be doing this?
So I am now wondering if the vscode crashing issue is also causing this to not work properly and your fix might fix both.
Should I put manual scanning and parent selection changes to one side and wait for the outcome of the vscode crashing?
I guess I would prefer to use the self-organized network as much as possible, if it works as you describe.
Please advise / confirm manual scanning and parent selection should not be required and I might need you to look at the node parent selection for healthy RSSI next.

Best regards

zhangyanjiaoesp · 2024-10-31T06:41:32Z

@michaelsimp

According to the backtrace of the crash issue, it seems related to DHCP, it won't affect the mesh networking.
I'm sorry I didn't quite understand your question regarding the selection of the parent node. Can you draw a picture to explain it? For example, where are nodes A, B, and C located? What level? How many child nodes are connected below? What is the RSSI of A, B, and C scanning each other? Do you expect A to connect to B but actually connect A to C?

michaelsimp · 2024-10-31T07:34:50Z

See attached:
MeshMaps.zip

"Target Network .png" shows the walls as black lines and nodes as circles. The blue lines are approx what I was expecting to see.

It forms very randomly but with bad choices like the file "Actual example Network.png" with links and dBm in red.

My project is configured for up to 50 nodes and 3 children per node as I wanted to force some layers.

michaelsimp · 2024-10-31T08:50:14Z

"According to the backtrace of the crash issue, it seems related to DHCP, it won't affect the mesh networking."

That may be the case with the crash issue you found, but the root cause of the vscode IDF environment may cause more than one issue. I guess you will know better when you get to the bottom of the vscode IDF environment issue

Do the diagrams I sent help you understand my issue better now?

zhangyanjiaoesp · 2024-10-31T10:04:38Z

Yes, I now understand your question. Once the ROOT node is formed, the chances for other idle nodes to connect to the root node are the same; as long as they can scan the root node within the RSSI threshold range, they can connect and become second-layer nodes. Therefore, it is reasonable for C and D to connect to A and become second-layer nodes. What is the RSSI that B, E, and F receive from A? Since each node can connect to 3 child nodes, if they are within the RSSI threshold range and root A is not yet fully connected, at least one of B, E, or F should be able to connect to A.

I think you can call esp_mesh_set_rssi_threshold() to limit the RSSI threshold for optional parents, which would allow nearby nodes to connect as much as possible. However, nodes D and E are too far from node A. If you set the same RSSI threshold for all nodes, the connection results might still not meet your expectations, unless you configure different RSSI thresholds for each node. Alternatively, you could only call the esp_mesh_set_parent() function to specify the parent of each node.

michaelsimp · 2024-11-13T23:28:03Z

Hi Zhangyanjiaoesp

Note 1: This makes sense, thanks implemented.

Note 2:
My thinking was if I previously called ESP_ERROR_CHECK(esp_mesh_set_self_organized(false, false)); to scan for a closer parent, I don't have a self organized network, so I need to scan for a parent when the node gets disconnected. Is this not correct?

The second block you pointed out was taken from example project manual_networking inside function mesh_scan_done_handler(). In the else of parent_found (so !parent_found) it calls esp_wifi_scan_stop() and esp_wifi_scan_start() without esp_mesh_set_self_organized(false, false);
I have added esp_mesh_set_self_organized(false, false); before the esp_wifi_scan_stop() as per other instances of this.

Note 3: My thinking was to return the network to self organized again after the scan for better Parent. I will remove it soon (don't want to make too many changes at once) if it is not causing a problem for now. Please confirm.

I am not sure what the overall status of this is from your side, please advise:

Could you reproduce the broken mesh problem?
Did you find any problems in the library - I saw only a minor change to stack size in esp_task.h
Were there problems in my original main_mesh.c from internal_networking other than your test code
Is there a specific problem with my code in addition to not call esp_mesh_set_self_organized(false, false); before a mesh scan?

I did some tests. It is much better at establishing a MESH_ROOT when the MESH_ROOT reboots or is powered off. But I still have a few problems:
14Nov.zip

Test1 root.txt: My Mesh_ROOT crashed line 395 Guru Meditation Error

Test 2 MESH_NODE.txt: See notes at top of file. After Wifi Scan for a new parent, Became a MESH_IDLE with a child node

Why does a node become a MESH_IDLE when there are MESH_ROOT and MESH_NODES very close (around -35dBM) which are well below capacity? How do I fix them? Another wifi scan request does not fix it as you can see in the log.

Why does a MESH_NODE connect to a MESH_IDLE when there are MESH_ROOT and MESH_NODEs available with strong RSSIs? Surely this should not be allowed. This MESH_NODE was otherwise healthy and so when I rebooted its parent which was MESH_IDLE, it connected to another MESH_NODE successfully.

I started looking though your many changes to internal_networking mesh_main file and found some changes which I think are important. eg in event <MESH_EVENT_PARENT_DISCONNECTED>

Would it be possible for you to highlight any other important changes I need to merge into my application

ps I also has a node assert reboot on esp_mesh_set_parent(&parent, (mesh_addr_t *)&parent_assoc.mesh_id, my_type, my_layer)
It looks like ESP_ERROR_CHECK is not a good strategy. I don't know which argument could be wrong?
Any advice?

ESP_ERROR_CHECK failed: esp_err_t 0x4008 (ESP_ERR_MESH_ARGUMENT) at 0x4200ff9d
file: "./main/platform/espidf/wifi/wifiMesh.cpp" line 342
func: void findClosestParent(int)
expression: esp_mesh_set_parent(&parent, (mesh_addr_t *)&parent_assoc.mesh_id, my_type, my_layer)

abort() was called at PC 0x4037e45f on core 0

Backtrace: 0x40375f06:0x3fcd2db0 0x4037e469:0x3fcd2dd0 0x40386f5d:0x3fcd2df0 0x4037e45f:0x3fcd2e60 0x4200ff9d:0x3fcd2e90 0x420105ea:0x3fcd3150 0x42129bd2:0x3fcd31f0 0x4212a251:0x3fcd3230 0x4212a368:0x3fcd3280 0x4037f062:0x3fcd32a0


ELF file SHA256: ea203e8cd

michaelsimp · 2024-11-14T02:27:25Z

Also the change above adding to event <MESH_EVENT_PARENT_DISCONNECTED>

        if (!esp_mesh_get_self_organized()) {
            printf(">>>%d, set true, true\n",__LINE__);
            esp_mesh_set_self_organized(true, true); // vote a new root 
        }

...undoes my switch to a close parent after manual scan.

michaelsimp · 2024-11-14T02:33:50Z

After more testing and switching back modifications, I have the following summary:

I think you can ignore the earlier crash reports I posted today. I was working on something else and enabled PSRAM and I think things got a little cludgy and slow. OTA HTTPS started failing part way through. I have disabled PSRAM again and everything seems reliable again. I was not using it yet and had it configured like this, so I didn't think it would have any effect.

Anyway will put PSRAM on hold until later in another ticket after more tests on my side.

In event <MESH_EVENT_PARENT_DISCONNECTED> if I have this code, the networks establishes itself again reliably and everything looks quite stable. BUT: It immediately undoes my switch to a close parent after manual scan.

    case MESH_EVENT_PARENT_DISCONNECTED: {
        mesh_event_disconnected_t *disconnected = (mesh_event_disconnected_t *)event_data;
        ESP_LOGI(TAG, "<MESH_EVENT_PARENT_DISCONNECTED>reason:%d", disconnected->reason);
        mesh_layer = esp_mesh_get_layer();
        mesh_netifs_stop();
        wifiConnected = false;
        ESP_LOGW(TAG, "WiFi Disconnected");
        currentRSSI = NO_RSSI;

        printf(">>>last layer = %d, layer = %d\n", last_layer, mesh_layer);

        if (!esp_mesh_get_self_organized()) {
            printf(">>>%d, set true, true\n",__LINE__);
            esp_mesh_set_self_organized(true, true); // vote a new root 
        }
    }
    break;

Whereas if I have this code, the MESH_NODE parent switch works nicely, but the mesh breaks after the MESH_ROOT reboots or powers off.

    case MESH_EVENT_PARENT_DISCONNECTED: {
        mesh_event_disconnected_t *disconnected = (mesh_event_disconnected_t *)event_data;
        ESP_LOGI(TAG, "<MESH_EVENT_PARENT_DISCONNECTED>reason:%d", disconnected->reason);
        mesh_layer = esp_mesh_get_layer();
        mesh_netifs_stop();
        wifiConnected = false;
        ESP_LOGW(TAG, "WiFi Disconnected");
        currentRSSI = NO_RSSI;

        printf(">>>last layer = %d, layer = %d\n", last_layer, mesh_layer);

        if (disconnected->reason == WIFI_REASON_ASSOC_TOOMANY) {
            esp_mesh_set_self_organized(false, false);
            esp_wifi_scan_stop();
            scan_config.show_hidden = 1;
            scan_config.scan_type = WIFI_SCAN_TYPE_PASSIVE;
            esp_wifi_scan_start(&scan_config, 0);
        }
    }
    break;

zhangyanjiaoesp · 2024-11-14T07:20:50Z

@michaelsimp
The following answers your question：

The patch I provided is the calling method I believe to be correct. I was unable to reproduce your issue locally, as the devices were relatively close to each other during my testing. I cannot deploy my test setup in the same way you have. However, I did test the scenario of whether other nodes can correctly elect a new root after a root power off, and the result was successful.

The second block you pointed out was taken from example project manual_networking inside function mesh_scan_done_handler(). In the else of parent_found (so !parent_found) it calls esp_wifi_scan_stop() and esp_wifi_scan_start() without esp_mesh_set_self_organized(false, false);
I have added esp_mesh_set_self_organized(false, false); before the esp_wifi_scan_stop() as per other instances of this.

In the manual_networking example, the esp_mesh_set_self_organized(false, false) API is called only once throughout the entire process. It ensures that the network remains non self-organized throughout. However, in your project, the esp_mesh_set_self_organized(false, false) , esp_mesh_set_self_organized(true, false), esp_mesh_set_self_organized(true, true) are all called. To ensure that the self-organized network is disabled during the user's scan, it is necessary to call esp_mesh_set_self_organized(false, false). See the doc

It looks like ESP_ERROR_CHECK is not a good strategy. I don't know which argument could be wrong?
Any advice?

Unless the return value check is critical, please avoid calling ESP_ERROR_CHECK, as it may cause the program to abort. This is something we discussed previously. Regarding the esp_mesh_set_parent() issue, could you provide how you have set the parameters? I can help check why the parameter error is occurring.

The crash issue is caused by the error check you added when calling the esp_mesh_scan_get_ap_ie_len() function. After the AP is retrieved, it returns -1.

In event <MESH_EVENT_PARENT_DISCONNECTED> if I have this code, the networks establishes itself again reliably and everything looks quite stable. BUT: It immediately undoes my switch to a close parent after manual scan.

Yes, I’ve noticed that as well. I think I need to find a better solution.

Note 3: My thinking was to return the network to self organized again after the scan for better Parent. I will remove it soon (don't want to make too many changes at once) if it is not causing a problem for now. Please confirm.

I still think this part of the code is unnecessary. Returning to the self-organized network after finding a better parent doesn’t seem meaningful, especially since you need to periodically scan for nearby APs, and disable the self-organized network during scanning. As I understand it, you only need the self-organized network when selecting the root; at other times, you prefer to actively choose a better parent, correct?

I will check the log you sent and get back to you.

zhangyanjiaoesp · 2024-11-14T08:23:47Z

@michaelsimp

I did some tests. It is much better at establishing a MESH_ROOT when the MESH_ROOT reboots or is powered off. But I still have a few problems: 14Nov.zip

Test1 root.txt: My Mesh_ROOT crashed line 395 Guru Meditation Error

Need the elf file when the crash occurred to check the backtrace. By the way, have you added all the following changes when you testing?

wifi_lib_s3_1104.zip
0001-fix-dhcp-add-debug-log-for-dhcp-server.zip

Test 2 MESH_NODE.txt: See notes at top of file. After Wifi Scan for a new parent, Became a MESH_IDLE with a child node

The device is idle because it has not yet connected to the network. Its child should return parent idle error.

michaelsimp · 2024-11-17T01:09:27Z

Hi Zhangyanjiaoesp

I'm really a bit lost now as to where I am and what to do next.

Yes I have applied all the patches you sent me and done a clean build.

I would like to park the Guru Meditation Errors for now as they are infrequent and focus on my main problem which is how to recover after the MESH_ROOT is rebooted.

In event <MESH_EVENT_PARENT_DISCONNECTED> if I have the code below, the networks establishes itself again reliably and everything looks quite stable. BUT: It immediately undoes my switch to a close parent after manual scan.

    case MESH_EVENT_PARENT_DISCONNECTED: {
        mesh_event_disconnected_t *disconnected = (mesh_event_disconnected_t *)event_data;
        ESP_LOGI(TAG, "<MESH_EVENT_PARENT_DISCONNECTED>reason:%d", disconnected->reason);
        mesh_layer = esp_mesh_get_layer();
        mesh_netifs_stop();
        wifiConnected = false;
        ESP_LOGW(TAG, "WiFi Disconnected");
        currentRSSI = NO_RSSI;

        printf(">>>last layer = %d, layer = %d\n", last_layer, mesh_layer);

        if (!esp_mesh_get_self_organized()) {
            printf(">>>%d, set true, true\n",__LINE__);
            esp_mesh_set_self_organized(true, true); // vote a new root 
        }
    }
    break;

Whereas this, the MESH_NODE parent switch works nicely, but the mesh breaks after the MESH_ROOT reboots or powers off.

    case MESH_EVENT_PARENT_DISCONNECTED: {
        mesh_event_disconnected_t *disconnected = (mesh_event_disconnected_t *)event_data;
        ESP_LOGI(TAG, "<MESH_EVENT_PARENT_DISCONNECTED>reason:%d", disconnected->reason);
        mesh_layer = esp_mesh_get_layer();
        mesh_netifs_stop();
        wifiConnected = false;
        ESP_LOGW(TAG, "WiFi Disconnected");
        currentRSSI = NO_RSSI;

        printf(">>>last layer = %d, layer = %d\n", last_layer, mesh_layer);

//        if (!esp_mesh_get_self_organized()) {
//            printf(">>>%d, set true, true\n",__LINE__);
//            esp_mesh_set_self_organized(true, true); // vote a new root 
//        }
    }
    break;

You acknowledged this replying "Yes, I’ve noticed that as well. I think I need to find a better solution."

My outstanding questions are:

Are you working on the better solution (immediate above). I don't mean to sound impatient, I just want to ensure we are on the same page.
The internal_communication\main\mesh_main.c you sent me had many changes to compare. Is this just adding the parent switch code, or are there other changes here which I need to apply to my application in addition to the 3 notes you gave me.
Why does a node become a MESH_IDLE when there are MESH_ROOT and MESH_NODES very close (around -35dBM) which are well below capacity? How do I fix them? Another wifi scan request does not fix it as you can see in the log.

Your replied "The device is idle because it has not yet connected to the network. Its child should return parent idle error."

But my problem is it stays this way. Is there something I should do after a node goes MESH_IDLE, or should it fix itself?

Also when a node becomes MESH_IDLE and it has children (like the example above) why doesn't it drop it's children or why don't the children disconnect themselves and look for a proper connection?

Are these symptoms of the system breaking after the manual scan and set better parent followed by the MESH_ROOT reboot?
Is this something you are still working on?
Or do I have to manage this and if so, how?

zhangyanjiaoesp · 2024-11-18T03:52:42Z

2. The internal_communication\main\mesh_main.c you sent me had many changes to compare. Is this just adding the parent switch code, or are there other changes here which I need to apply to my application in addition to the 3 notes you gave me.

If you had carefully reviewed my changes, you would notice that most of the code has been transplanted from your wifiMesh.cpp file. The three points I raised are the main differences between my code and yours, and they are the changes I suggest you should make.

Are you working on the better solution (immediate above). I don't mean to sound impatient, I just want to ensure we are on the same page.

I am looking into this issue, but I also need to handle other higher-priority tasks, so sometimes I can't respond to you quickly. Additionally, local testing and analyzing logs from multiple devices is quite time-consuming.

Regarding the mesh idle issue, we first need to understand how it occurs in order to find a solution. Could you provide the code you're currently using for testing? I need to confirm what code your current test results are based on.

michaelsimp · 2024-11-18T05:30:22Z

Hi Zhangyanjiaoesp

I did spend time analyzing the changes you made to mesh_main.c with a full compare using winmerge. I just wanted to check with you that I hadn't missed anything important.

I fully understand you have other tasks and cannot respond immediately. The purpose of my question was just to check if you were still planning to help me resolve this.

See my source attached.
wincut2.zip

I have commented out the esp_mesh_set_self_organized(true, false) per your advice.
I have commented out the task which automatically searches for close nodes as I found it better to control this manually from my console using the command "wifi scan"
My event <MESH_EVENT_PARENT_DISCONNECTED> has the following code per your update. This stabilizes the mesh network when the MESH_ROOT reboots, but immediately undoes my manual switch to a closer parent.

        printf(">>>last layer = %d, layer = %d\n", last_layer, mesh_layer);

        if (!esp_mesh_get_self_organized()) {
            ESP_LOGW(TAG, "Reverting to Self Organised");
            printf(">>>%d, set true, true\n",__LINE__);
            esp_mesh_set_self_organized(true, true); // vote a new root 
        }

I really do appreciate your assistance.

1. fix(wifi/pm): Fixed the tbtt interval update error when AP's beacon interval changed Closes #14720 2. fix(wifi/mesh): Enlarge the mesh TX task stack 3. fix(wifi/espnow): Added check for espnow type and length on v1.0 4. fix(wifi/mesh): Fixed delete group id error in wifi mesh Closes #14735

zhangyanjiaoesp · 2024-11-19T07:29:53Z

@michaelsimp

Are you currently testing on v5.3.0? I recommend updating to v5.3.1 for testing, as v5.3.1 fixes the issue of the infinite loop for the [mesh_schedule.c,3130] [WND-RX]max_wnd:2, 1200 ms timeout, seqno:1231, xseqno:579, no_wnd_count:0 log.
please use this change and test again.
Did you disable the Wi-Fi logs during your testing? I didn’t see any log entries like the one below in your logs. Please enable the Wi-Fi logs.

I (18854) wifi:new:<5,1>, old:<5,1>, ap:<5,1>, sta:<5,1>, prof:5, snd_ch_cfg:0x0
I (18854) wifi:state: init -> auth (0xb0)
I (18874) wifi:state: auth -> assoc (0x0)
I (18884) wifi:state: assoc -> run (0x10)

1. fix(wifi/pm): Fixed the tbtt interval update error when AP's beacon interval changed Closes #14720 2. fix(wifi/mesh): Enlarge the mesh TX task stack 3. fix(wifi/espnow): Added check for espnow type and length on v1.0 4. fix(wifi/mesh): Fixed delete group id error in wifi mesh Closes #14735

zhangyanjiaoesp · 2024-11-19T08:13:41Z

@michaelsimp
I would like to discuss the rules for selecting a better parent with you. When choosing a better parent, do you only consider the RSSI value, or do you also take into account layer, assoc, and RSSI? If it’s the latter, which factor do you prioritize the most?

During my testing, I observed the following phenomenon:
Node A initially connects to the root node, becoming a layer2 node. Therefore, its parent’s layer should be 1, i.e., parent_assoc.layer = 1. However, parent_assoc is defined in the findClosestParent() function and is initially set to 6.

void findClosestParent(int num) { // after a WiFi scan
    ESP_LOGW(TAG, "findClosestParent  Current RSSI: %d", currentRSSI);
    int i;
    int ie_len = 0;
    mesh_assoc_t assoc;
    mesh_assoc_t parent_assoc = { .layer = CONFIG_MESH_MAX_LAYER, .rssi = -120 };
    wifi_ap_record_t record;
    wifi_ap_record_t parent_record = { 0, };
    parent_record.rssi = currentRSSI; // has to be better than current RSSI to change parent

If Node A scans an AP with a layer < 6 and an RSSI stronger than its current parent’s RSSI, it will adopt the new AP as a better parent. In my case, Node A found Node B, a layer2 node, as a better parent, and became a layer3 node.

I (18614) mesh_main: <MESH_EVENT_SCAN_DONE>number:2
W (18614) aWifiMesh: findClosestParent  Current RSSI: -6
I (18624) aWifiMesh: <MESH>[0]ESPM_A5B180, layer:2/4, assoc:0/2, 1, 34:85:18:a5:b1:81, channel:5, rssi:0, ID<77:77:77:77:77:76><IE Unencrypted>
>>>layer: 2,6, layer2_cap:1/0
W (18634) aWifiMesh: Closer Parent found: ESPM_A5B180  RSSI: 0
I (18644) aWifiMesh: <MESH>[1]ESPM_E0F6C0, layer:1/5, assoc:2/2, 0, 7c:df:a1:e0:f6:c1, channel:5, rssi:-6, ID<77:77:77:77:77:76><IE Unencrypted>
W (18654) aWifiMesh: <PARENT>ESPM_A5B180, layer:2/4, assoc:0/2, 1, 34:85:18:a5:b1:81, channel:5, rssi:0
I (18664) mesh: [IO]disable self-organizing<reconnect>
I (18674) wifi:state: run -> init (0x0)
I (18684) wifi:pm stop, total sleep time: 0 us / 12567464 us

I (18684) wifi:<ba-del>idx:0, tid:5
I (18684) wifi:new:<5,0>, old:<5,1>, ap:<5,1>, sta:<5,1>, prof:5, snd_ch_cfg:0x0
W (18704) wifi:<MESH AP>adjust channel:5, secondary channel offset:1(40U)
I (18704) wifi:Total power save buffer number: 16
W (18714) wifi:Password length matches WPA2 standards, authmode threshold changes from OPEN to WPA2
I (18754) mesh: [MANUAL]connect to parent:ESPM_A5B180, 34:85:18:a5:b1:81[layer:2], ID:77:77:77:77:77:76<>
I (18934) mesh_main: <MESH_EVENT_PARENT_CONNECTED>layer:2-->3, parent:34:85:18:a5:b1:81, ID:77:77:77:77:77:76

However, this actually resulted in a worse network condition for Node A, because the network path became longer by transitioning from a layer2 node to a layer3 node.

If I move the definition of parent_assoc outside the findClosestParent() function and assign it a value in the MESH_EVENT_PARENT_CONNECTED event, it would solve the issue described above.

    case MESH_EVENT_PARENT_CONNECTED: {
        mesh_event_connected_t *connected = (mesh_event_connected_t *)event_data;
        esp_mesh_get_id(&id);
        mesh_layer = connected->self_layer;
        memcpy(&mesh_parent_addr.addr, connected->connected.bssid, 6);
        parent_assoc.layer = mesh_layer - 1;

However, it would introduce a new problem: a layer 2 node would never switch its parent because its parent is the root node (layer = 1), and no other node has a layer smaller than 1.

michaelsimp · 2024-11-20T06:25:47Z

Hi

In response to your 2nd to last post.

Q 1. Yes I am using 5.3.1. I completed deleted earlier SDKs 5.3.0 and 5.2 a few weeks ago to be 100% sure.

Q 2. I have tried this but no success.

    case MESH_EVENT_PARENT_DISCONNECTED: {
        mesh_event_disconnected_t *disconnected = (mesh_event_disconnected_t *)event_data;
        ESP_LOGI(TAG, "<MESH_EVENT_PARENT_DISCONNECTED>reason:%d", disconnected->reason);
        mesh_layer = esp_mesh_get_layer();
        mesh_netifs_stop();
        wifiConnected = false;
        connectionStatusLed(Provisioned); // update LED flash pattern
        ESP_LOGW(TAG, "WiFi Disconnected");
        currentRSSI = NO_RSSI;

        printf(">>>last layer = %d, layer = %d\n", last_layer, mesh_layer);
    
        if (disconnected->reason == WIFI_REASON_BEACON_TIMEOUT) {
            printf(">>>%d, set true, true\n",__LINE__);
            ESP_LOGW(TAG, "Reverting to Self Organised");
            esp_mesh_set_self_organized(true, true); // vote a new root 
        }
    }
    break;

I did some work around with the disconnect->reason codes. These are named but not described so its hard to know what they mean and how to apply them. Some of them are not even defined eg 100 & 101

On Node reboot during shutdown I get disconnect->reason codes:
8 = WIFI_REASON_ASSOC_LEAVE

On Node startup I get disconnect->reason codes:
101 = not defined
100 = not defined

I found when the MESH_ROOT is lost I get disconnect->reason codes:
209 = WIFI_REASON_SA_QUERY_TIMEOUT
101 = not defined
then it scans
and after scan sometimes I get
2 = WIFI_REASON_AUTH_EXPIRE
105 = not defined

When I manually switch to a closer parent I get disconnect->reason codes:
8 = WIFI_REASON_ASSOC_LEAVE
201= WIFI_REASON_NO_AP_FOUND
sometimes
206=WIFI_REASON_AP_TSF_RESET

On event <MESH_EVENT_PARENT_DISCONNECTED> I thought I could test if not codes 8, 201 and 206 then esp_mesh_set_self_organized(true, true); but the reason codes are inconsistent

        // if (disconnected->reason == WIFI_REASON_BEACON_TIMEOUT) {
        //     printf(">>>%d, set true, true\n",__LINE__);
        //     ESP_LOGW(TAG, "Reverting to Self Organised");
        //     esp_mesh_set_self_organized(true, true); // vote a new root 
        // }
        if ((disconnected->reason != WIFI_REASON_ASSOC_LEAVE) && (disconnected->reason != WIFI_REASON_NO_AP_FOUND) && (disconnected->reason != WIFI_REASON_AP_TSF_RESET)) {
            printf(">>>%d, set true, true\n",__LINE__);
            ESP_LOGW(TAG, "Reverting to Self Organised");
            esp_mesh_set_self_organized(true, true); // vote a new root 
        }

I don't feel comfortable with this solution unless it is endorsed by you guys, but anyway after several successful cycles, it failed again with lots of:

W (210492) mesh: [mesh_schedule.c,3130] [WND-RX]max_wnd:2, 1200 ms timeout, seqno:589, xseqno:232, no_wnd_count:0, timeout_count:23
I (210992) mesh: [SCAN][ch:1]AP:5, other(ID:0, RD:0), MAP:3, idle:1, candidate:0, root:0, topMAP:0[c:2,i:2][00:00:00:00:00:00]<>
I (210992) mesh: [FAIL][53]root:0, fail:53, normal:0, <pre>backoff:0

ending in a broken mesh again, see logs attached test 4 below.

Q3. Yes I had changed log_levels at the start of my project. I have removed this now so everything starts at default levels. You can set the levels in my console using "loglevel xxx l" where xxx is the ESP_LOG tag and l = the level.

See logs attached
Nov20.zip

Wifi manual scan switch shows MESH_NODE start up and connect,followed by a manual scan and switch to a closer parent. (No reboot of MESH_ROOT)

Test 1 is a reference with no manual scan for closer parent before reboot of MESH_ROOT and all nodes reconnect successfully

Test 2 is a number of cycles of manual scan and parent switch followed by MESH_ROOT reboot. This was successful for about 3 cycles although it seemed the struggle, before failing on the last attempt. The logs have wrapped but show the end when it gets broken

Test 3 is a more controlled failure:
Test3 Mesh_root showing power off after nodes connect and manual scan for better root
Test3 Node becomes root showing, MESH_NODE, Connects to root, Does manual scan and switch parent, Root powers off, This node becomes MESH_ROOT
Test 3 node becomes MESH_IDLE showing, MESH_NODE starts and connects to root, Power off root, MESH_NODE doesn't scan instead become MESH_IDLE
Test 3 node stays connected to parent which becomes MESH_IDLE, showing MESH_NODE connects to MESH_ROOT, Does manual scan and switch parent, Root powers off, This node doesn't scan, stays MESH_NODE connected same parent which has now become MESH_IDLE

michaelsimp · 2024-11-20T06:35:42Z

Hi

In response to your last post:

At present I only consider the RSSI value.
The test in findClosestParent() does consider nodes already at capacity number of children, and will not try to switch to them.

My thoughts were, I am not wanting to build the mesh network from scratch as I start with a self configured network. I am only planning to make changes to nodes with poor RSSIs. So far my tests have been successful network architecture wise (when I have a fixed ROOT so I don't get the broken mesh problem).

I appreciate what you are saying and will certainly doo more testing and add more intelligence into the parent selection if necessary. I already send all my node network attributes to the MESH_ROOT where I have a table of all node and their parent, children, layers and RSSI. I could broadcast this to all nodes if necessary to enable smarter logic at the selection.

But I can't keep the manual scan and parent switch code while it causes my network to break which leaves me in a real predicament performance wise.

I really need a resolution to this as my priority.

Thanks for your ongoing help

zhangyanjiaoesp · 2024-11-22T03:18:57Z

The definition of reason code 100/101 is here:

esp-idf/components/esp_wifi/include/esp_mesh.h

Line 267 in f420609

} mesh_disconnect_reason_t;

zhangyanjiaoesp · 2024-11-22T03:28:39Z

I don't feel comfortable with this solution unless it is endorsed by you guys, but anyway after several successful cycles, it failed again with lots of:

You can use the reason code to categorize the issues, but this is not entirely reliable, as different scenarios may generate the same reason code.

I have tried this but no success.

Are you saying that using disconnected->reason == WIFI_REASON_BEACON_TIMEOUT for judgment is completely ineffective, or it can work but can't work as well as (disconnected->reason != WIFI_REASON_ASSOC_LEAVE) && (disconnected->reason != WIFI_REASON_NO_AP_FOUND) && (disconnected->reason != WIFI_REASON_AP_TSF_RESET)?

zhangyanjiaoesp · 2024-11-22T03:45:17Z

The test 2 1/2/3/4 refer to the four devices in a single round of testing?
where is test 3?

michaelsimp · 2024-11-22T05:48:55Z

Yes Test 2 1.txt through test 2 4.txt were the 4 devices on a single round of testing
Sorry here are the missing test logs from yesterday
Nov22.zip
Tests 1, 2, 3 were done on the software with:
disconnected->reason == WIFI_REASON_BEACON_TIMEOUT

test 4 was done with"
(disconnected->reason != WIFI_REASON_ASSOC_LEAVE) && (disconnected->reason != WIFI_REASON_NO_AP_FOUND) && (disconnected->reason != WIFI_REASON_AP_TSF_RESET)

Are you saying that using disconnected->reason == WIFI_REASON_BEACON_TIMEOUT for judgment is completely ineffective, or it can work but can't work as well as (disconnected->reason != WIFI_REASON_ASSOC_LEAVE) && (disconnected->reason != WIFI_REASON_NO_AP_FOUND) && (disconnected->reason != WIFI_REASON_AP_TSF_RESET)

Neither work reliably. after 2 or 3 cycles some nodes will fail and go MESH_IDLE and not scan and the network is broken.

zhangyanjiaoesp · 2024-11-25T03:33:16Z

I just reviewed the log for test3, and the device behavior is normal. The device being in the MESH_IDLE state is not permanent; it is a temporary state. Below is my analysis:

At the beginning, the self-organizing network formed the following topology:
root(53:d8) --- node A (39:d4)
|--- node B (72:b8)
|--- node C (5c:68)
node C call manual scan, select A as the better parent, change to layer3 node (self-organized disabled, set parent)
node A call manual scan, still select root as the better parent, still layer2 node (self-organized disabled, set parent)
root power off
node B found root leave, beacon timeout, parent disconnect, enable self-organized, change to be root
node A found root leave, beacon timeout, parent disconnect, enable self-organized. However, at that moment, it was sending data, and it is trying to reconnecting, when you queried, the device was shown as in the MESH_IDLE state. I believe that if the device remains in an idle state and cannot recover, then this is an issue. However, if there are no subsequent logs, I don't consider it a problem. You cannot expect the device to always be in a non-idle state whenever the application layer checks the mesh status.

zhangyanjiaoesp · 2024-11-25T03:55:08Z

In the test4 log, the device eventually connected successfully.

The log you referred to is just a part of the intermediate process.

zhangyanjiaoesp · 2024-11-25T03:59:12Z

According to your test log, I think the disconnected->reason == WIFI_REASON_BEACON_TIMEOUT will be better than (disconnected->reason != WIFI_REASON_ASSOC_LEAVE) && (disconnected->reason != WIFI_REASON_NO_AP_FOUND) && (disconnected->reason != WIFI_REASON_AP_TSF_RESET) , because there are too many disconnect reason, and it is unreasonable to switch to self-organized mode as soon as the reason is not equal to 8, 201, or 206

michaelsimp · 2024-11-25T04:09:41Z

How long should it take to for a MESH_IDLE to find a parent again? I am sure I waited 10s of seconds and it wasn't even scanning.
I am setting up for another run of tests where I will wait longer. I am just worried about my back trace overflowing using Putty

zhangyanjiaoesp · 2024-11-25T04:10:13Z

In the test2 logs, I cannot analyze the entire network change process as I did with test3 because the log only contains part of the information. It is curious why such a reason would occur.

It seems that in test2_2 and test2_3, there was no opportunity to switch to the self-organized network, and the device kept trying to connect to the originally configured parent, but the parent could not be detected.

michaelsimp · 2024-11-25T05:36:48Z

Hi

Today I am getting problems where nodes get stuck in a loop logging forever. I can't keep my trace open long enough as I lose the start, but take my word for it please, once in this state it never comes out no matter how long (minutes). eg

I (00:02:07.516) aWifiMesh: <MESH_EVENT_PARENT_DISCONNECTED>reason:139
W (00:02:07.528) aWifiMesh: WiFi Disconnected
>>>last layer = 4, layer = -1
W (00:02:07.529) aWifiMesh: <MESH_EVENT_ROUTING_TABLE_REMOVE>remove 1, new:3
I (128652) mesh: [wifi]disconnected reason:201(), continuous:1/max:12, non-root, vote(,stopped)<><>
I (00:02:07.644) aWifiMesh: <MESH_EVENT_PARENT_DISCONNECTED>reason:201
W (00:02:07.645) aWifiMesh: WiFi Disconnected
>>>last layer = 4, layer = -1
I (128772) mesh: [wifi]disconnected reason:201(), continuous:2/max:12, non-root, vote(,stopped)<><>
I (00:02:07.769) aWifiMesh: <MESH_EVENT_PARENT_DISCONNECTED>reason:201
W (00:02:07.769) aWifiMesh: WiFi Disconnected
>>>last layer = 4, layer = -1
I (128882) mesh: 1145[xrsp:1]the asked:19, max window:2, force to increase/decrease(up) xseqno:17 for child 48:ca:43:9b:5d:20, xrsp_seqno:14, heap:101160
I (128892) mesh: 1307[recv]cidx[0]48:ca:43:9b:5d:20 xseqno loss, current/new:15/19, in:17, out:17, pending:0
I (128892) mesh: [wifi]disconnected reason:201(), continuous:3/max:12, non-root, vote(,stopped)<><>
I (00:02:07.893) aWifiMesh: <MESH_EVENT_PARENT_DISCONNECTED>reason:201
W (00:02:07.894) aWifiMesh: WiFi Disconnected
>>>last layer = 4, layer = -1
I (129022) mesh: [wifi]disconnected reason:201(), continuous:4/max:12, non-root, vote(,stopped)<><>
I (00:02:08.018) aWifiMesh: <MESH_EVENT_PARENT_DISCONNECTED>reason:201
W (00:02:08.019) aWifiMesh: WiFi Disconnected
>>>last layer = 4, layer = -1
I (129142) mesh: [wifi]disconnected reason:201(), continuous:5/max:12, non-root, vote(,stopped)<><>
I (00:02:08.143) aWifiMesh: <MESH_EVENT_PARENT_DISCONNECTED>reason:201
W (00:02:08.144) aWifiMesh: WiFi Disconnected
>>>last layer = 4, layer = -1
I (129272) mesh: [wifi]disconnected reason:201(), continuous:6/max:12, non-root, vote(,stopped)<><>
I (00:02:08.268) aWifiMesh: <MESH_EVENT_PARENT_DISCONNECTED>reason:201
W (00:02:08.269) aWifiMesh: WiFi Disconnected
>>>last layer = 4, layer = -1

Nov 25.zip

I found I am the author of one way that this can happen, if I scan and switch parents manually. See test1 logs attached where
node A MAC 48:27:e2:18:39:80 switches no node B 48:ca:43:9b:5d:20
Bode B MAC:48:ca:43:9b:5d:20 switches no node A 48:27:e2:18:39:80

This is one cause of the above. I think I can fix this by checking that I am not switching the nodes parent to one of its children.
I think it probably also makes sense to not swap to a parent node which has a higher layer than this node too.
But while not ideal that I am doing this, it shouldn't result in the node getting stuck in a disconnect loop?

But Test 2 looks the same problem but is not triggered by the above. MESH_ROOT is powered off. A panic crash on a MESH_NODE 48:ca:43:9b:5d:20 which still happens from time to time, but my bigger concern is that after this, MESH_NODE MAC: 48:27:e2:18:39:80 gets stuck in the disconnect loop

Are you able to reproduce these problems with the ip_internal_network you modified a week or so back? I get lost in all of this and feel we would make better progress if you were able to test, analyze and debug directly.

michaelsimp · 2024-11-25T06:27:18Z

Hi again
Regarding your analysis of test 3 specifically node A where you said.
node A found root leave, beacon timeout, parent disconnect, enable self-organized. However, at that moment, it was sending data, and it is trying to reconnecting, when you queried, the device was shown as in the MESH_IDLE state. I believe that if the device remains in an idle state and cannot recover, then this is an issue. However, if there are no subsequent logs, I don't consider it a problem. You cannot expect the device to always be in a non-idle state whenever the application layer checks the mesh status.
The disconnect came at 00:01:18
I (00:01:18.371) aWifiMesh: <MESH_EVENT_PARENT_DISCONNECTED>reason:200
I stopped the log after 00:01:58, 40 seconds later and nothing was happening, no visible signs of scanning for a parent.
I am sure that when this occurs, no matter how long I leave it, it does not recover.
Also when it gets stuck in the disconnect loop, logs the repetitive sequence (above) indefinitely.

michaelsimp · 2024-11-25T06:51:56Z

Two posts back I wrote:

I found I am the author of one way that this can happen, if I scan and switch parents manually. See test1 logs attached where
node A MAC 48:27:e2:18:39:80 switches no node B 48:ca:43:9b:5d:20
Bode B MAC:48:ca:43:9b:5d:20 switches no node A 48:27:e2:18:39:80

This is one cause of the above. I think I can fix this by checking that I am not switching the nodes parent to one of its children.
I think it probably also makes sense to not swap to a parent node which has a higher layer than this node too.
But while not ideal that I am doing this, it shouldn't result in the node getting stuck in a disconnect loop?

When I looked at the code, I am finding it difficult to decipher the variables

parent_record is the best parent candidate found so far
assoc I think is each node found in the scan esp_mesh_scan_get_ap_record(&record, &assoc); is this correct?
parent_assoc I am not clear on what this is or how it gets set.
It is initialized to: mesh_assoc_t parent_assoc = { .layer = CONFIG_MESH_MAX_LAYER, .rssi = -120 }; as a worst case record
and updated to contents of assoc when a better parent is found

the original source (taken from the example project "manual_networking") seems to be already checking :
if (assoc.layer < parent_assoc.layer || assoc.layer2_cap < parent_assoc.layer2_cap) {
But I am not sure if this stops a MESH_NODE selecting a child as a new parent, or do I need to add the line:
if (esp_mesh_get_layer() >= assoc.layer)

Could you take a look at the test 1 logs as they do appear to be setting parent to each other.

The entire routine is currently as follows if you could check and make any changes please.

void findClosestParent(int num) { // after a WiFi scan
    ESP_LOGW(TAG, "findClosestParent  Current RSSI: %d", currentRSSI);
    int i;
    int ie_len = 0;
    mesh_assoc_t assoc;
    mesh_assoc_t parent_assoc = { .layer = CONFIG_MESH_MAX_LAYER, .rssi = -120 };
    wifi_ap_record_t record;
    wifi_ap_record_t parent_record = { 0, };
    parent_record.rssi = currentRSSI; // has to be better than current RSSI to change parent
    bool parent_found = false;
    mesh_type_t my_type = MESH_IDLE;
    int my_layer = -1;
    wifi_config_t parent = { 0, };
    wifi_scan_config_t scan_config = { 0 };

    for (i = 0; i < num; i++) { // iterate through scan records looking for eligible closer parent node
        ESP_ERROR_CHECK(esp_mesh_scan_get_ap_ie_len(&ie_len));
        ESP_ERROR_CHECK(esp_mesh_scan_get_ap_record(&record, &assoc));
        ESP_LOGD(TAG, "ie_len: %d  sizeof(assoc): %d", ie_len, sizeof(assoc));
        if (ie_len == sizeof(assoc)) {
            ESP_LOGI(TAG,
                     "<MESH>[%d]%s, layer:%d/%d, assoc:%d/%d, %d, "MACSTR", channel:%u, rssi:%d, ID<"MACSTR"><%s>",
                     i, record.ssid, assoc.layer, assoc.layer_cap, assoc.assoc, assoc.assoc_cap, assoc.layer2_cap, MAC2STR(record.bssid),
                     record.primary, record.rssi, MAC2STR(assoc.mesh_id), assoc.encrypted ? "IE Encrypted" : "IE Unencrypted");

            // ESP_LOGI(MESH_TAG, "Type: %d  layer_cap %d:  assoc %d  assoc_cap: %d  rssi: %d", assoc.mesh_type, assoc.layer_cap, assoc.assoc, assoc.assoc_cap, record.rssi);
            if (assoc.mesh_type != MESH_IDLE && assoc.layer_cap && assoc.assoc < assoc.assoc_cap) { 
                // ESP_LOGI(MESH_TAG, "assoc.layer: %d  parent_assoc.layer %d:  assoc.layer2_cap %d  parent_assoc.layer2_cap: %d", assoc.layer, parent_assoc.layer, assoc.layer2_cap, parent_assoc.layer2_cap);
                if (assoc.layer < parent_assoc.layer || assoc.layer2_cap < parent_assoc.layer2_cap) {
                    if (record.rssi > parent_record.rssi) { // closer parent found
                        if (memcmp(parent_record.bssid, record.bssid, MAC_SIZE) != 0) { // dont switch to same parent
                            ESP_LOGW(TAG, "Closer Parent found: %s  RSSI: %d", record.ssid, record.rssi);
                            parent_found = true;
                            memcpy(&parent_record, &record, sizeof(record));
                            memcpy(&parent_assoc, &assoc, sizeof(assoc));
                            if (parent_assoc.layer_cap != 1) {
                                my_type = MESH_NODE;
                            } else {
                                my_type = MESH_LEAF;
                            }
                            my_layer = parent_assoc.layer + 1;
                            // break; // MSB removed, keep searching for the closest parent
                        }
                    }
                }
            }
        } else {
            ESP_LOGD(TAG, "[%d]%s, "MACSTR", channel:%u, rssi:%d", i, record.ssid, MAC2STR(record.bssid), record.primary, record.rssi);
        }
    }

    esp_mesh_flush_scan_result();
    if (parent_found) { // parent: Both channel and SSID of the parent are mandatory
        parent.sta.channel = parent_record.primary;
        memcpy(&parent.sta.ssid, &parent_record.ssid, sizeof(parent_record.ssid));
        parent.sta.bssid_set = 1;
        memcpy(&parent.sta.bssid, parent_record.bssid, 6);
        if ((my_type == MESH_NODE) || (my_type == MESH_LEAF) || (my_type == MESH_IDLE)) {
            ESP_ERROR_CHECK(esp_mesh_set_ap_authmode(parent_record.authmode));
            if (parent_record.authmode != WIFI_AUTH_OPEN) {
                memcpy(&parent.sta.password, CONFIG_MESH_AP_PASSWD, strlen(CONFIG_MESH_AP_PASSWD));
            }
            ESP_LOGW(TAG,
                     "<PARENT>%s, layer:%d/%d, assoc:%d/%d, %d, "MACSTR", channel:%u, rssi:%d",
                     parent_record.ssid, parent_assoc.layer,
                     parent_assoc.layer_cap, parent_assoc.assoc,
                     parent_assoc.assoc_cap, parent_assoc.layer2_cap,
                     MAC2STR(parent_record.bssid), parent_record.primary,
                     parent_record.rssi);
            esp_err_t err = esp_mesh_set_parent(&parent, (mesh_addr_t *)&parent_assoc.mesh_id, my_type, my_layer);
            switchParentTimer = currentTimeMs(); // reset timer for event <MESH_EVENT_PARENT_DISCONNECTED>
            if (err != ESP_OK) {
                ESP_LOGE(TAG, "esp_mesh_set_parent Error %d  my_type: %d  my_layer: %d", err, my_type, my_layer);
            }
            selfOrganizeReactivateTimer = SELF_ORGANIZE_REACTIVATE_TIME; // start self organize reactivation timer
        }
    } else {
        ESP_LOGE(TAG, "No eligible closer Parent found");
        if (currentRSSI == NO_RSSI) { // scan again if no connection yet
            esp_mesh_set_self_organized(false, false);
            esp_wifi_scan_stop();
            scan_config.show_hidden = 1;
            scan_config.scan_type = WIFI_SCAN_TYPE_PASSIVE;
            esp_wifi_scan_start(&scan_config, 0);
        }
    }
}

michaelsimp · 2024-11-25T22:35:08Z

Hi

By the way all yesterdays test and logs and today were made with your recommendation of only using
disconnected->reason == WIFI_REASON_BEACON_TIMEOUT

I have been testing getting the 4 nodes stacked up across 4 layers and powering off the NODE on layer 2 rather than the MESH_ROOT as this provide a cleaner set of logs.
Test 1 Nov 26.zip

See test 1

Layer 1 MESH_ROOT 48:ca:43:9b:53:d8
NODE A Layer 2 48:27:e2:18:39:80
NODE B Layer 3 48:ca:43:9b:54:c0
NODE C Layer 4 48:ca:43:9b:5d:20

Then power down NODE A on layer 2

NODE B switched from layer 3 to layer 2 and parent from NODE A to MESH_ROOT - perfect
NODE C stayed on layer 4 with parent 48:ca:43:9b:54:c1 which is now on layer 2, and does not show in the

Is this valid?
Node B moved from layer 3 to 2 when its parent dropped. Why did Node C not move to layer 3 ?

It stayed like this for minutes while I wrote this up

Then I powered of the MESH_ROOT see test 1 MESH_ROOT line 469. This node 48:ca:43:9b:53:d8 now becomes MESH_NODE and child of Node B.

See test 1 Node B.txt line 2038
NODE B which was on layer 2 connected to MESH_ROOT goes to MESH_IDLE with 2 children Node C and the old MESH_ROOT 48:ca:43:9b:53:d8

Remains broken like this indefinitely.

zhangyanjiaoesp · 2024-11-27T11:49:09Z

Regarding the issue of the log infinite loop (aWifiMesh: <MESH_EVENT_PARENT_DISCONNECTED>reason:201), I have already explained it in my previous comment.

It seems that in test2_2 and test2_3, there was no opportunity to switch to the self-organized network, and the device kept trying to connect to the originally configured parent, but the parent could not be detected.

Maybe we should first investigate why the specified parent node cannot be found at this point. Is it due to a power failure, has it become idle, or is there another underlying reason?

Are you able to reproduce these problems with the ip_internal_network you modified a week or so back? I get lost in all of this and feel we would make better progress if you were able to test, analyze and debug directly.

Sorry, I can't reproduce your issue on my side.

I have already discussed this with you in my previous comment: when selecting a better parent node, what criteria do you prioritize? I believe you can completely disregard the conditions in the example and instead design your own criteria based on your specific needs. First, you can move the definitions of parent_assoc and parent_record outside,

then update the parent_assoc->layer when connecting to the parent.

Before scanning, retrieve the current parent information.

Finally, within the findClosestParent() function, design the criteria for selecting a better parent based on your requirements and the issues encountered during testing.

My thoughts were, I am not wanting to build the mesh network from scratch as I start with a self configured network. I am only planning to make changes to nodes with poor RSSIs. So far my tests have been successful network architecture wise (when I have a fixed ROOT so I don't get the broken mesh problem).

You mentioned that you don't want to rebuild the network from scratch, but instead, you want to adjust the initial network formed by the self-organizing process. However, during the actual testing, I've observed that you often call scan at the application layer while the initial network is still being formed, which forcibly interrupts the self-organizing process.

So, when you call scan at the application layer, is it completely random? Would it make sense to first check whether the initial network has been fully formed before manually triggering the scan?

I believe we must first resolve the issues mentioned in points 3 and 4 before proceeding with further problem analysis. If the initial logic framework isn't properly established, it could lead to a range of unforeseen issues down the line, which would be quite painful for me to handle.

michaelsimp added the Type: Bug bugs in IDF label Oct 14, 2024

espressif-bot added the Status: Opened Issue is new label Oct 14, 2024

github-actions bot changed the title ~~WiFi Mesh unstable when parent offline~~ WiFi Mesh unstable when parent offline (IDFGH-13875) Oct 14, 2024

espressif-bot assigned zhangyanjiaoesp Oct 14, 2024

espressif-bot added Status: In Progress Work is in progress and removed Status: Opened Issue is new labels Oct 29, 2024

WiFi Mesh unstable when parent offline (IDFGH-13875) #14720

WiFi Mesh unstable when parent offline (IDFGH-13875) #14720

Comments

michaelsimp commented Oct 14, 2024

Answers checklist.

IDF version.

Espressif SoC revision.

Operating System used.

How did you build your project?

If you are using Windows, please specify command line type.

Development Kit.

Power Supply used.

What is the expected behavior?

What is the actual behavior?

Steps to reproduce.

Debug Logs.

More Information.

zhangyanjiaoesp commented Oct 22, 2024

michaelsimp commented Oct 24, 2024 via email

michaelsimp commented Oct 24, 2024 • edited Loading

michaelsimp commented Oct 24, 2024

michaelsimp commented Oct 24, 2024

brianignacio5 commented Oct 29, 2024

zhangyanjiaoesp commented Oct 29, 2024

michaelsimp commented Oct 30, 2024

brianignacio5 commented Oct 30, 2024

michaelsimp commented Oct 30, 2024

michaelsimp commented Oct 30, 2024

zhangyanjiaoesp commented Oct 30, 2024

michaelsimp commented Oct 30, 2024 via email

zhangyanjiaoesp commented Oct 30, 2024

michaelsimp commented Oct 30, 2024 • edited Loading

michaelsimp commented Oct 30, 2024

zhangyanjiaoesp commented Oct 31, 2024

zhangyanjiaoesp commented Oct 31, 2024

michaelsimp commented Oct 31, 2024

michaelsimp commented Oct 31, 2024

zhangyanjiaoesp commented Oct 31, 2024

michaelsimp commented Oct 31, 2024

michaelsimp commented Oct 31, 2024

zhangyanjiaoesp commented Oct 31, 2024

michaelsimp commented Nov 13, 2024 • edited Loading

michaelsimp commented Nov 14, 2024

michaelsimp commented Nov 14, 2024 • edited Loading

zhangyanjiaoesp commented Nov 14, 2024

zhangyanjiaoesp commented Nov 14, 2024 • edited Loading

michaelsimp commented Nov 17, 2024

zhangyanjiaoesp commented Nov 18, 2024

michaelsimp commented Nov 18, 2024

zhangyanjiaoesp commented Nov 19, 2024 • edited Loading

zhangyanjiaoesp commented Nov 19, 2024

michaelsimp commented Nov 20, 2024 • edited Loading

michaelsimp commented Nov 20, 2024

zhangyanjiaoesp commented Nov 22, 2024

zhangyanjiaoesp commented Nov 22, 2024

zhangyanjiaoesp commented Nov 22, 2024

michaelsimp commented Nov 22, 2024

zhangyanjiaoesp commented Nov 25, 2024

zhangyanjiaoesp commented Nov 25, 2024

zhangyanjiaoesp commented Nov 25, 2024

michaelsimp commented Nov 25, 2024

zhangyanjiaoesp commented Nov 25, 2024

michaelsimp commented Nov 25, 2024

michaelsimp commented Nov 25, 2024

michaelsimp commented Nov 25, 2024 • edited Loading

michaelsimp commented Nov 25, 2024 • edited Loading

zhangyanjiaoesp commented Nov 27, 2024

michaelsimp commented Oct 24, 2024 •

edited

Loading

michaelsimp commented Oct 30, 2024 •

edited

Loading

michaelsimp commented Nov 13, 2024 •

edited

Loading

michaelsimp commented Nov 14, 2024 •

edited

Loading

zhangyanjiaoesp commented Nov 14, 2024 •

edited

Loading

zhangyanjiaoesp commented Nov 19, 2024 •

edited

Loading

michaelsimp commented Nov 20, 2024 •

edited

Loading

michaelsimp commented Nov 25, 2024 •

edited

Loading

michaelsimp commented Nov 25, 2024 •

edited

Loading