Lite-Node Bug Reports

Welcome to the Lite-Node Bug Report forum! The primary function of this forum is to collect reports on issues currently affecting Lite-Nodes. Our goal is that this forum will become an extremely valuable tool to the development team, allowing all of us to help make diode the best network it can be.

The quality assurance team will be monitoring this forum continuously, looking for new issues and then adding them to our bug tracking database.

While you may not see us post here very often, please know that your reports are read and appreciated. In the event that the amount of information in a bug report proves to be insufficient during our investigation, you may be asked for additional information (like your hardware configuration or what time you encountered a specific issue)—so keep an eye out!

Please keep all posts on topic. This means threads should center around bug reports, workarounds, and discussion of impacting issues.

When providing information on a bug report, it is okay to post on an existing thread if your issue is exactly the same as the original post. If you are experiencing a similar issue with slight variations, please create your own topic.

  • If you find multiple bugs, create a separate thread for each one. While this may seem tedious, it is essential to our workflow as it allows our internal tools track which specific bugs have been investigated and it helps us share the workload among our team.
  • Do not hijack threads with your new topic. Posting an issue which is similar but different from the original topic means that your request may be missed or the original request ignored.
  • Please post a subject with a clear summary of your issue. Starting posts with “I need help!” or “Can’t play no matter what” only serves to make your issue harder to resolve.
  • We understand that emotions can run high while dealing with technical issues, but posting in an abusive or disparaging manner will not be tolerated, and does not advance our goal of providing assistance to the widest audience of customers in this forum.
  • Please do not post direct contact info for other companies. It is acceptable to post links to websites where contact info can be found, but posting the information here on the forum is not permitted.

As always, community forums work best when all participants treat each other with respect and courtesy before posting.

Thank you for supporting Diode.

@Knusperhaus when entering the shell and running the diode-node.info I get an error

/snap/bin/diode-node.shell
=ERROR REPORT==== 17-Jul-2024::14:27:44.824279 ===
File operation error: eacces. Target: .. Function: list_dir. Process: application_controller.
=ERROR REPORT==== 17-Jul-2024::14:27:44.887349 ===
File operation error: eacces. Target: .. Function: list_dir. Process: application_controller.
=ERROR REPORT==== 17-Jul-2024::14:27:44.927296 ===
File operation error: eacces. Target: .. Function: list_dir. Process: application_controller.
Erlang/OTP 26 [erts-14.2.3] [source] [64-bit] [smp:128:1] [ds:128:1:10] [async-threads:1] [jit:ns]

Interactive Elixir (1.15.7) - press Ctrl+C to exit (type h() ENTER for help)
iex(diode_node@nodelite001-1cpu-1gb-us-sjo1)1> diode-node.info
** (CompileError) cannot compile code (errors have been logged)

iex(diode_node@nodelite001-1cpu-1gb-us-sjo1)1> h
IEx.Helpers was not compiled with docs
*** ERROR: Shell process terminated! (^G to start new job) ***
iex(diode_node@nodelite001-1cpu-1gb-us-sjo1)2> 

runnign from the path

/snap/bin/diode-node.info
=ERROR REPORT==== 17-Jul-2024::14:31:37.197599 ===
File operation error: eacces. Target: .. Function: list_dir. Process: application_controller.
=ERROR REPORT==== 17-Jul-2024::14:31:37.251027 ===
File operation error: eacces. Target: .. Function: list_dir. Process: application_controller.
=ERROR REPORT==== 17-Jul-2024::14:31:37.287794 ===
File operation error: eacces. Target: .. Function: list_dir. Process: application_controller.
== Diode Node 0x22612cf691c006cc656b0489eaXXXXXXXXXXXXXXX ==
Version          : 1.2.1
Connected Devices: 2
Connected Peers  : 8

Current Epoch    : 664
Ticket Score     : 0

Previous Epoch   : 663
Ticket Score     : 0
1 Like

I have the same error

Yes, confirmed. Looking at that

1 Like

Server crashes and restarts after this exception:

2024-07-19T18:58:30Z diode-node.diode-node[711532]: 18:58:30.252 [error] Process #PID<0.94431.0> on node :diode_node@vps38754 raised an exception
2024-07-19T18:58:30Z diode-node.diode-node[711532]: ** (MatchError) no match of right hand side value: {:error, :enoent}
2024-07-19T18:58:30Z diode-node.diode-node[711532]:     (dets_plus 2.3.2) lib/dets_plus.ex:895: anonymous fn/7 in DetsPlus.spawn_sync_worker/1

EDIT: this happens pretty much exactly every 3 minutes

2 Likes

light nodes causing a client host error, error at web browser end resulting in error 500

**doConnectDevice() for 'media-stream' failed: GetServer(): wrong signature in server object &{Host:[57 52 46 50 51 55 46 50 46 49 54 56] EdgePort:41046 ServerPort:51055 Sig:[0 135 196 209 183 211 137 67 142 37 219 83 38 171 183 72 174 14 223 48 253 153 90 187 192 174 71 74 236 72 224 113 110 105 71 109 8 147 160 231 17 205 160 183 88 110 159 246 16 129 179 35 94 128 75 85 222 167 165 28 81 161 239 176 254] ServerPubKey:[4 170 175 49 75 113 70 225 30 44 47 24 2 46 12 180 57 254 168 61 173 150 50 66 61 65 185 80 116 63 147 62 90 149 59 87 117 14 155 106 34 2 198 196 129 170 62 150 51 110 178 37 34 16 153 219 22 201 112 66 135 88 238 233 222] Extra:map[block:{neg:false abs:[6678668]} name:{neg:false abs:[3275074798935961137 3558811779124520802 7811903006599819565 8315171310995989605 8315171473952368997 28025]} tickets:{neg:false abs:[]} time:{neg:false abs:[1721571447054126263]} uptime:{neg:false abs:[24683378]}]} with 2 candidates**

update on client side


INFO Retry to connect (1/3), waiting 5s server=94.237.2.168:41046 
ERROR Couldn't add block 7597200 [0 0 12 245 94 103 146 111 80 168 80 61 106 28 44 175 118 123 6 19 226 161 251 51 238 158 193 128 247 137 84 143]: don't know direct parent of this block server=95.111.218.30:41046 
DEBUG Added relay#4 [0x982039c3aaef4f40896793ec7ed2fc7afb0f6492] @ 94.237.2.168:41046
ERROR Couldn't add block 7597200 [0 0 12 245 94 103 146 111 80 168 80 61 106 28 44 175 118 123 6 19 226 161 251 51 238 158 193 128 247 137 84 143]: don't know direct parent of this block server=185.70.196.190:41046 
ERROR Couldn't add block 7597200 [0 0 12 245 94 103 146 111 80 168 80 61 106 28 44 175 118 123 6 19 226 161 251 51 238 158 193 128 247 137 84 143]: don't know direct parent of this block server=95.111.218.30:41046 
ERROR Couldn't add block 7597200 [0 0 12 245 94 103 146 111 80 168 80 61 106 28 44 175 118 123 6 19 226 161 251 51 238 158 193 128 247 137 84 143]: don't know direct parent of this block server=185.70.196.190:41046 
ERROR Couldn't add block 7597200 [0 0 12 245 94 103 146 111 80 168 80 61 106 28 44 175 118 123 6 19 226 161 251 51 238 158 193 128 247 137 84 143]: don't know direct parent of this block server=95.111.218.30:41046 
ERROR Couldn't add block 7597204 [0 0 12 220 33 174 142 107 88 36 221 135 93 118 248 150 193 187 172 109 254 70 125 193 173 70 156 63 212 31 225 156]: don't know direct parent of this block server=94.237.2.1

v1.2.5 is released and should a address two issues:

  1. diode-node.info should will not issue these errors anymore:
=ERROR REPORT==== 17-Jul-2024::14:31:37.197599 ===
File operation error: eacces. Target: .. Function: list_dir. Process: application_controller.

:eyes: @JamesT

  1. DetsPlus related errors should auto-recover

:eyes: @fluffycroutons

2 Likes

multiple light nodes causing a host error,

Are you saying you’re running multiple light nodes on the same server? Sorry I’m unclear on this…

No all my light nodes with fallback are still producing a 500 error on the client side

diode-node.info works perfect now

1 Like

DetsPlus error is gone now after update, but the node is still in a reboot loop:

<prints "coinbase" line with address>
2024-07-22T17:08:14Z diode-node.diode-node[788934]: 17:08:14.888 [info] Cron: Executing Broadcast Self...
2024-07-22T17:08:14Z diode-node.diode-node[788934]: 17:08:14.888 [info] Cron: Executing Reload Cert...
2024-07-22T17:08:14Z diode-node.diode-node[788934]: 17:08:14.893 [info] maybe_import_key: Checking ["/var/snap/diode-node/7/priv", "/var/snap/diode-node/7"]'
2024-07-22T17:08:14Z diode-node.diode-node[788934]: 17:08:14.935 [info] PeerHandlerV2 ceca2f@us1.prenet.diode.io: Creating connect worker
2024-07-22T17:08:14Z diode-node.diode-node[788934]: 17:08:14.937 [info] PeerHandlerV2 7e4cd3@us2.prenet.diode.io: Creating connect worker
2024-07-22T17:08:14Z diode-node.diode-node[788934]: 17:08:14.942 [info] PeerHandlerV2 68e0ba@as1.prenet.diode.io: Creating connect worker
2024-07-22T17:08:15Z diode-node.diode-node[788934]: 17:08:14.948 [info] PeerHandlerV2 1350d3@as2.prenet.diode.io: Creating connect worker
2024-07-22T17:08:15Z diode-node.diode-node[788934]: 17:08:14.989 [info] PeerHandlerV2 937c49@eu1.prenet.diode.io: Creating connect worker
2024-07-22T17:08:15Z diode-node.diode-node[788934]: 17:08:14.994 [info] PeerHandlerV2 ae6992@eu2.prenet.diode.io: Creating connect worker
2024-07-22T17:08:15Z diode-node.diode-node[788934]: 17:08:15.099 [info] PeerHandlerV2 d17005@localhost: Creating connect worker
2024-07-22T17:08:15Z diode-node.diode-node[788934]: 17:08:15.955 [info] PeerHandlerV2 1350d3@as2.prenet.diode.io: hello from: fringe_quiz      (0x1350d3b501d6842ed881b59de4b95b27372bfae8)
2024-07-22T17:08:15Z diode-node.diode-node[788934]: 17:08:15.955 [info] PeerHandlerV2 ae6992@eu2.prenet.diode.io: hello from: feel_panther     (0xae699211c62156b8f29ce17be47d2f069a27f2a6)
2024-07-22T17:08:15Z diode-node.diode-node[788934]: 17:08:15.955 [info] PeerHandlerV2 937c49@eu1.prenet.diode.io: hello from: grain_silk       (0x937c492a77ae90de971986d003ffbc5f8bb2232c)
2024-07-22T17:08:15Z diode-node.diode-node[788934]: 17:08:15.955 [info] PeerHandlerV2 68e0ba@as1.prenet.diode.io: hello from: catch_also       (0x68e0bafdda9ef323f692fc080d612718c941d120)
2024-07-22T17:08:15Z diode-node.diode-node[788934]: 17:08:15.956 [info] PeerHandlerV2 7e4cd3@us2.prenet.diode.io: hello from: pause_chalk      (0x7e4cd38d266902444dc9c8f7c0aa716a32497d0b)
2024-07-22T17:08:15Z diode-node.diode-node[788934]: 17:08:15.956 [info] PeerHandlerV2 d17005@127.0.0.1: hello from: hotel_story      (0xd170058e208ac039b50c8342c45b9ed8e975a371)
2024-07-22T17:08:15Z diode-node.diode-node[788934]: 17:08:15.957 [info] PeerHandlerV2 ceca2f@us1.prenet.diode.io: hello from: enrich_turtle    (0xceca2f8cf1983b4cf0c1ba51fd382c2bc37aba58)
2024-07-22T17:08:15Z diode-node.diode-node[788934]: 17:08:15.958 [info] PeerHandlerV2 d17005@localhost: hello from: hotel_story      (0xd170058e208ac039b50c8342c45b9ed8e975a371)
2024-07-22T17:08:19Z diode-node.diode-node[788934]: 17:08:19.839 [info] EdgeV2 746b9e@180.131.145.33: accepted connection
2024-07-22T17:08:20Z diode-node.diode-node[788934]: 17:08:20.158 [info] PeerHandlerV2 24e22f@192.168.0.72: Creating connect worker
2024-07-22T17:08:22Z diode-node.diode-node[788934]: 17:08:22.160 [info] Timeout while getting a result from stool_dress      (0x24e22f3e0843b1df1419397acdfb01bf1bd0aeb3)
2024-07-22T17:08:24Z diode-node.diode-node[788934]: 17:08:24.172 [info] Timeout while getting a result from stool_dress      (0x24e22f3e0843b1df1419397acdfb01bf1bd0aeb3)
2024-07-22T17:08:25Z diode-node.diode-node[788934]: 17:08:25.162 [info] PeerHandlerV2 24e22f@192.168.0.72:51055: Connection failed in ssl.connect(): {:error, :timeout}
2024-07-22T17:08:25Z diode-node.diode-node[788934]: 17:08:25.162 [info] Node stool_dress      (0x24e22f3e0843b1df1419397acdfb01bf1bd0aeb3) down for: :normal
2024-07-22T17:08:35Z diode-node.diode-node[788934]: 17:08:35.171 [info] PeerHandlerV2 24e22f@192.168.0.72: Creating connect worker
2024-07-22T17:08:37Z diode-node.diode-node[788934]: 17:08:37.172 [info] Timeout while getting a result from stool_dress      (0x24e22f3e0843b1df1419397acdfb01bf1bd0aeb3)
2024-07-22T17:08:40Z diode-node.diode-node[788934]: 17:08:40.173 [info] PeerHandlerV2 24e22f@192.168.0.72:51055: Connection failed in ssl.connect(): {:error, :timeout}
2024-07-22T17:08:40Z diode-node.diode-node[788934]: 17:08:40.173 [info] Node stool_dress      (0x24e22f3e0843b1df1419397acdfb01bf1bd0aeb3) down for: :normal
<crashes here and restarts>

I don’t see a crash message. You want to check dmesg -T – If it’s crashing because it’s running out of memory you should see a message there in the kernel log.

dmesg -T shows these references to diode-node but they don’t look like crashes:

[Tue Jul 23 14:14:56 2024] kauditd_printk_skb: 57 callbacks suppressed
[Tue Jul 23 14:14:56 2024] audit: type=1400 audit(1721744105.044:2541): apparmor="DENIED" operation="open" profile="snap.diode-node.info" name="/proc/1260043/mountinfo" pid=1260043 comm="beam.smp" requested_mask="r" denied_mask="r" fsuid=1000 ouid=1000
[Tue Jul 23 14:14:56 2024] audit: type=1400 audit(1721744105.092:2542): apparmor="DENIED" operation="open" profile="snap.diode-node.info" name="/proc/1260043/mountinfo" pid=1260043 comm="beam.smp" requested_mask="r" denied_mask="r" fsuid=1000 ouid=1000
[Tue Jul 23 14:14:56 2024] audit: type=1400 audit(1721744105.100:2543): apparmor="DENIED" operation="open" profile="snap.diode-node.info" name="/proc/1260043/mountinfo" pid=1260043 comm="beam.smp" requested_mask="r" denied_mask="r" fsuid=1000 ouid=1000
[Tue Jul 23 14:14:57 2024] audit: type=1400 audit(1721744105.964:2544): apparmor="DENIED" operation="open" profile="snap.diode-node.info" name="/home/node/" pid=1260043 comm="erts_dios_10" requested_mask="r" denied_mask="r" fsuid=1000 ouid=1000
[Tue Jul 23 14:14:57 2024] audit: type=1400 audit(1721744106.060:2545): apparmor="DENIED" operation="open" profile="snap.diode-node.info" name="/home/node/" pid=1260043 comm="erts_dios_10" requested_mask="r" denied_mask="r" fsuid=1000 ouid=1000
[Tue Jul 23 14:14:57 2024] audit: type=1400 audit(1721744106.112:2546): apparmor="DENIED" operation="open" profile="snap.diode-node.info" name="/home/node/" pid=1260043 comm="erts_dios_5" requested_mask="r" denied_mask="r" fsuid=1000 ouid=1000
[Tue Jul 23 14:15:05 2024] audit: type=1400 audit(1721744113.764:2547): apparmor="DENIED" operation="open" profile="snap.diode-node.info" name="/proc/1260135/mountinfo" pid=1260135 comm="beam.smp" requested_mask="r" denied_mask="r" fsuid=1000 ouid=1000
[Tue Jul 23 14:15:05 2024] audit: type=1400 audit(1721744113.764:2548): apparmor="DENIED" operation="open" profile="snap.diode-node.info" name="/proc/1260135/mountinfo" pid=1260135 comm="beam.smp" requested_mask="r" denied_mask="r" fsuid=1000 ouid=1000
[Tue Jul 23 14:15:05 2024] audit: type=1400 audit(1721744113.768:2549): apparmor="DENIED" operation="open" profile="snap.diode-node.info" name="/proc/1260135/mountinfo" pid=1260135 comm="beam.smp" requested_mask="r" denied_mask="r" fsuid=1000 ouid=1000
[Tue Jul 23 14:15:05 2024] audit: type=1400 audit(1721744114.288:2550): apparmor="DENIED" operation="open" profile="snap.diode-node.info" name="/home/node/" pid=1260135 comm="erts_dios_9" requested_mask="r" denied_mask="r" fsuid=1000 ouid=1000
[Tue Jul 23 14:15:05 2024] audit: type=1400 audit(1721744114.336:2551): apparmor="DENIED" operation="open" profile="snap.diode-node.info" name="/home/node/" pid=1260135 comm="erts_dios_9" requested_mask="r" denied_mask="r" fsuid=1000 ouid=1000
[Tue Jul 23 14:15:05 2024] audit: type=1400 audit(1721744114.400:2552): apparmor="DENIED" operation="open" profile="snap.diode-node.info" name="/home/node/" pid=1260135 comm="erts_dios_9" requested_mask="r" denied_mask="r" fsuid=1000 ouid=1000

On v1.2.5 these crashes are happening again every few minutes:

2024-07-23T14:11:30Z diode-node.diode-node[788934]: 14:11:30.131 [error] Process #PID<0.2423365.0> on node :diode_node@vps38754 raised an exception
2024-07-23T14:11:30Z diode-node.diode-node[788934]: ** (MatchError) no match of right hand side value: {:error, :enoent}
2024-07-23T14:11:30Z diode-node.diode-node[788934]:     (dets_plus 2.3.2) lib/dets_plus.ex:895: anonymous fn/7 in DetsPlus.spawn_sync_worker/1

I wrote a script for listing oom values, Just copy and paste the script into an executable script like oom.sh

#!/bin/bash
# This script retrieves and displays the OOM (Out Of Memory) score and the OOM adjusted score 
# for each running process, sorted in descending order by the OOM score.

printf 'PID\tOOM Score\tOOM Adj\tCommand\n'

# Read each process ID and command, check if a corresponding oom_score file exists and its value is not zero.
# If so, print the process ID, OOM score, OOM adjusted score, and command.
while read -r pid comm
do
    if [ -f /proc/$pid/oom_score ] && [ $(cat /proc/$pid/oom_score) != 0 ]
    then
        printf '%d\t%d\t\t%d\t%s\n' "$pid" "$(cat /proc/$pid/oom_score)" "$(cat /proc/$pid/oom_score_adj)" "$comm"
    fi
done < <(ps -e -o pid= -o comm=) | sort -k 2nr

example output, this shows my beam.smp is using a lot of cpu, to avoid crashes I activated a swap file

PID     OOM Score       OOM Adj Command
300659  1186            0       beam.smp
329476  682             0       alloy
670     669             0       networkd-dispat
790     669             0       unattended-upgr
25289   668             0       systemd-resolve
328154  668             0       (sd-pam)
27131   667             0       rsyslogd
328147  667             0       sshd
328246  667             0       bash
49767   667             0       packagekitd
49775   667             0       udisksd
25283   666             0       systemd-network
26271   666             0       systemd-timesyn
300712  666             0       epmd
300716  666             0       erl_child_setup
301118  666             0       inet_gethost
301119  666             0       inet_gethost
301120  666             0       inet_gethost
301121  666             0       inet_gethost
301126  666             0       inet_gethost
328150  666             0       systemd
330982  666             0       bash
330991  666             0       htop
342204  666             0       sshd
342209  666             0       oom.sh
342210  666             0       oom.sh
342211  666             0       sort
49762   666             0       cron
49765   666             0       upowerd
49766   666             0       acpid
49773   666             0       polkitd
677     666             0       systemd-logind
731     666             0       login
25293   501             -250    systemd-journal
201676  72              -900    snapd
664     68              -900    dbus-daemon

my htop here shows diode grabbing swap and memory and cpu

hope it helps

whats your spec of build and os versions etc I am running ubuntu 22.04 and allis good with no crashes, just a memory leak and cpu leaks

I’m ubuntu 22.04 too. Maybe my 1GB memory isn’t enough.

@Knusperhaus original fix did seem to work at first, but now the same issue (the DetsPlus.spawn_sync_worker one) is back.

I am running the same memory config. Check to see if apparmor is denying beam.smp. also create a 1GB swap file to see if that stops the crashing.

What provider are you hosting on and what does dmesg -T show