Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[BUG] Intermittent error executing cncli sync on 1.32.1 #83

Open
feqifei opened this issue Dec 15, 2021 · 5 comments
Open

[BUG] Intermittent error executing cncli sync on 1.32.1 #83

feqifei opened this issue Dec 15, 2021 · 5 comments
Labels
bug Something isn't working upstream change

Comments

@feqifei
Copy link

feqifei commented Dec 15, 2021

Hi, executing cncli sync on my public testnet node I run into the following error:
thread 'main' panicked at 'called Result::unwrap() on an Err value: Os { code: 104, kind: ConnectionReset, message: "Connection reset by peer" }', /home/ubuntu/.cargo/registry/src/github.com-1ecc6299db9ec823/cardano_ouroboros_network-0.2.8/src/mux/connection.rs:202:67note: run with RUST_BACKTRACE=1 environment variable to display a backtrace

I've not been able to replicate it, so not able to provide the backtrace log.
Node version 1.32.1, p2p feature activated

I informed Andrew Westberg, he tested on his node that faced the same issue on same code line. The second attempt went fine.
Also his node is on 1.32.1 with p2p activated.

image

I didn't face the issue on my mainnet nodes where the p2p feature is off.
I also updated the testnet node to 1.33.0-rc1 tag and I tried dozens of times to replicate the issue without success.

@mark-stopka
Copy link
Member

@AndrewWestberg any chance you've got the backtrace?

@mark-stopka mark-stopka added bug Something isn't working upstream change labels Dec 15, 2021
@mark-stopka mark-stopka changed the title Intermittent error executing cncli sync on 1.32.1 [BUG] Intermittent error executing cncli sync on 1.32.1 Dec 15, 2021
@AndrewWestberg
Copy link
Contributor

@mark-stopka The screenshot above is mine. I couldn't reproduce it when I tried to run it with backtrace. The line of code it fails on is provided so maybe we can figure out the issue by just looking at the code.

@mark-stopka
Copy link
Member

mark-stopka commented Dec 15, 2021

@AndrewWestberg it is the logging of the message size;

let len = self.stream.write(&msg).unwrap();

The TODO for that code section does include a note to handle errors.

@AndrewWestberg
Copy link
Contributor

I guess typically that code never throws any errors, but on rare occasion, some network communication glitch happens.

@pavlix
Copy link
Contributor

pavlix commented Jan 29, 2022

I guess typically that code never throws any errors, but on rare occasion, some network communication glitch happens.

What is your ideal idea of recovery in that case? I would like to refactor the library to avoid most panic!()s but I'm curious how the tool would respond to an actual connection error.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working upstream change
Projects
None yet
Development

No branches or pull requests

4 participants