Request: increase verbosity of pcluster cli when Importing CDK times out
#6451
Comments
Hello Maurits, sorry for the late reply. To help us reproduce the issue, can you provide the version of CDK that was giving you trouble? Thank you!
I just started setting up ParallelCluster for the first time this week, and this is by far the biggest headache and time waster I stumbled upon, as I assumed the issue was somewhere in AWS (IAM, networking etc.).
Steps to reproduce on Ubuntu 22.04.5, using a manually compiled Python 3.12:
# this installs the ancient Node.js version 12, which triggers this issue
sudo apt-get install nodejs
# setup virtual environment, upgrade pip and install latest aws-parallelcluster
python3.12 -m venv ~/.local/opt/pcluster
~/.local/opt/pcluster/bin/pip install -U pip
~/.local/opt/pcluster/bin/pip install -U aws-parallelcluster
# install symlink to pcluster
ln -sft ~/.local/bin/ ~/.local/opt/pcluster/bin/pcluster
# create-cluster silently hangs, with "Importing CDK..." in ~/.parallelcluster/pcluster-cli.log
pcluster create-cluster --cluster-name test --cluster-configuration cluster-config.yaml --debug
Output of
I can reproduce the issue. As I understand it, the issue is caused by the outdated Node.js, right? I am working on improving the ParallelCluster code. Thank you for the reproducer.
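Since the thread points at an outdated Node.js as the trigger, a preflight check along the following lines could surface the problem before the CLI appears to hang. This is only a sketch and not ParallelCluster's code: `parse_node_major`, `check_node`, and `MIN_NODE_MAJOR` are hypothetical names, and the threshold value is a placeholder assumption rather than a documented requirement.

```python
import re
import shutil
import subprocess

# Assumption: placeholder threshold for illustration only; consult the
# ParallelCluster / CDK documentation for the actual minimum version.
MIN_NODE_MAJOR = 18


def parse_node_major(version_output):
    """Return the major version from `node --version` output (e.g. 'v12.22.9'), or None."""
    match = re.match(r"^v?(\d+)\.", version_output.strip())
    return int(match.group(1)) if match else None


def check_node(min_major=MIN_NODE_MAJOR):
    """Return a warning string if Node.js is missing or too old, otherwise None."""
    node = shutil.which("node") or shutil.which("nodejs")
    if node is None:
        return "Node.js not found on PATH"
    try:
        result = subprocess.run(
            [node, "--version"], capture_output=True, text=True, timeout=10
        )
    except (OSError, subprocess.TimeoutExpired) as exc:
        return f"could not run {node}: {exc}"
    major = parse_node_major(result.stdout)
    if major is None:
        return f"could not parse Node.js version from {result.stdout!r}"
    if major < min_major:
        return f"Node.js {result.stdout.strip()} is older than major version {min_major}"
    return None
```

Calling `check_node()` before `pcluster create-cluster` and printing any returned string would at least hint at Node.js rather than at IAM or networking.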
I'm trying to create a cluster using the pcluster CLI from my local machine.
When I have errors in the configuration, it responds with the correct errors and FSids that are in my account, but when the configuration of my cluster is correct nothing happens;
However, when turning off the Dry-run flag:
It just keeps hanging on this command. Adding the --debug flag does not change anything.
Note that I am using granted to gain access to my AWS account, that I'm able to deploy resources using Terraform, and that I can use the AWS CLI to interact with my resources.
I checked the pcluster log at ~/.parallelcluster/pcluster-cli.log and saw that it was stuck on the following line:
cdk_builder.py:34:build_cluster_template() - importing CDK.....
I used nvm to switch to a newer version of cdk, and now it runs, but it took me quite a long time to find this specific error.
ParallelCluster version: 3.11.0
frankfurt-test-2
pcluster describe-cluster command: N/A - cluster would not create
N/A - cluster would not create
Bug description and how to reproduce:
I don't get a bug, just no feedback at all.
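One way the requested feedback could look is a watchdog that logs a warning when a step runs longer than expected. The sketch below is a generic illustration using only the Python standard library; `warn_if_slow` and the logger name are hypothetical, and nothing here reflects how ParallelCluster is actually implemented. Whether such a timer would fire during the real hang depends on where the import blocks.

```python
import logging
import threading
import time
from contextlib import contextmanager

logger = logging.getLogger("slow_step_demo")  # hypothetical logger name


@contextmanager
def warn_if_slow(step, seconds, hint=""):
    """Log a warning if the wrapped block is still running after `seconds`."""

    def _warn():
        logger.warning("%s is taking longer than %.1fs. %s", step, seconds, hint)

    timer = threading.Timer(seconds, _warn)
    timer.daemon = True  # do not keep the process alive just for the timer
    timer.start()
    try:
        yield
    finally:
        timer.cancel()  # no warning if the block finished in time
```

A slow step could then be wrapped as `with warn_if_slow("Importing CDK", 30, "Check that a recent Node.js is on PATH."): ...`, so that the user sees a hint instead of silence. The 30-second threshold is an arbitrary choice for the example.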
For issues with Slurm scheduler, please attach the following logs:
/var/log/parallelcluster/clustermgtd, /var/log/parallelcluster/clusterstatusmgtd (if version >= 3.2.0), /var/log/parallelcluster/slurm_resume.log, /var/log/parallelcluster/slurm_suspend.log, /var/log/parallelcluster/slurm_fleet_status_manager.log (if version >= 3.2.0) and /var/log/slurmctld.log.
/var/log/parallelcluster/computemgtd.log and /var/log/slurmd.log.