-
Notifications
You must be signed in to change notification settings - Fork 13
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
datalad datasets : unable to read volumes #65
Comments
did you |
That is how it was originally downloaded but we copy pasted it to another computer (outside dataalad), that’s probably the source of the error but we can see the MRI scan though so they should be MRI scan data inside there, no? |
I get the same error in the computer where I did the (base) $ 19:04:49 Quark ds002785 >> vqct1 -b $PWD -old
Anatomical MRI module
Time stamp : 2022-07-11 19:04:54
version info: visualqc 0.6.1
numpy 1.17.4 / scipy 1.1.0 / matplotlib 3.5.1
python 3.7.2 (default, Dec 29 2018, 00:00:04)
[Clang 4.0.1 (tags/RELEASE_401/final)]
platform Darwin-21.4.0-x86_64-i386-64bit
Darwin Kernel Version 21.4.0: Fri Mar 18 00:45:05 PDT 2022; root:xnu-8020.101.4~15/RELEASE_X86_64
/Users/Reddy/anaconda3/envs/py36/lib/python3.7/site-packages/bids/layout/models.py:152: FutureWarning: The 'extension' entity currently excludes the leading dot ('.'). As of version 0.14.0, it will include the leading dot. To suppress this warning and include the leading dot, use `bids.config.set_option('extension_initial_dot', True)`.
FutureWarning)
Input folder: /Volumes/work/Pitt/datasets/ds002785
Output folder: /Volumes/work/Pitt/datasets/ds002785/visualqc
outlier detection: disabled, as requested.
Restoring ratings from previous session(s), if they exist ..
To be reviewed : 216
Reviewing MD5E-s6406026--f20d90f38f7122ca08d290b502661802.nii.gz
Traceback (most recent call last):
File "/Users/Reddy/anaconda3/envs/py36/bin/vqct1", line 8, in <module>
sys.exit(main())
File "/Users/Reddy/anaconda3/envs/py36/lib/python3.7/site-packages/visualqc/__t1_mri__.py", line 12, in main
t1_mri.cli_run()
File "/Users/Reddy/anaconda3/envs/py36/lib/python3.7/site-packages/visualqc/t1_mri.py", line 872, in cli_run
wf.run()
File "/Users/Reddy/anaconda3/envs/py36/lib/python3.7/site-packages/visualqc/workflows.py", line 87, in run
self.loop_through_units()
File "/Users/Reddy/anaconda3/envs/py36/lib/python3.7/site-packages/visualqc/workflows.py", line 224, in loop_through_units
skip_subject = self.load_unit(unit_id)
File "/Users/Reddy/anaconda3/envs/py36/lib/python3.7/site-packages/visualqc/t1_mri.py", line 507, in load_unit
self.current_img_raw = read_image(t1_mri_path, error_msg='T1 mri')
File "/Users/Reddy/anaconda3/envs/py36/lib/python3.7/site-packages/visualqc/utils.py", line 38, in read_image
''.format(error_msg, img_spec))
OSError: Given path to T1 mri does not exist!
/Volumes/work/Pitt/datasets/ds002785/.git/annex/objects/jz/W2/MD5E-s6406026--f20d90f38f7122ca08d290b502661802.nii.gz/MD5E-s6406026--f20d90f38f7122ca08d290b502661802.nii.gz
(base) $ 19:08:33 Quark ds002785 >> what commands can I run to ensure it was gotten / installed properly? I tried (base) $ 19:11:34 Quark ds002785 >> datalad metadata -d $PWD
[WARNING] Found no aggregated metadata info file /Volumes/work/Pitt/datasets/ds002785/.datalad/metadata/aggregate_v1.json. You will likely need to either update the dataset from its original location or reaggregate metadata locally.
[WARNING] Dataset at . contains no aggregated metadata on this path [metadata(/Volumes/work/Pitt/datasets/ds002785)]
(base) $ 19:11:39 Quark ds002785 >> |
now that I think about it, I realize I only installed one of the derivatives : freesurfer, and not the base BIDS dataset. I am now running |
I get the following, and it worked this time:
i don't understand the notneeded part in although I see two issues:
|
Let's zoom tomorrow? |
Sure! Tomorrow is a bit tough with dental appointments and other things but Thursday afternoon works. Or Friday? |
sure, just let me know the time ;-) Thu we have ReproNim coworking time 11-5pm which happens in NMIND gather town, so can meet there |
we are trying to use the dataset on an M1 MacBook, and it appears installing datalad on it is not easy (and definitely not for a high school student) i wish openfmri folks let us download the dataset, or parts of it, from a browser :). cc @effigies i will check if AWS CLI works on M1 MacBook |
there is always trade off between "I want the flashiest latest cool gadget from a company which does not really care about science" and "I want a system for doing science" ;) It is all on S3, you can use s3 clients to download straight from S3. Re M1 -- should install rosetta and then git-annex should be installable from brew IIRC. some details here datalad/datalad#5701 |
OpenNeuro permits downloading; I believe recent Chrome or Firefox is needed for the download API needed to work with such large datasets. If you're still using legacy.openfmri.org, then I think there are tarballs, but these datasets are not kept in sync with OpenNeuro. |
damn, that's good to know. I was always seeing it from safari, and there was no indication at all that we could download it from a browser. I would suggest leaving a note to ask folks to use Chrome or Firefox, instead of silently removing that option on safari |
it doesn't seem to work on firefox btw, atleast for 2 datasets I looked at |
You're right, it looks like Mozilla is not implementing this API; for some reason I thought they had. Looks like Chrome, Edge and Opera do implement it. https://developer.mozilla.org/en-US/docs/Web/API/File_System_Access_API#browser_compatibility |
Hi Yarik, I am available in the next few hours if you want to look into this issue. |
pinged you on twitter with url to nmind if you don't know |
ok,since zooming didn't happen, let me follow up on original datalad-related issues from the last related to that comment by @raamana :
most likely those 216 were already obtained
I have not looked inside: if visualqc traversal traverses also regardless of the datalad, you might benefit from speeding up walking through multithreading the walk -- we have it in https://github.com/dandi/dandi-cli/blob/master/dandi/support/threaded_walk.py but there we have not added any vcs folders exclusion yet (used only within zarr folders) -- filed dandi/dandi-cli#1086 to possibly harmonize.
such "resolve to the death" plagues many things, including browsers, AFNI etc. Often they come up with a switch to "do not bother resolving" and since I do not know details here I can only arrogantly state "there should be no need to resolve symlinks since that would incorporate some ad-hoc assumption on their purpose. If there is such ad-hoc assumption -- make it more explicit ". So what is the assumption which makes you to resolve the paths here? ;-) |
thanks Yarik for the detailed notes. I was thinking of potentially excluding certain paths like |
I resolve paths by default as one of the several best practices for file/path management -- I don't understand the case against resolving though, except in extreme situations of large number of layers of sym-linking (which is often not the case with most regular users) |
could you provide reference for such a best practice. My mileage goes against it ;-) |
I guess we approach it with different experiences from the past i guess :). one obvious rationale is to avoid depending on relative paths, which caused some issues for me before, esp. when the same tool is used process different projects and datasets |
"relative path" (e.g., |
VisualQC faces some issue with reading MR volumes from BIDS format downloaded via datalad -- see log below. cc @yarikoptic
The text was updated successfully, but these errors were encountered: