You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
There was a recently opened issue #3510 where someone is trying to download a dataset but there is no expression matrix in the file.
From the logs you can see that there are two errors that are not handled correctly.
When parsing the original file we attempt to convert string values in column nuID to a float.
ValueError: could not convert string to float: 'ritxUH.kuHlYqjozpE'
TypeError: Cannot cast array data from dtype('O') to dtype('float32') according to the rule 'safe'
Since all samples in this experiment have similarly structured data there was no output data but the smasher still handled this job as successful.
ERROR [key: HOMO_SAPIENS] [job_id: 29798209]: Was told to smash a key with no frames!
For the failure in smashing these files, this is probably something that we wanted to have addressed in the no_op processor. However, because these files are are already downloaded we may need to account for this in the smasher_tools module.
For the other error, we will just want to add another check that determines if at least one key is present in the expression matrix. If not we should either throw an error or add a flag to indicate that there is data missing from the zip.
Solution or next step
Confirm assessment with someone from the science team
Update smasher to fail instead of creating metadata only download
Determine what to do about nuID column in original file
The text was updated successfully, but these errors were encountered:
Context
There was a recently opened issue #3510 where someone is trying to download a dataset but there is no expression matrix in the file.
From the logs you can see that there are two errors that are not handled correctly.
nuID
to a float.The original file used to determine the errant column.
https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSM1182499
Problem or idea
For the failure in smashing these files, this is probably something that we wanted to have addressed in the
no_op
processor. However, because these files are are already downloaded we may need to account for this in thesmasher_tools
module.For the other error, we will just want to add another check that determines if at least one key is present in the expression matrix. If not we should either throw an error or add a flag to indicate that there is data missing from the zip.
Solution or next step
nuID
column in original fileThe text was updated successfully, but these errors were encountered: