idr0091-julou-lacinduction S-BIAD852 #650
Issue with conversion:
I even tried with |
Pretty sure that |
👍 It finally worked with |
50G definitely feels excessive. I recall that some past improvements were aimed at handling similar issues for large Micro-Manager metadata files. One thing possibly worth testing independently is whether
Semi-related, I would expect this particular file format to work without issues with OMERO 5.6.6. What is our policy for these types of submissions with mixed file formats (probably only a handful of them)? Are we converting everything or only the minimal amount of data? /cc @jburel |
Oh, I should test a different image then. Didn't notice that this submission had different file formats. |
ome/bioformats#3229 is the last time we addressed memory issues in Micro-Manager, so I'd be surprised if bioformats2raw 0.6.0 helps. Based on the partial stack trace, I'd guess it's original metadata annotations that are causing the problem. Comparing memory usage for |
Also converted one of the pattern files, and re-imported. Worked fine. But the converted MMStack can't be re-imported, also memory issue:
|
@dominikl - are you able to try the @melissalinkert If that is the case, does it suggest a workaround for A possible option is to use |
bioformats2raw does not have a direct equivalent to bfconvert's |
Going to start exporting with
Export 100 images
|
After about 17 hours we have 50 images... (about 3 an hour):
|
Moved 51 zarrs to
|
Zipping of 51 images in Upload to BioStudies...
|
Other 49 images from batch 1 completed... Also starting to export ALL the remaining images...
|
Looks like the last 2 images here (batch1) didn't export properly - too small:
Deleted them. Rename 49 others (remove
|
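A check like the "too small" one above could be automated. The following is only a sketch, not the command used in this thread: the helper names, the `.ome.zarr` suffix filter and the "below 10% of the median size" cutoff are all assumptions chosen for illustration.

```python
from pathlib import Path
from statistics import median


def dir_size(path: Path) -> int:
    """Total size in bytes of all regular files under path."""
    return sum(f.stat().st_size for f in path.rglob("*") if f.is_file())


def find_small_exports(
    root: Path, fraction: float = 0.1, suffix: str = ".ome.zarr"
) -> list[Path]:
    """Return export directories smaller than fraction * median size.

    The cutoff is a hypothetical heuristic; it is not taken from the issue.
    """
    sizes = {
        d: dir_size(d)
        for d in sorted(root.iterdir())
        if d.is_dir() and d.name.endswith(suffix)
    }
    if not sizes:
        return []
    cutoff = fraction * median(sizes.values())
    return [d for d, size in sizes.items() if size < cutoff]
```

A flagged directory is only a candidate for re-export: an image with fewer timepoints than the rest will also be small, so each hit still needs a manual look.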
Upload the 2nd lot of 49 images from batch1...
|
Current progress: exported 127 of 342 Images. At about 3 images an hour, the remaining (342 - 127) / 3 ≈ 72 hours. First batch of 100 images: 2 failed and need re-exporting.
...and this should complete in 3 days. |
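The estimate above is simple arithmetic on the counts quoted in this thread (342 images, 127 done, about 3 per hour). As a small sketch, with `hours_remaining` being a hypothetical helper name:

```python
def hours_remaining(total: int, done: int, per_hour: float) -> float:
    """Estimate the hours left, assuming the export rate stays constant."""
    if per_hour <= 0:
        raise ValueError("per_hour must be positive")
    return (total - done) / per_hour


hours = hours_remaining(total=342, done=127, per_hour=3)
print(f"{hours:.1f} hours, about {hours / 24:.1f} days")
# prints "71.7 hours, about 3.0 days"
```

The constant-rate assumption is the weak point: images with fewer timepoints export faster, so this is a rough upper-level guide rather than a schedule.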
Looks like all remaining zarrs exported OK...
rename to remove
|
Started uploading 242 zips...
|
Checked size of zips on BioStudies. 20160912_Pos4_GL06.ome.zarr.zip is smaller than the others, as it has only a single timepoint: https://idr.openmicroscopy.org/webclient/?show=image-10648217
Use JS to list files from submissions page:
|
Looks like the
|
The sql doesn't contain 4053851.sql
|
@joshmoore I see from https://github.com/IDR/omero-mkngff/blob/4c1e32bb32a7b92f427634630e6b552cbb186509/src/omero_mkngff/__init__.py#L108 that We'll need to pick another file to update the pixels table with. I'll open an issue on the repo: IDR/omero-mkngff#7 |
Running this sql fixes the image
|
Actually, it seems that Bio-Formats is not fussy which file is referenced in
|
We now have all 342 Filesets available at https://uk1s3.embassy.ebi.ac.uk/bia-integrator-data/pages/S-BIAD852.html
Let's use the next batch (not the first 11 above) for testing IDR/omero-mkngff#8
Update to branch
Took about 8 minutes to generate each
Find image from last Fileset created and check pixels name, path...
Realised that this didn't work because I used the OLD Fileset ID to update pixels after the new Fileset was created. Then re-installed... |
Try with fresh filesets...
|
After running sql commands, get Image IDs from Fileset IDs..
Image is directly viewable! |
Going to generate
NB: The first 10 failed at the sql step as they had already been run on idr0125-pilot above. Need to sort this out...
... took 25 mins in total. |
Also saw another random fail for
just caught this...
|
Re-exporting on idr-ftp with pixels type fix as at ome/omero-cli-zarr#157 with merge branch
|
Also exported "batch2" as above... Renamed ALL 342 filesets to remove pattern
Zip - not deleting...
|
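The rename-then-zip step could be scripted roughly as below. This is a sketch under assumptions, not the commands that were run: the pattern to strip is not shown in this thread, so it is left as a caller-supplied `pattern` argument, and the function name and zip layout (the zarr directory as the top-level entry in each archive) are hypothetical.

```python
import shutil
from pathlib import Path


def strip_pattern_and_zip(
    root: Path, pattern: str, suffix: str = ".ome.zarr"
) -> list[Path]:
    """Rename each export directory to drop `pattern`, then zip it.

    The original directories are kept (nothing is deleted after zipping).
    """
    zips = []
    for d in sorted(root.iterdir()):
        if not (d.is_dir() and d.name.endswith(suffix)):
            continue
        target = d.with_name(d.name.replace(pattern, ""))
        if target != d:
            if target.exists():
                raise FileExistsError(f"refusing to overwrite {target}")
            d.rename(target)
        # make_archive appends ".zip" to the base name itself
        archive = shutil.make_archive(
            str(target), "zip", root_dir=root, base_dir=target.name
        )
        zips.append(Path(archive))
    return zips
```

Raising on a name collision rather than overwriting is a deliberate choice here, since two differently named exports collapsing to the same name would silently lose an image.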
on idr-testing... (goofys is at
On idr-ftp, delete the existing (invalid) data and upload all images...
idr-testing...
E.g. looks good: https://ome.github.io/ome-ngff-validator/?source=https://uk1s3.embassy.ebi.ac.uk/idr0091/zarr/20160526_pos0_GL01.ome.zarr |
On idr-testing, let's try to update symlink to fix dtype issues... Test with Image:
That Image has symlink like this:
As omero-server...
Symlink looks good:
Fixed!
|
We can actually use the IDR/idr-utils#54 script to do this, if we provide mapping.csv
Test with a single Image on idr-testing...
As omero-server...
login as public user, then..
Success!
|
On idr-testing, make
From that, we can make symlinks mapping file as above:
Now we run managed_repo_symlinks for each Image...
EDIT... took about 15 mins to do 342 images...
|
All good 👍
|
On idr-ftp, the zips created on 18th Dec (above) have been uploaded (not sure of exact date), following deletion of the old
|
Images updated on https://uk1s3.embassy.ebi.ac.uk/bia-integrator-data/pages/S-BIAD852.html
New idr0090.csv file at IDR/mkngff_upgrade_scripts@0522d43 and IDR/mkngff_upgrade_scripts@c92c217 based on csv provided by Kola.
Running mkngff on idr-next (since this has the NGFF filesets that we wish to replace), using
EDIT: something went wrong as all the .sql files are empty!
Fixed the idr0091.csv (missing
Pushed at IDR/mkngff_upgrade_scripts@03b02e7
Won't test these yet as idr-testing is being used for microservices testing. |
On new pilot #675 (comment) Ran all the mkngff SQL scripts... ending for idr0091 with...
Last row in idr0091.csv at https://github.com/IDR/mkngff_upgrade_scripts/blob/1b64ab85fab537faafd62d6e19c01cf5ab32d11f/ngff_filesets/idr0091.csv
this image is http://localhost:1080/webclient/?show=image-10648367
So, the idr0091.csv above is out of date, and was missed from the update at IDR/mkngff_upgrade_scripts@03b02e7 |
Try to clean-up (delete) the 342 Filesets we created above - last one ID
http://localhost:1080/webclient/?show=image-15150680 in webclient on For all Filesets
For Last Image/Fileset...
This corresponds to the Fileset IDs updated in IDR/mkngff_upgrade_scripts@25c5372 So, NEW Fileset IDs are First row...
idr0091_ids.csv (removed first line NEW Fileset ID, OLD Fileset ID
Then
|
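A two-column file such as idr0091_ids.csv can be read into a lookup table before running any clean-up. The sketch below assumes the column order described above (NEW Fileset ID first, OLD Fileset ID second) and that IDs are plain integers; the function name is hypothetical and this is not the script used in the thread.

```python
import csv
from pathlib import Path


def read_fileset_mapping(csv_path: Path) -> dict[int, int]:
    """Map OLD Fileset ID to NEW Fileset ID from rows of 'new,old'.

    Rows that are not two integers (for example a header line) are skipped.
    """
    mapping: dict[int, int] = {}
    with csv_path.open(newline="") as fh:
        for row in csv.reader(fh):
            if len(row) < 2:
                continue
            new_id, old_id = row[0].strip(), row[1].strip()
            if not (new_id.isdigit() and old_id.isdigit()):
                continue  # header or malformed row
            mapping[int(old_id)] = int(new_id)
    return mapping
```

Keying the table by the OLD ID makes it easy to confirm which of the two columns is about to be deleted, which matters given the earlier mix-up between OLD and NEW Fileset IDs.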