Add option to not save generated data #86

swkeemink · 2020-03-06T17:58:30Z

When defining an experiment it is now possible to define the format in which data is saved with the data_format option. At the moment this can only be 'npy' or None. If it is 'npy', the behavior will be as before. If it is set to None, no data will be saved. This is useful for users that don't need to store any of the data directly on the hard drive, and just want to define how and where to use the data in their own workflows.

I have also laid the basic groundwork for other formats than npy, and am working on saving everything in hdf5 files.

Note that there is currently a bug with our use of numpy.load, see #85.

Can be 'npy' or None

scottclowe · 2020-03-08T20:06:16Z

If I understand correctly, you're intending to merge this as-is with the option of not saving added, and HDF5 support will be in a different PR?

swkeemink · 2020-03-08T21:38:36Z

Correct!

fissa/core.py

examples/Basic usage.ipynb

scottclowe · 2020-03-08T04:05:08Z

fissa/core.py

-                 lowmemory_mode=False, datahandler_custom=None):
+                 lowmemory_mode=False, datahandler_custom=None,
+                 data_format='npy'):


Better code style is to have the ): on its own line, so you don't get this diff where a comma and a new line are inserted later on (which happened here).

Suggested change

lowmemory_mode=False, datahandler_custom=None):

lowmemory_mode=False, datahandler_custom=None,

data_format='npy'):

lowmemory_mode=False, datahandler_custom=None,

data_format='npy',

):

Very good point, I will change this.

scottclowe · 2020-03-08T04:06:55Z

fissa/core.py

@@ -196,6 +197,9 @@ def __init__(self, images, rois, folder, nRegions=4,
            A custom datahandler for handling ROIs and calcium data can
            be given here. See datahandler.py (the default handler) for
            an example.
+        data_format : string or None, optional
+            How FISSA generated data will be saved.
+            Can be 'npy' (default) or None. In the future will also support 'hdf5'.


Shouldn't add comments about prospective future features to docstrings.

I think you need to specify the behaviour better. At the moment it isn't clear what happens if None is provided.

Will clarify this in the docstring.

fissa/core.py

scottclowe · 2020-03-09T01:15:02Z

fissa/core.py

+        data_format : string or None, optional
+            How FISSA generated data will be saved.
+            Can be 'npy' (default) or None. In the future will also support 'hdf5'.


Suggested change

data_format : string or None, optional

How FISSA generated data will be saved.

Can be 'npy' (default) or None. In the future will also support 'hdf5'.

data_format : 'npy' or None, optional

The format in which data generated by FISSA will be saved.

If this is `'npy'`, data will be saved in the numpy format using

`numpy.save`. If this is `None`, the data will not be saved to

a cache file and will have to be generated again if the same

data is separated again. Default is `'npy'`.

fissa/core.py

scottclowe · 2020-03-09T01:28:20Z

fissa/core.py

+            # try to load data from filename
+            if not redo:
+                try:
+                    nCell, raw, roi_polys = np.load(fname)
+                    print('Reloading previously prepared data...')
+                except BaseException:
+                    redo = True


Remove superfluous nesting.

Suggested change

# try to load data from filename

if not redo:

try:

nCell, raw, roi_polys = np.load(fname)

print('Reloading previously prepared data...')

except BaseException:

redo = True

# try to load data from filename

if not redo:

try:

nCell, raw, roi_polys = np.load(fname)

print('Reloading previously prepared data...')

except BaseException:

redo = True

fissa/core.py

swkeemink · 2020-03-16T11:46:10Z

Besides the above changes (which I generally approve of) we should also expand the tests to avoid reducing code coverage as it is now.

Co-Authored-By: Scott Lowe <[email protected]>

codecov · 2020-03-16T11:56:21Z

Codecov Report

Merging #86 into master will decrease coverage by 1.33%.
The diff coverage is 58.62%.

@@            Coverage Diff             @@
##           master      #86      +/-   ##
==========================================
- Coverage   79.27%   77.94%   -1.34%     
==========================================
  Files           9        9              
  Lines         666      680      +14     
  Branches      129      136       +7     
==========================================
+ Hits          528      530       +2     
- Misses         94       99       +5     
- Partials       44       51       +7

Impacted Files	Coverage Δ
fissa/core.py	`85.52% <58.62%> (-4.67%)`	⬇️

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 50f148c...0a41db9. Read the comment docs.

Co-Authored-By: Scott Lowe <[email protected]>

scottclowe · 2020-05-18T09:32:40Z

Closed in favour of #129, which handles the ability to run fissa without saving to a cache more succinctly. @swkeemink please make a new PR for saving to HDF5 format.

swkeemink added 3 commits March 6, 2020 17:53

ENH: added data_format option

5bc85b1

Can be 'npy' or None

FIX: Matlab saving now creates folder if necessary

949db01

ENH: updated the basic tutorial for data_format option

7c26e7b

swkeemink requested a review from scottclowe March 6, 2020 17:58

swkeemink changed the title ~~Adding the option to change the data format~~ Adding an option to change the data format Mar 6, 2020

swkeemink mentioned this pull request Mar 6, 2020

Adding more options for data storage #82

Open

scottclowe added vX.Y.0 New Minor Release This should be included in a minor release enhancement labels Mar 8, 2020

scottclowe requested changes Mar 9, 2020

View reviewed changes

scottclowe changed the title ~~Adding an option to change the data format~~ Add option to not save generated data Mar 9, 2020

RF: Use set instead of list

4e06308

Co-Authored-By: Scott Lowe <[email protected]>

swkeemink and others added 3 commits March 16, 2020 12:05

RF: Use path.join and reduce nesting

892c139

Co-Authored-By: Scott Lowe <[email protected]>

RF: use path.join

efef4e6

Co-Authored-By: Scott Lowe <[email protected]>

RF: use path.join

0a41db9

Co-Authored-By: Scott Lowe <[email protected]>

scottclowe mentioned this pull request May 4, 2020

ENH: Add option to run without cache #129

Merged

scottclowe closed this May 18, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add option to not save generated data #86

Add option to not save generated data #86

swkeemink commented Mar 6, 2020

scottclowe commented Mar 8, 2020

swkeemink commented Mar 8, 2020

scottclowe Mar 8, 2020

swkeemink Mar 16, 2020

scottclowe Mar 8, 2020

scottclowe Mar 8, 2020

swkeemink Mar 16, 2020

scottclowe Mar 9, 2020

scottclowe Mar 9, 2020

swkeemink commented Mar 16, 2020

codecov bot commented Mar 16, 2020 •

edited

Loading

scottclowe commented May 18, 2020

-        data_format : string or None, optional
-            How FISSA generated data will be saved.
-            Can be 'npy' (default) or None. In the future will also support 'hdf5'.
+        data_format : 'npy' or None, optional
+            The format in which data generated by FISSA will be saved.
+            If this is `'npy'`, data will be saved in the numpy format using
+            `numpy.save`. If this is `None`, the data will not be saved to
+            a cache file and will have to be generated again if the same
+            data is separated again. Default is `'npy'`.

Add option to not save generated data #86

Add option to not save generated data #86

Conversation

swkeemink commented Mar 6, 2020

scottclowe commented Mar 8, 2020

swkeemink commented Mar 8, 2020

scottclowe Mar 8, 2020

Choose a reason for hiding this comment

swkeemink Mar 16, 2020

Choose a reason for hiding this comment

scottclowe Mar 8, 2020

Choose a reason for hiding this comment

scottclowe Mar 8, 2020

Choose a reason for hiding this comment

swkeemink Mar 16, 2020

Choose a reason for hiding this comment

scottclowe Mar 9, 2020

Choose a reason for hiding this comment

scottclowe Mar 9, 2020

Choose a reason for hiding this comment

swkeemink commented Mar 16, 2020

codecov bot commented Mar 16, 2020 • edited Loading

Codecov Report

scottclowe commented May 18, 2020

codecov bot commented Mar 16, 2020 •

edited

Loading