Fix moving average of preprocessed OHC data #325

xylar · 2018-04-05T05:45:14Z

This is accomplished by writing out the multi-file data set and reading it in again as as single-file data set.

xylar · 2018-04-05T05:48:03Z

This is intended to address #324. I think the issue at the heart of this is something to do with combining dask and the rolling operator in xarray but I don't have enough time or interest to make a proper issue on the xarray forum right now. This solution seems simple enough and the file generated is tiny.

xylar · 2018-04-05T05:48:40Z

@milenaveneziani, could you check if this works for you in whatever environment(s) you have handy?

xylar · 2018-04-05T05:49:47Z

mpas_analysis/ocean/plot_depth_integrated_time_series_subtask.py

+            # (without dask)
+            dsPreprocessed = dsPreprocessed.drop('xtime')
+            write_netcdf(dsPreprocessed, self.preprocessedFileName)
+            dsPreprocessed = xarray.open_dataset(self.preprocessedFileName)


@pwolfram, if you have time to take a look at this, it's mainly a question of seeing if you're good with this solution for converting a multi-file data set to a single file data set or if you'd suggest some other way of handling the issue.

My understanding from S Hoyer is that this is the preferred way to handle these types of problems. The only issue I foresee with this is that as data becomes larger, this doesn't scale too well. However, recent work has been to develop parallel writing functionality into xarray so I wouldn't worry about this in the short term.

I think this is a reasonable way to convert a multi-file dataset into a single file data set 👍

pwolfram

Looks good to me

pwolfram · 2018-04-09T15:40:45Z

mpas_analysis/ocean/plot_depth_integrated_time_series_subtask.py

                                    'not be plotted.')
                preprocessedReferenceRunName = 'None'

+            # rolling mean seems to have trouble with dask data sets to we


Minor typo: 'to' should be 'so'

pwolfram · 2018-04-09T15:42:31Z

mpas_analysis/ocean/plot_depth_integrated_time_series_subtask.py

+            # (without dask)
+            dsPreprocessed = dsPreprocessed.drop('xtime')
+            write_netcdf(dsPreprocessed, self.preprocessedFileName)
+            dsPreprocessed = xarray.open_dataset(self.preprocessedFileName)


My understanding from S Hoyer is that this is the preferred way to handle these types of problems. The only issue I foresee with this is that as data becomes larger, this doesn't scale too well. However, recent work has been to develop parallel writing functionality into xarray so I wouldn't worry about this in the short term.

I think this is a reasonable way to convert a multi-file dataset into a single file data set 👍

This is accomplished by writing out the multi-file data set and reading it in again as as single-file data set.

xylar · 2018-04-12T14:27:11Z

@milenaveneziani, is this something you might be able to test sometime soon?

milenaveneziani · 2018-04-12T15:00:51Z

sorry for the delay @xylar: I'll be testing this today.

milenaveneziani · 2018-04-12T15:55:12Z

@xylar: if I test this on edison, will I be using the xarray/dask version that was causing the problem?

xylar · 2018-04-12T16:41:31Z

@milenaveneziani, if you use e3sm-unified/1.1.3, I think that is the right version. Even if not, the important thing is that things work with that particular version.

milenaveneziani

Tested on edison with e3sm_unified_1.1.3 and all worked fine.

xylar · 2018-04-12T20:19:14Z

Thanks, @milenaveneziani!

xylar added the bug label Apr 5, 2018

xylar self-assigned this Apr 5, 2018

xylar requested review from milenaveneziani and pwolfram April 5, 2018 05:45

xylar commented Apr 5, 2018

View reviewed changes

pwolfram approved these changes Apr 9, 2018

View reviewed changes

Fix moving average of preprocessed OHC data

668f697

This is accomplished by writing out the multi-file data set and reading it in again as as single-file data set.

xylar force-pushed the fix_moving_average branch from e188671 to 668f697 Compare April 12, 2018 14:26

milenaveneziani approved these changes Apr 12, 2018

View reviewed changes

xylar merged commit dae9187 into MPAS-Dev:develop Apr 12, 2018

xylar deleted the fix_moving_average branch April 12, 2018 20:19

This was referenced Apr 12, 2018

Error in timeSeriesOHCAnomaly: plotDepthIntegratedTimeSeriesGlobal #324

Closed

Update to v0.7.5 in docs, setup and recipe #334

Merged

Fix moving average of preprocessed OHC data #325

Fix moving average of preprocessed OHC data #325

Uh oh!

Conversation

xylar commented Apr 5, 2018

Uh oh!

xylar commented Apr 5, 2018

Uh oh!

xylar commented Apr 5, 2018

Uh oh!

xylar Apr 5, 2018

Choose a reason for hiding this comment

Uh oh!

pwolfram Apr 9, 2018

Choose a reason for hiding this comment

Uh oh!

pwolfram left a comment

Choose a reason for hiding this comment

Uh oh!

pwolfram Apr 9, 2018

Choose a reason for hiding this comment

Uh oh!

pwolfram Apr 9, 2018

Choose a reason for hiding this comment

Uh oh!

xylar commented Apr 12, 2018

Uh oh!

milenaveneziani commented Apr 12, 2018

Uh oh!

milenaveneziani commented Apr 12, 2018

Uh oh!

xylar commented Apr 12, 2018

Uh oh!

milenaveneziani left a comment

Choose a reason for hiding this comment

Uh oh!

xylar commented Apr 12, 2018

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants