MLAir issueshttps://gitlab.jsc.fz-juelich.de/esde/machine-learning/mlair/-/issues2020-10-02T08:52:44+02:00https://gitlab.jsc.fz-juelich.de/esde/machine-learning/mlair/-/issues/186log scale for feature importance2020-10-02T08:52:44+02:00Ghost Userlog scale for feature importanceIf a network highly depends on a single variable, the influence of the remaining variables is hard to determine graphically because of the linear scale.
Set log scale for axis: https://seaborn.pydata.org/examples/horizontal_boxplot.html...If a network highly depends on a single variable, the influence of the remaining variables is hard to determine graphically because of the linear scale.
Set log scale for axis: https://seaborn.pydata.org/examples/horizontal_boxplot.html
Use symmetric log scale because skill scores can be both positive and negative: https://matplotlib.org/3.1.1/api/scale_api.html#matplotlib.scale.SymmetricalLogScalehttps://gitlab.jsc.fz-juelich.de/esde/machine-learning/mlair/-/issues/182REFAC: batch size minimum2020-09-29T16:25:09+02:00Ghost UserREFAC: batch size minimumThe keras iterator create a distinct file for each batch. If batch size becomes very small, this will result in the creation of many files containing only a very smal amount of data (or only a single data point if batch size is 1). There...The keras iterator create a distinct file for each batch. If batch size becomes very small, this will result in the creation of many files containing only a very smal amount of data (or only a single data point if batch size is 1). Therefore create a threshold to combine batches in a file if the batch size is below.https://gitlab.jsc.fz-juelich.de/esde/machine-learning/mlair/-/issues/181Model visualisation2020-09-29T16:08:54+02:00Ghost UserModel visualisationWe might integrate better visualisations of the model.
Here is a repo wich summarises different tools (we have to evaluate which ones can read Kras/TF model structures).
https://github.com/ashishpatel26/Tools-to-Design-or-Visualize-Arc...We might integrate better visualisations of the model.
Here is a repo wich summarises different tools (we have to evaluate which ones can read Kras/TF model structures).
https://github.com/ashishpatel26/Tools-to-Design-or-Visualize-Architecture-of-Neural-Networkhttps://gitlab.jsc.fz-juelich.de/esde/machine-learning/mlair/-/issues/179REFAC: unify default experiment name2020-12-10T17:16:16+01:00Ghost UserREFAC: unify default experiment nameCurrently, there are two ways to name an experiment if not provided.
* If using the default workflow, the name will be set to `testrun`.
* Using the setup run module as standalone or in a custom workflow, the name will become `TestExper...Currently, there are two ways to name an experiment if not provided.
* If using the default workflow, the name will be set to `testrun`.
* Using the setup run module as standalone or in a custom workflow, the name will become `TestExperiment`
Choose one of the names above and apply for both cases or think about a new default experiment name.https://gitlab.jsc.fz-juelich.de/esde/machine-learning/mlair/-/issues/178REFAC: experiment date vs name2021-07-23T18:18:55+02:00Ghost UserREFAC: experiment date vs nameRefactor the behaviour of experiment date and experiment name. It is not clear, why it is not possible to set the experiment name from outside but only the experiment date.
* [ ] Remove the experiment date parameter!
Furthermore, adju...Refactor the behaviour of experiment date and experiment name. It is not clear, why it is not possible to set the experiment name from outside but only the experiment date.
* [ ] Remove the experiment date parameter!
Furthermore, adjust the naming with the appended sampling rate. Since the sampling rate is allowed to be a tuple, the experiment name can become quite ugly!
* [ ] Either check if sampling rate is a tuple (and use the 2nd entry in this case) or remove the code that adds the sampling to the experiment name.https://gitlab.jsc.fz-juelich.de/esde/machine-learning/mlair/-/issues/167TF light2020-09-16T12:14:22+02:00Ghost UserTF lightIt might be usefull to convert the network to a "production" state.
See https://www.tensorflow.org/lite/convertIt might be usefull to convert the network to a "production" state.
See https://www.tensorflow.org/lite/converthttps://gitlab.jsc.fz-juelich.de/esde/machine-learning/mlair/-/issues/166error on failure2022-08-31T10:41:49+02:00Ghost Usererror on failureIf MLAir raises an error, it currently cannot find the logging file and raises another error. correct this!If MLAir raises an error, it currently cannot find the logging file and raises another error. correct this!https://gitlab.jsc.fz-juelich.de/esde/machine-learning/mlair/-/issues/163(Too) big data2022-08-31T10:39:40+02:00Ghost User(Too) big dataData stored locally can grow up to 1.5 GB per file (station).
* [ ] check if storing of total data is required or not (it could be, that this data are not used - only the subsets.)
* [ ] Think about another storing strategy to replace `...Data stored locally can grow up to 1.5 GB per file (station).
* [ ] check if storing of total data is required or not (it could be, that this data are not used - only the subsets.)
* [ ] Think about another storing strategy to replace `.pickle` e.g. by xarrays dataset storagehttps://gitlab.jsc.fz-juelich.de/esde/machine-learning/mlair/-/issues/161Custom data split ratio2020-09-21T13:35:39+02:00Ghost UserCustom data split ratiocurrently the data split ratio between training and validation data is hardcoded to 80-20. Add the possibility to change this.currently the data split ratio between training and validation data is hardcoded to 80-20. Add the possibility to change this.https://gitlab.jsc.fz-juelich.de/esde/machine-learning/mlair/-/issues/158Climatological Skill Scores (hourly resolution)2020-12-10T16:37:35+01:00Ghost UserClimatological Skill Scores (hourly resolution)Is the concept of climatological skill scores applicable for hourly resoluted data? Is it required to subdivide the mean state by daytime in addition to the monthly separation?Is the concept of climatological skill scores applicable for hourly resoluted data? Is it required to subdivide the mean state by daytime in addition to the monthly separation?Hourly data resolutionhttps://gitlab.jsc.fz-juelich.de/esde/machine-learning/mlair/-/issues/156Persistence forecast (hourly resolution)2020-12-10T16:40:31+01:00Ghost UserPersistence forecast (hourly resolution)Create a new heuristic for a persistence forecast when investigating hourly resoluted data.Create a new heuristic for a persistence forecast when investigating hourly resoluted data.Hourly data resolutionhttps://gitlab.jsc.fz-juelich.de/esde/machine-learning/mlair/-/issues/155BUG: empty monthly plot using hourly data2020-12-10T16:40:51+01:00Ghost UserBUG: empty monthly plot using hourly dataThe monthly summary plot seems to be empty when using data with hourly temporal resolution. Investigate origin of this behaviour.The monthly summary plot seems to be empty when using data with hourly temporal resolution. Investigate origin of this behaviour.Hourly data resolutionhttps://gitlab.jsc.fz-juelich.de/esde/machine-learning/mlair/-/issues/150seed for tf2020-07-21T10:13:46+02:00Ghost Userseed for tfis it possible to set the seedis it possible to set the seedhttps://gitlab.jsc.fz-juelich.de/esde/machine-learning/mlair/-/issues/148Implement ROC Curves2020-07-15T11:45:32+02:00Ghost UserImplement ROC CurvesIt might be of interest to generate ROC curves for threshold exceedance predictions.
See Wilks (2006, Ch 7.4.6) for detailed info.It might be of interest to generate ROC curves for threshold exceedance predictions.
See Wilks (2006, Ch 7.4.6) for detailed info.https://gitlab.jsc.fz-juelich.de/esde/machine-learning/mlair/-/issues/132Tracking of units2020-12-10T16:39:50+01:00Ghost UserTracking of unitsWe should track the units to automatically create correct labels on plotsWe should track the units to automatically create correct labels on plotshttps://gitlab.jsc.fz-juelich.de/esde/machine-learning/mlair/-/issues/128Padding2D: allow to add other user defined paddings2020-08-24T13:16:44+02:00Ghost UserPadding2D: allow to add other user defined paddingsWe should add a method which allows adding new padding types to `allowed_paddings`.
Maybe something like
```python
def add_custom_padding(names, padding_layer):
self.allowed_paddings.update(**dict.fromkeys(names, padding_layer))
```We should add a method which allows adding new padding types to `allowed_paddings`.
Maybe something like
```python
def add_custom_padding(names, padding_layer):
self.allowed_paddings.update(**dict.fromkeys(names, padding_layer))
```https://gitlab.jsc.fz-juelich.de/esde/machine-learning/mlair/-/issues/120test_build_model not generic enough?2020-09-21T16:31:32+02:00Ghost Usertest_build_model not generic enough?
`test_buld_model()` uses the imported model MyModel. Shouldn't it be better to use a dummy test model for the setup and instead create custom tests for all classes in model_class.py?
`test_buld_model()` uses the imported model MyModel. Shouldn't it be better to use a dummy test model for the setup and instead create custom tests for all classes in model_class.py?https://gitlab.jsc.fz-juelich.de/esde/machine-learning/mlair/-/issues/117set real path2020-04-30T17:36:49+02:00Ghost Userset real pathWhen using ln -s some links might be not found. Therefore, integrate `os.path.realpath()`
See
https://unix.stackexchange.com/questions/196656/when-i-cd-through-a-symlink-why-does-pwd-show-the-symlink-instead-of-the-real-p/196753When using ln -s some links might be not found. Therefore, integrate `os.path.realpath()`
See
https://unix.stackexchange.com/questions/196656/when-i-cd-through-a-symlink-why-does-pwd-show-the-symlink-instead-of-the-real-p/196753https://gitlab.jsc.fz-juelich.de/esde/machine-learning/mlair/-/issues/105Force custom program termination2020-04-03T10:47:36+02:00Ghost UserForce custom program terminationOn run call, it is possible to specify, which steps should be executed. Because every steps depends on its previous steps, it shouldn't be possible to call training only. But on the other side, it should be possible to stop after preproc...On run call, it is possible to specify, which steps should be executed. Because every steps depends on its previous steps, it shouldn't be possible to call training only. But on the other side, it should be possible to stop after preprocessing or any other step. This is required to run different parts of an experiment on different partitions (see #104). If a termination step is given, only all precursory steps and itself are executed. Progress is saved locally (anyway because of #102).HPChttps://gitlab.jsc.fz-juelich.de/esde/machine-learning/mlair/-/issues/104Run single experiment steps on different partitions2020-04-03T10:46:41+02:00Ghost UserRun single experiment steps on different partitionsDepends on #102, #105
* [ ] wait for #102
* [ ] wait for #105
* [ ] create more customised shell script to execute different steps on different partitions (not parallel!): pre on cpu, train on gpu, ...Depends on #102, #105
* [ ] wait for #102
* [ ] wait for #105
* [ ] create more customised shell script to execute different steps on different partitions (not parallel!): pre on cpu, train on gpu, ...HPC