README.md 6.05 KB
Newer Older
oparisot's avatar
oparisot committed
1 2
# gapIt

Olivier Parisot's avatar
Olivier Parisot committed
3 4 5
Copyright 2014 - 2019 Luxembourg Institute of Science and Technology (LIST - [http://www.list.lu/](http://www.list.lu/)). 

Any use of this software constitutes full acceptance of all terms of the [software's license](./LICENSE.txt).
oparisot's avatar
oparisot committed
6 7


Olivier Parisot's avatar
Olivier Parisot committed
8 9 10 11
## Synopsis

gapIT is an interactive software for gap filling in hydrological time series.

oparisot's avatar
oparisot committed
12
Given as input hydrological discharge data series measured at gauging stations, it performs an automated calculation of missing data using different data-infilling techniques. Donor station(s) are automatically selected based on Dynamic Time Warping, geographical proximity and upstream/downstream relationships among stations. For each gap, the tool computes several flow estimates through various data-infilling techniques, including interpolation, multiple regression, regression trees and neural networks. The visual application provides the possibility for the user to select different donor station(s) w.r.t. those automatically selected.
Olivier Parisot's avatar
Olivier Parisot committed
13 14 15 16 17 18 19 20 21 22 23

The results are validated by randomly creating artificial gaps of different lengths and positions along the entire records. Using the Root Mean Squared Error and the Nash-Sutcliffe coefficient as performance measures, the method is evaluated based on a comparison with the actual measured discharge values.

The interactive but automated approach of gapIT, coupled with a visual inspection system for user-defined refinement, allows for a standardized objective infilling, where subjective decisions are allowed but are at the same time traceable.


## Motivation

Missing records in hydrological databases represent a loss of information and a serious drawback in water management (measures of river flow for example). An incomplete time series prevents the computation of hydrological statistics and indicators. Also, records with data gaps are not suitable as input or validation data for hydrological or hydrodynamic modeling.


Olivier Parisot's avatar
Olivier Parisot committed
24
## Installation / execution
Olivier Parisot's avatar
Olivier Parisot committed
25

oparisot's avatar
oparisot committed
26
### Requirements
oparisot's avatar
oparisot committed
27

oparisot's avatar
oparisot committed
28
Hardware: RAM>=4Gb.
oparisot's avatar
oparisot committed
29

oparisot's avatar
oparisot committed
30
Software: 
Olivier Parisot's avatar
Olivier Parisot committed
31 32
* JDK (>=1.7 - [http://www.oracle.com/technetwork/java/javase/downloads/](http://www.oracle.com/technetwork/java/javase/downloads/)).
* Maven (>=3.0.2 - [https://maven.apache.org/](https://maven.apache.org/))
Olivier Parisot's avatar
Olivier Parisot committed
33

Olivier Parisot's avatar
Olivier Parisot committed
34

oparisot's avatar
oparisot committed
35
### Compilation
oparisot's avatar
oparisot committed
36

Olivier Parisot's avatar
Olivier Parisot committed
37 38
    mvn clean package -Dmaven.test.skip=true
    
Olivier Parisot's avatar
Olivier Parisot committed
39
(*Some tests are currently not ok, and the build fails without the '-Dmaven.test.skip=true' option.*)
Olivier Parisot's avatar
Olivier Parisot committed
40

Olivier Parisot's avatar
Olivier Parisot committed
41

oparisot's avatar
oparisot committed
42
### Data preparation
Olivier Parisot's avatar
Olivier Parisot committed
43

Olivier Parisot's avatar
Olivier Parisot committed
44
unzip [data_fake2.zip](./data_fake2.zip) in the same directory
oparisot's avatar
oparisot committed
45

Olivier Parisot's avatar
Olivier Parisot committed
46

oparisot's avatar
oparisot committed
47 48
### Execution for Windows OS

Olivier Parisot's avatar
Olivier Parisot committed
49
    run.bat
oparisot's avatar
oparisot committed
50

oparisot's avatar
oparisot committed
51
### Execution for other OS
Olivier Parisot's avatar
Olivier Parisot committed
52

Olivier Parisot's avatar
Olivier Parisot committed
53
    java -jar -Xmx4G target/gapIt.jar
Olivier Parisot's avatar
Olivier Parisot committed
54 55 56 57


## Data

oparisot's avatar
oparisot committed
58
gapIt was used to fill gaps in water discharge time series at Luxembourg (see the 'References' section).
oparisot's avatar
oparisot committed
59
During tests phases, it was used too for the processing of water level time series.
oparisot's avatar
oparisot committed
60

oparisot's avatar
oparisot committed
61 62
Here, fake data are provided within the distribution [data_fake2.zip] (./data_fake2.zip). These data were derived from real data using a random method. To get the real data or to integrate your own data, please contact Olivier Parisot (olivier.parisot@list.lu) or Ivonne Trebs (ivonne.trebs@list.lu).

Olivier Parisot's avatar
Olivier Parisot committed
63
In this archive ([data_fake2.zip](./data_fake2.zip)), the required data files are:
oparisot's avatar
oparisot committed
64 65 66 67 68 69
* *all_valid_q_series_complete2.arff*: the time series representing the measured discharge for each station, under the [ARFF format] (https://weka.wikispaces.com/ARFF).
* *knowledgeDB20-discharge.arff*: the knowledge database containing the Case-Based Reasoning data, under the ARFF format too.
* *stations_coordinates.txt*: the coordinates of each station.
* *stations_relationships_1.xml*: the upstream/downstreams relationships among the stations (file 1/2).
* *stations_relationships_2.xml*: the upstream/downstreams relationships among the stations (file 2/2).
* *shapeCountry.jpg*: a picture to show the shape of the studied country.
oparisot's avatar
oparisot committed
70

Olivier Parisot's avatar
Olivier Parisot committed
71

oparisot's avatar
oparisot committed
72 73
## Screenshots

oparisot's avatar
oparisot committed
74
![pic1](/pictures/picture0.png)
oparisot's avatar
oparisot committed
75

oparisot's avatar
oparisot committed
76
![pic2](/pictures/picture2.png)
oparisot's avatar
oparisot committed
77

oparisot's avatar
oparisot committed
78
![pic3](/pictures/picture3.png)
oparisot's avatar
oparisot committed
79

oparisot's avatar
oparisot committed
80

oparisot's avatar
oparisot committed
81 82
## Video

Olivier Parisot's avatar
Olivier Parisot committed
83
Want to see the features of gapIt before installing it? Then you can check this [short demo](/video/GapIT_Video2.wmv).
oparisot's avatar
oparisot committed
84

oparisot's avatar
oparisot committed
85

Olivier Parisot's avatar
Olivier Parisot committed
86 87
## References

oparisot's avatar
oparisot committed
88
This section lists scientific publications in which gapIt is used. 
oparisot's avatar
oparisot committed
89

Olivier Parisot's avatar
Olivier Parisot committed
90
 * Laura Giustarini, Olivier Parisot, Mohammad Ghoniem, Renaud Hostache, Ivonne Trebs, Benoît Otjacques: **«A user-driven case-based reasoning tool for infilling missing values in daily mean river flow records»**, «Environmental Modelling and Software», Elsevier, 5/2016 [(link)](http://www.sciencedirect.com/science/article/pii/S1364815216301050)
oparisot's avatar
oparisot committed
91

Olivier Parisot's avatar
Olivier Parisot committed
92
 * Laura Giustarini, Olivier Parisot, Mohammad Ghoniem, Renaud Hostache, Ivonne Trebs, Benoît Otjacques:**«gapIT: a user-driven case-based reasoning tool for infilling missing values in daily mean river flow records»**, «Tag der Hydrologie 2016», Koblenz, Germany, 17/3/2016 [(link)](https://scholar.google.lu/citations?view_op=view_citation&hl=fr&user=OeqhaZ4AAAAJ&sortby=pubdate&citation_for_view=OeqhaZ4AAAAJ:4TOpqqG69KYC)
oparisot's avatar
oparisot committed
93

Olivier Parisot's avatar
Olivier Parisot committed
94
 * Laura Giustarini, Olivier Parisot, Mohammad Ghoniem, Ivonne Trebs, Nicolas Médoc, Olivier Faber, Renaud Hostache, Patrick Matgen, Benoît Otjacques: **«Data-infilling in daily mean river flow records: first results using a visual analytics tool (gapIT)»**, «European Geosciences Union General Assembly 2015 (EGU 2015), Geophysical Research Abstracts Volume 17», Vienna, Austria, 4/2015 [(link)](http://adsabs.harvard.edu/abs/2015EGUGA..1710462G)
oparisot's avatar
oparisot committed
95

Olivier Parisot's avatar
Olivier Parisot committed
96
 * Olivier Parisot, Laura Giustarini, Olivier Faber, Renaud Hostache, Ivonne Trebs, Mohammad Ghoniem: **«gapIT: Un outil visuel pour l'imputation de valeurs manquantes en hydrologie»**, «15ème conférence internationale sur l'extraction et la gestion des connaissances (EGC 2015)», Luxembourg, 1/2015 [(link)](http://editions-rnti.fr/?inprocid=1002107&PHPSESSID=ks64gh4ktvuilvujuu9lcva5i4&lg=en&PHPSESSID=ks64gh4ktvuilvujuu9lcva5i4)
Olivier Parisot's avatar
Olivier Parisot committed
97

Olivier Parisot's avatar
Olivier Parisot committed
98 99 100 101 102
If you also have employed gapIt, send an email to [Olivier Parisot](olivier.parisot@list.lu) to have your publication listed here.


## Contact

Olivier Parisot's avatar
Olivier Parisot committed
103
Any question? Please contact [Olivier Parisot](mailto:olivier.parisot@list.lu) or visit the [LIST website](https://www.list.lu/en/contact/).
Olivier Parisot's avatar
Olivier Parisot committed
104

Olivier Parisot's avatar
Olivier Parisot committed
105

oparisot's avatar
oparisot committed
106

Olivier Parisot's avatar
Olivier Parisot committed
107 108
## License

Olivier Parisot's avatar
Olivier Parisot committed
109
Licensed under GNU General Public License version 3, see [LICENSE.txt](./LICENSE.txt).