Description
After merging data records to common categorizations, and clearing the data of obvious errors, to avoid potential reporting bias and other errors, a slicing algorithm was applied to cut out the less reliable part.
Observation
By going through the data distribution plots, we have the following observations:
- The number of data records varies from country to country, sometimes by orders of magnitude.
- The distribution of records are not even through time.
Examples:
Assumption
The slicing algorithm is designed based on the following assumptions:
- The reliability and accuracy of every country increase through time.
- Hazards tend to be more frequently due to the global warming and climate change.