Even more than with other data sets that Kaggle has featured, there’s a huge amount of data cleaning and preparation that goes into putting together a long-time study of climate trends. Early data was collected by technicians using mercury thermometers, where any variation in the visit time impacted measurements. In the 1940s, the construction of airports caused many weather stations to be moved. In the 1980s, there was a move to electronic thermometers that are said to have a cooling bias.
Given this complexity, there are a range of organizations that collate climate trends data. The three most cited land and ocean temperature data sets are NOAA’s MLOST, NASA’s GISTEMP and the UK’s HadCrut.
We have repackaged the data from a newer compilation put together by the Berkeley Earth, which is affiliated with Lawrence Berkeley National Laboratory. The Berkeley Earth Surface Temperature Study combines 1.6 billion temperature reports from 16 pre-existing archives. It is nicely packaged and allows for slicing into interesting subsets (for example by country). They publish the source data and the code for the transformations they applied. They also use methods that allow weather observations from shorter time series to be included, meaning fewer observations need to be thrown away.
In this dataset, we have include several files:
- Global Land and Ocean-and-Land Temperatures (
GlobalTemperatures.csv
):
Date
: starts in 1750 for average land temp erature and 1850 for max and min land temperatures and global ocean and land temperaturesLandAverageTemperature
: global average land temperature in celsiusLandAverageTemperatureUncertainty
: the 95% confidence interval around the averageLandMaxTemperature
: global average maximum land temperature in celsiusLandMaxTemperatureUncertainty
: the 95% confidence interval around the maximum land temperatureLandMinTemperature
: global average minimum land temperat`ure in celsiusLandMinTemperatureUncertainty
: the 95% confidence interval around the minimum land temperatureLandAndOceanAverageTemperature
: global average land and ocean temperature in celsiusLandAndOceanAverageTemperatureUncertainty
: the 95% confidence interval around the global average land and ocean temperature
- Other files:
- Global Average Land Temperature by Country (
GlobalLandTemperaturesByCountry.csv
) - Global Average Land Temperature by State (
GlobalLandTemperaturesByState.csv
) - Global Land Temperatures By Major City (
GlobalLandTemperaturesByMajorCity.csv
) - Global Land Temperatures By City (
GlobalLandTemperaturesByCity.csv
)
The raw data comes from the Berkeley Earth data page.
This listing originally appeared on Kaggle as berkeleyearth/climate-change-earth-surface-temperature-data.