Green Web Datasets

Since 2006, we have been building the world’s largest database tracking which parts of the internet run on renewable power, and when they switched.

You can check the current state of this information with the Green Web API, but we also make the underlying data available for bulk download, for your own analysis.


Our datasets

This dataset is based on the same list of the top million domains that the HTTP Archive uses for its monthly reports. It shows which of those domains run on green power and which run on ‘grey’ power, usually fossil fuels. For an example, see the link below:

GreenWeb Check Dataset – Top Million Websites – 2019-06-11

Monthly reports

From Jan 2020 onwards, every month we generate a dataset of all the domains listed as running on green infrastructure in our system, along with which organisation is hosting them, and the date of the last check.

To download these, please visit our admin site:

https://admin.thegreenwebfoundation.org/green-urls

For guidance on using the data, and on what the columns mean, please refer to our documentation on GitHub.
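As a rough illustration of the kind of analysis a monthly dataset allows, the Python sketch below counts how many green domains each hosting organisation serves. It assumes the download is a CSV file; the column names (`url`, `hosted_by`, `modified`) and the sample rows are hypothetical and used only for illustration, so please check the documentation for the actual column names before adapting it.

```python
import csv
import io
from collections import Counter

# Hypothetical sample rows in an ASSUMED layout (domain, hosting
# organisation, date of last check). The real export may use
# different column names; see the documentation.
SAMPLE_CSV = """url,hosted_by,modified
example.com,Example Green Host,2020-01-15
example.org,Example Green Host,2020-01-20
example.net,Another Host,2020-01-18
"""


def domains_per_host(csv_file, host_column="hosted_by"):
    """Count the rows (domains) listed for each hosting organisation."""
    reader = csv.DictReader(csv_file)
    return Counter(row[host_column] for row in reader)


counts = domains_per_host(io.StringIO(SAMPLE_CSV))
for host, number_of_domains in counts.most_common():
    print(f"{host}: {number_of_domains}")
```

To run this against a downloaded file, pass an open file handle instead of the in-memory sample, for example `open("green_urls.csv", newline="")`, where the filename is a placeholder for whatever you saved the download as.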

If you’re looking to use this data, and would like support in using it, or have questions about it, please don’t hesitate to get in touch at support@thegreenwebfoundation.org – the data is free, and we want to see as many people using it!

Using this data

We make these datasets available free of charge, under a permissive licence, to encourage the development of tooling and products using them.

We make this available under the Open Database Licence:

You are free:

  • To Share: To copy, distribute and use the database.
  • To Create: To produce works from the database.
  • To Adapt: To modify, transform and build upon the database.

As long as you:

  • Attribute: You must attribute any public use of the database, or works produced from the database, in the manner specified in the ODbL. For any use or redistribution of the database, or works produced from it, you must make clear to others the license of the database and keep intact any notices on the original database.
  • Share-Alike: If you publicly use any adapted version of this database, or works produced from an adapted database, you must also offer that adapted database under the ODbL.
  • Keep open: If you redistribute the database, or an adapted version of it, then you may use technological measures that restrict the work (such as DRM) as long as you also redistribute a version without such measures.

Disclaimers and Limitation of Liability

The information on the domains (“organisation data”) is drawn from data submitted by the hosting organisations themselves. While we have a series of checks to verify some claims, the source data may contain errors which we have not yet detected.

Therefore we cannot make any promises as to the quality of organisation data. You use the organisation data entirely at your own risk.

For this reason, and because this data is made available without charge, we limit our liability as follows.

The Green Web Datasets are provided “AS IS” and on an “AS AVAILABLE” basis without any representations or any kind of warranty made (whether express or implied by law) to the extent permitted by law, including the implied warranties of satisfactory quality, fitness for a particular purpose, non-infringement, compatibility, security and accuracy.

Under no circumstances will we be liable for any of the following losses or damage (whether such losses were foreseen, foreseeable, known or otherwise): (a) loss of data; (b) loss of revenue or anticipated profits; (c) loss of business; (d) loss of opportunity; (e) loss of goodwill or injury to reputation; (f) losses suffered by third parties; or (g) any indirect, consequential, special or exemplary damages arising from the use of Green Web Datasets, regardless of the form of action.

This project was funded through the NGI0 Discovery Fund, a fund established by NLnet with financial support from the European Commission’s Next Generation Internet programme, under the aegis of DG Communications Networks, Content and Technology under grant agreement No 825322. Applications are still open, and you can apply today.