Data Cleaning with OpenRefine for Ecologists: Setup


Download this data file to your computer:

About the data

The data for this lesson is a part of the Data Carpentry Ecology workshop. It is a teaching version of the Portal Database. The data in this lesson is a subset of the teaching version that has been intentionally ‘messed up’ for this lesson.

The data for this lesson and the workshop are in the Portal Project Teaching Database available on FigShare, with a CC-BY license available for reuse.


For this lesson you will need OpenRefine version 3.6.2 and a web browser.

Note: OpenRefine is a Java program that runs on your machine (not in the cloud). It runs inside your browser, but no web connection is needed.

Download OpenRefine version 3.6.2 from

OpenRefine requires one of these web browsers:

OpenRefine has some issues with Firefox. Internet Explorer is not supported.

Note: Other versions of OpenRefine should work, but the results might be different due to changes in the software or default settings.