In many data projects you may find that you need to combine data from two tables or spreadsheets. In the case of the sample dataset, NEISS product numbers are referenced, but we don’t know which products these numbers refer to. We need to bring over the names of the products from another table. This table is the other CSV file mentioned in the Getting Started section of this Guide, entitled: “NEISS_ProductCodes_ProductNames.csv”.
This brings you back to the Open or Create Project screen. If you have not done so previously, download CSV file “NEISS_ProductCodes_ProductNames.csv" and save it in an accessible place. From OpenRefine, Browse to this downloaded file and open it as a new project.
A second OpenRefine window will now open for the NEISS_ProductCodes_ProductNames spreadsheet. Set this new project up as you would any other. Choose UTF-8 under character encoding, and make sure to parse the first line of data as the column headers.
A screenshot with the resulting project can be seen below. Take note of the project name at the top of the screen, since it will be needed in the next step to join the projects. Instead of a dot there is a space between "ProductNames" and "csv" - this space will need to be maintained in the next step.
The NEISS_ProductCodes_ProductNames csv project has the key column (the one that contains the same unique values in both projects, “Product_1”), and the column that we want to bring over to our original project, “Product_Name.”
Go back to the window that has your original project, "2019_NEISS_SmallSubset csv". From the Product_1 column's drop down arrow, choose Edit column - Add column based on this column.