Excel tips | NENU Data Journalism Terminology An Excel file is called a workbook. A workbook can have one or more worksheets. Each worksheet is list, with columns and rows of information. We often call this a table or spreadsheet or a data set. Each box in a worksheet is called a cell. Every cell has an address, like A1 (Column A, Row 1 -- the top left corner of the worksheet). A table usually has one row containing the labels for each column. We often call this the "header row." The header row in this example is on Row 1: When you have a big table with a lot of data, it’s easy to get lost. It’s a good idea to “hide” the columns you are not interested in – like Column B (the country code). To hide a column, select the entire column by click on the B. Then, do a right-click and select “hide”: Now Column B is no longer taking up space – but it still exists; it hasn’t been deleted: But look what happens when you scroll down on this table: The header row disappears! And that’s bad (and dangerous), because we don’t know if Column C is for 2010 or for 2013. So, let’s lock in place the header row. This is called “freeze panes.” Put your cursor in Cell A2, and then select … Now when you scroll down, you’ll always see what is in each column: Another good habit is to make a copy of the worksheet, and keep the copy in the same workbook. That way, as you work with the data, you will have a trail: You can go back and see exactly what you did. (You can see that I have already made one copy.) Here is how you make a copy. Do a right-click on the tab of the worksheet you want to copy … then select “Move or Copy”: Tell Excel to copy this worksheet to the end of the workbook: Then double-click on the tab of the copy, and rename this worksheet “my analysis”: -------------------------------------------------------------------------------------------------------------------------------OK – now we’re ready to really do some analysis! First, let’s make sure we understand what is in this spreadsheet. It looks like each row is one country, and the columns show how many patents the inventors from that country received each year. Let’s scroll to the bottom of the spreadsheet. (You can get their quickly by clicking the Ctrl key and the End key at the same time.) So the last row of the spreadsheet is not a country – it’s a row of totals. Let’s sort all of the countries to see which ones got the most patents in 2013. First, select the header row (Row 1) by clicking on the “1”: Then, scroll to the bottom of the spreadsheet so that you can see the last rows of data. Hold your Shift key down and select Row 185 (by clicking on the “185”): Now you have selected the data you want to sort – everything except the row that says “All US patents, all inventors, all countries” (because that row is not a country). Then, on the Menu bar, go to Data and Sort: Let’s tell Excel to sort the data by the “Year 2013” column … … and we want to put the results in order from the highest number to the lowest number: Here’s what we get: So Chinese inventors ranked No. 6 among all the countries in 2013. Not bad! Now sort the data for 2012 … and 2011 … and 2010. Do you see a pattern regarding China? -------------------------------------------------------------------------------------------------------------------------------Now let’s add up the total number of patents each country got for all four years (2010 through 2013). First, in Cell G1, type a column heading, such as “Total, 2010-13”: In Cell G2, type =sum(C2:F2) (Instead of typing “C2”, you can click on cell C2. In other words, you can build the formula by clicking on the cells or by typing the cell address.) When you hit “Enter,” you’ll see the results of the formula: Now, click on the cell containing the formula. You’ll see a black dot at the bottom right corner of the cell. This dot is called the AutoFill handle. If you put your cursor on this dot, your cursor changes from a big white cross … … to a thin black cross: When that happens, double-click, and Excel will copy your formula down the entire column: Let’s do another formula. Let’s calculate the percentage change in patents for each country from 2012 to 2013. (Since we don’t need the “Year 2010” and “Year 2011” columns right now, let’s hide them.) Type a column header in Cell H1: In our formula, we need to take the “new number” (2013) … subtract the “old number” (2012) … and then divide by the “old number” (2012). We must tell Excel to do the subtraction first. So in Cell H2 (the formula for Row 2), our formula will be: =(F2-E2)/E2 Click Enter, and you’ll see the result. But the result is not formatted as a percent yet. So click on: Once you have the number formatted correctly, you can copy the formula down the column: Now let’s sort all of the countries by the “percentage change” column. First, select the data you want to sort (from Row 1 through Row 185) … then go to Data > Sort. As you can see, we have some error messages; I’m sure you can figure out why! But what is another problem with, say, the country of Niger? Data journalists warn about the “law of small numbers”: You can’t draw important conclusions from small numbers. So let’s look only at countries that had at least 1,000 patents in 2012. To do this, we must filter our list. Make sure your cursor is on the header row – Row 1. Then click on this button (a picture of a funnel): Now you have a drop-down arrow on each column label: Let’s tell Excel to show us only countries in which the 2012 number was greater or equal to 1000: What do you see? Is that newsworthy? How could you turn this information into a news story?
© Copyright 2026 Paperzz