9. Meet Wide and Long Data: Step by Step

This reading outlines the steps the instructor performs in the following lecture, Meet wide and long data. In this lecture, the instructor presents wide and long data formats and discusses the types of questions each format can help you answer.

Keep this guide open as you watch the lecture. It can serve as a helpful reference if you need additional context or clarification while following the lecture steps. This is not a graded activity, but you can complete these steps to practice the skills demonstrated in the lecture.

What youll need

If you would like to access the spreadsheets the instructor uses in this lecture, select the link to a dataset to create a copy. If you don’t have a Google account, download the data directly from the attachments below.

Link to population datasets:

Population, Latin, and Caribbean Countries, 2010–2019, wide format

Population, Latin, and Caribbean Countries, 2010–2019, long format

OR

Download data:

Population, Latin, and Caribbean Countries, 2010–2019, wide format

XLSX File

Population, Latin, and Caribbean Countries, 2010–2019, long format

XLSX File

Example 1: Examine wide data

Wide data is a dataset in which every data subject has a single row with multiple columns to hold the values of various attributes of the subject. It helps compare specific attributes across different subjects.

Open the Population, Latin, and Caribbean Countries, 2010–2019, wide format spreadsheet.

Each row contains all population data for one country.

The population data for each year is contained in a column.

Find the annual population of Argentina in row 3.

In this wide format, you can quickly compare the annual population of Argentina to the annual populations of Antigua and Barbuda, Aruba, the Bahamas, or any other country.

Find the country with the highest population in 2010

Select column E, which contains each country’s 2010 population data.

Right-click column header E and choose Sort Z to A.

Notice that Brazil is now at the top of the list because it had the highest population in the year 2010.

Find the country with the lowest population in 2013

Select column H.

Right-click column header H and choose Sort A to Z.

Notice that the British Virgin Islands are now at the top because they had the lowest population of all countries in 2013.

Example 2: Examine long data

Long data is data in which each row represents one observation per subject, so each subject will be represented by multiple rows. This data format is useful for comparing changes over time or making other comparisons across subjects.

Open the Population, Latin, and Caribbean Countries, 2010–2019, long format spreadsheet.

Notice the data is no longer organized into columns by year. All of the years are now in one column.

Find Argentina’s population data in rows 12-21. Each row contains one year of Argentina’s population data.

Post it on social media
Dr Nabeela: