The data dictionary is the other document created along with the dataset. The data dictionary contains:
- The names of all the files in the dataset
- A place to add descriptions for each file
- Metadata labels for each file
and for tabular files:
- Column names
- The format of the data in each column
- A place to add a description for each column
You can get to the data dictionary either from the Overview tab (right below the Summary) or from the Documents section in the left pane of the workspace:
Data dictionary entries for each file are edited separately by selecting the Edit link next to the filename in the data dictionary document. Every file--no matter what type--has a data dictionary entry which contains the file metadata for the file:
Tabular files also have a tab for columnar metadata where you can rename the columns, change their format, and add descriptions for them:
Changing column names and adding a description is a great way to avoid the ambiguity that comes from having multiple columns with the same name. It also renders obscure column names understandable.
Changes to column names, descriptions, and data types propagate throughout data.world to every project that references the dataset, and the changes remain even if the data is updated from an external source.