A dataset is a collection of files and the metadata, story, and insights around those files. Here's some of the things you can do with a dataset on data.world:
- store any sort of file, with tabular and graph data getting enhanced for preview, querying, and visualization,
- include metadata about your project, analysis or data,
- query against multiple tables including external sources, then save them for easy access and sharing,
- have discussions within the dataset so context is there for others to reference,
- create visualizations of your query results.
So, ready to create your first dataset? Let's get started:
In the upper right corner, you'll see the + Add link which allows you to easily create a dataset from any page. Select 'New dataset' which will take you to a form to provide some basic details about your dataset:
Title: name of the dataset, up to 60 characters (required).
You can use any characters and it will be altered to create the dataset URL as well. Once set, you will not be able to change the title or URL.
If you're creating a public dataset, your title should be search friendly so other users can easily find and identify it. It is a limited amount of space, but you'll have more room in the short description.
Accessibility of dataset: select open to allow others to find, view, query and download your dataset or private if it should be restricted from others unless you've explicitly invited them as a collaborator (required).
If you're ready to add files, simply drag and drop files to the box, or if your file can be accessed via a URL, you can choose to add the file directly from that URL. This will also allow data.world to store the source URL, so you can easily fetch a new version if it happens to change.
Click Create dataset and you'll be taken to your dataset landing page. This is where you can add your metadata such as a longer description of your dataset as well as a detailed summary using markdown. The summary is a great place to include notes about your dataset, findings, a task list, licensing and citation requirements, and anything else you may find useful for other users. See our best practices for a dataset summary as well as other tips for more ideas.
Under the Settings tab, you'll be able to select a License type, add tags, or modify the visibility settings:
Under the Contributors tab, you can invite people to contribute or modify existing contributors:
Finally you can use the Discussion tab to communicate with others about the dataset. Note you can have multiple topics going and markdown is also supported here so people can easily share visualizations, etc. here as well: