There is a lot of to be found on data.world. If you are in an organization, there is everything from your organization's private data to publicly shared datasets and projects open to everyone. We make finding the data you need easy with a variety of search options. In this article we'll cover:
The search bar enables you to find results based on matches in many parts of a dataset or project including:
- description fields
- column names
- creator or owner
- creation or modification dates
A search on 'bees', e.g., returns the following results:
If you are in an organization, the search results shown are from your own library which holds both your and your organization's data. If you want to see results from all the publicly shared data on data.world, select the 'data.world community' tab under the search bar:
If you want to search for a text string, enclose the string in double quotes and you will only get back results that have that exact string in them. E.g., a search on "honey bee" returns the following:
Besides searching for text anywhere in the data, you can also search for specific operator matches like creator, owner, tag, or modified. If you wanted to find all datasets created by a specific user you could run a search like this one for all data created by the user @us-usda-gov:
You can combine operators to narrow down your searches. E.g., if you wanted all data about women's health you could search on women AND health:
For a complete list of all the operators and the way you can search using them see the article Using search.
One of the operators that can be used in the search bar is tag. But tag is even more powerful than the other search operators as it can easily be used to narrow down your search results or to find other related data.
For example, let's start by searching all items with the string 'census':
We get back a lot of results, but it's very easy to reduce them to a more manageable number using filters. The left sidebar of the results window shows the filters you can use to narrow down the results either by data type or by the most common tags used by the datasets and projects in the results.
By selecting one of the three data types from the left sidebar we could see only:
- datasets and projects relating to 'census'
- insights with the term 'census' in them
- people or organizations with 'census' in their display names or ids.
You can also choose one or more tags from the tags list to see a refresh of the most commonly used tags for the filtered result set. E.g., our initial results (shown above) doesn't include the tags health or housing in the most-cpmmonly-used-tags list in the left sidebar. However, by selecting the tag 'census', we get a whole new list of most common tags for those results--including 'health'. Selecting 'health' takes us to an even smaller set of results:
If we were looking for public data on US health insurance, the dataset in the example above looks like a promising place to start. Clicking on the dataset name takes us to the overview page for the dataset. From here there are many links to click through to find related data including:
- Other datasets with the same creator (right sidebar)
- Other datasets with the same owner (top left)
- Datasets with the same tags
- Projects that use the dataset
Clicking through any one of them will take you to another set of data that might be relevant to your current project or that you might be interested in looking at later. Finally at the bottom of the overview page there is a set of curated suggestions of related data from data.world: