This QuickStart will get you up and running quickly with your Enterprise Free Trial. In it we will cover:
- Creating your dataset and adding live tables
- Documenting your data
- Writing queries against your data
- Saving queries
- Working with your favorite tools
- Sharing and collaborating with other members of your organization
This free trial lets you work with your organization's live data through a virtual connection: you can document the metadata, query and analyze the data, and collaborate on your analysis with others in your organization. Our business glossary and metadata crawler functionality are outside the scope of the trial, but if you would like a demo or a pilot of those features, please contact us at firstname.lastname@example.org.
Here are some links to get you started:
You must connect to your live data before you can continue.
Creating your dataset and adding live tables
After you connect your data source, the Enterprise Free Trial dialog will prompt you to Create a new dataset. The owner will be set to your organization by default, and sharing will be set to No one. Change sharing to All of <your_organization_name> if you want everyone to be able to see the dataset:
After creating your dataset, write a quick description for it and select Add data:
Choose the live connection you set up when you signed up for the free trial:
Then select Live table:
Depending on your connection, you might see one or many databases to choose from. After you choose the appropriate one, you might be prompted to choose a schema or be taken directly to table selection, depending on the source. Your final selection is which tables in the database or schema to include. You can select all of them or a subset:
When your data has been connected you will see your description and the tables you added to the dataset:
Selecting Continue takes you to the dataset overview page where you can see the progress of your schema loading:
During this process the metadata from your database is loaded and analyzed, and previews of your tables are generated. It all runs in the background and can take several minutes depending on how much data you have in your database.
Documenting your data
Among the information that is pulled over and calculated are the profiling statistics, data that lets us understand more about the shape and quality of your data. Once processing is complete, you can see all the information that has been brought in. Though your data still lives entirely on your site in your database, the metadata has all been brought into the dataset. In our example you can see the previews of the 124 tables in the dataset, complete with schema information, data types, and so on:
When you select an icon next to a column name you can see all the information about that column including the data type, number of cells with values, number of empty cells, number of distinct values, and sample values. You can also see a description of the column if one exists in the database. In addition to seeing the column metadata you can also edit it:
Edits you can make include changing the column name and changing or adding a description:
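To make the column statistics described above more concrete, here is a minimal Python sketch of how such a profile could be computed for one column. It is an illustration only, not how data.world calculates its profiling statistics; the function name `profile_column`, the field names, and the sample data are all hypothetical.

```python
def profile_column(values, sample_size=3):
    """Summarize one column: type, filled cells, empty cells, distinct values, samples."""
    # Treat None and blank strings as empty cells (an assumption for this sketch).
    non_empty = [v for v in values if v is not None and str(v).strip() != ""]

    # Collect distinct values while preserving first-seen order.
    distinct = []
    seen = set()
    for v in non_empty:
        if v not in seen:
            seen.add(v)
            distinct.append(v)

    return {
        "data_type": type(non_empty[0]).__name__ if non_empty else "unknown",
        "cells_with_values": len(non_empty),
        "empty_cells": len(values) - len(non_empty),
        "distinct_values": len(distinct),
        "sample_values": distinct[:sample_size],
    }


# Hypothetical column with two empty cells and one repeated value.
column = ["red", "blue", None, "red", "", "green"]
stats = profile_column(column)
print(stats)
```

For the sample column, the profile reports 4 cells with values, 2 empty cells, and 3 distinct values.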
As you created your dataset, you were prompted to write a brief description of it. Further documentation for the dataset (its purpose, provenance, licensing, and much more) goes in the dataset summary. The summary is accessed from the top of the dataset overview. Click Add a summary... to go to the workspace for the data:
You can edit the Summary with our rich text editor or Markdown, and you can include images, queries, links, formatted text, and more:
The details about all the columns in all the tables in the dataset are also available on the workspace. To see or edit them, select Data dictionary under the Home section of the left sidebar in the workspace:
Writing queries against your data
The workspace is also where you interact with your data by writing queries in either SQL or SPARQL. If you're unfamiliar with one or both of them, see our documentation for data.world's SQL dialect, SQL tutorial, or SPARQL tutorial for more information.
To write a query against your data click on + Add in the upper left of your workspace and select SQL Query:
The query editor will open in the middle of your window with the cursor already in place, and a list of the tables and columns in your dataset will be on the right. Write your query in the editor and then select Run query to see the results:
To reuse your query either in the dataset or in a specific project, you'll need to save it. In the above example the query was written to denormalize the data in the original database and make it more accessible for business analysts. Because this query is a building block to most other queries about products, we are going to save this query to the dataset. That way everyone else who uses this dataset in a project will have access to the query:
To save your query to the dataset, select Save next to the Run query button and choose your dataset name from the dropdown:
Notice that the query now shows up as an asset in the dataset in the left sidebar of the workspace:
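As an illustration of the kind of denormalizing query mentioned above, the following sketch joins two normalized tables into one flat, analyst-friendly result. It runs against an in-memory SQLite database as a stand-in, so the syntax may differ from data.world's SQL dialect, and the `products` and `categories` tables and their columns are hypothetical rather than taken from the example dataset.

```python
import sqlite3

# In-memory SQLite database standing in for the live source (assumption).
conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE categories (
        category_id INTEGER PRIMARY KEY,
        category_name TEXT
    );
    CREATE TABLE products (
        product_id INTEGER PRIMARY KEY,
        product_name TEXT,
        category_id INTEGER REFERENCES categories(category_id)
    );
    INSERT INTO categories VALUES (1, 'Bikes'), (2, 'Helmets');
    INSERT INTO products VALUES (10, 'Road Bike', 1), (11, 'Trail Helmet', 2);
""")

# Denormalize: replace the category_id foreign key with the readable category name.
DENORMALIZE_PRODUCTS = """
    SELECT p.product_id, p.product_name, c.category_name
    FROM products AS p
    JOIN categories AS c ON c.category_id = p.category_id
    ORDER BY p.product_id
"""

rows = conn.execute(DENORMALIZE_PRODUCTS).fetchall()
for row in rows:
    print(row)

conn.close()
```

A query like this works well as a saved building block because later queries can read the flattened result without repeating the join.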
Downloading query results
Besides saving your query, there are several other things you can do with the results. You can download them in a variety of formats and save them to your desktop, the dataset, or a project:
Working with your favorite tools
Finally, you can take the results of your query and make visualizations from them or work with them in other tools by selecting Open in app:
The link takes you to a list of all of your appropriate third-party integrations and also provides a link to other available integrations:
Sharing and collaborating with other members of your organization
If you shared your dataset with the members of your organization when you created it, they all have view privileges to it. If you didn't share it automatically, or if you would like to grant more access to some or all of your colleagues, you can make those changes back on the dataset overview page. To get back there, select View dataset on the left sidebar:
Then select the Contributors tab at the top of the screen:
Privileges can be set to discover only (see previews of the tables but not be able to see all the data), view the data, edit the data (including writing queries against it, changing the metadata, and adding new data), or view, edit, and manage:
Permissions can also be extended to individuals by clicking the Invite button. To invite someone, enter their email address or, if you know it, their full name or data.world login name. You can also set the permissions to whatever level you would like and notify them with a custom message:
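One way to reason about the four privilege levels described above is as an ordered scale, where each level includes everything below it. The sketch below models that idea in Python. It is a mental model only: the cumulative ordering is an assumption, and the names `AccessLevel` and `is_allowed` are hypothetical and not part of any data.world API.

```python
from enum import IntEnum


class AccessLevel(IntEnum):
    """Privilege levels, ordered from least to most access (assumed cumulative)."""
    DISCOVER = 1  # see table previews, but not all the data
    VIEW = 2      # view the data
    EDIT = 3      # write queries, change metadata, add new data
    MANAGE = 4    # view, edit, and manage


def is_allowed(granted, required):
    """Return True when the granted level covers the required level."""
    return granted >= required


# Hypothetical case: the organization was given view access at creation time.
org_default = AccessLevel.VIEW
print(is_allowed(org_default, AccessLevel.DISCOVER))  # True
print(is_allowed(org_default, AccessLevel.EDIT))      # False
```

Under this model, a colleague with view access can see everything a discover-only user can, but needs a higher level before changing metadata or adding data.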
- See our extensive help documentation complete with articles, tutorials, and references.
- Contact email@example.com with any questions or issues you have as you experience the Free Trial.
- Reach out to firstname.lastname@example.org for a free consultation on your data challenges and live walkthrough demo of the software. We'd love to tell you about additional functionality like metadata crawling, lineage, governance, and usage metrics that data.world offers. This step-by-step QuickStart went through a quick tour of dataset and project creation and data analysis, but data.world has much more to offer!