Browse UNC Dataverse

Browsing Dataverse

UNC Dataverse and its project dataverses, sometimes referred to as collections or sub-dataverses, can be browsed by narrowing results using the facets on the left of the main page. Users can also sort their refined results within a dataverse.

Dataverse vs. Dataset

A dataverse is a collection that can hold datasets as well as other dataverse collections. You can think of it as a container as depicted below.

[Image: Dataverse collection depicted as a box with datasets and a smaller dataverse collection box being placed inside]

This permits users to organize their research in various ways.

You can search at the Dataverse level or filter down to the Dataset or File level by using the checkbox options on the far left of the UNC Dataverse page.

Facets

Facets are specific terms used to narrow down the results in the UNC Dataverse results list. You can find them on the far left of the UNC Dataverse main page and within the dataverses housed under UNC Dataverse. Clicking a facet limits the results list to the items related to that facet.

To remove the facet from your search, navigate to the top of the results list and click the ‘x’ beside the facet label.

[Screenshot: selected facets appear at the top of the results page, each with an 'x' mark beside it so a user can remove that facet from their results]

Results List

The results list is in the middle of the UNC Dataverse main page. This list will update every time you perform a search or select a facet. You can sort the list by clicking on the ‘Sort’ button to the right.
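If you prefer to script your browsing, the same narrowing steps (type checkboxes, facets, and sorting) can be approximated with the Search API that Dataverse installations generally provide. The following is only a minimal sketch: the base URL, the helper name build_search_url, and the example facet filter are assumptions for illustration and are not taken from this guide. It builds a search URL and does not send any request.

```python
from urllib.parse import urlencode

# Assumed base URL for UNC Dataverse; adjust if your installation differs.
BASE_URL = "https://dataverse.unc.edu"


def build_search_url(query, types=("dataset",), facet_filters=(),
                     sort="date", order="desc"):
    """Build a Search API URL (hypothetical helper, no network access).

    types         -- any of "dataverse", "dataset", "file" (the checkbox options)
    facet_filters -- filter queries that play the role of selected facets
    sort, order   -- mirror the Sort button ("name" or "date", "asc" or "desc")
    """
    params = [("q", query)]
    params += [("type", t) for t in types]
    params += [("fq", f) for f in facet_filters]
    params += [("sort", sort), ("order", order)]
    return f"{BASE_URL}/api/search?{urlencode(params)}"


# Example: datasets and files matching "survey", newest first.
# The facet field name below is an assumption and may differ on your installation.
print(build_search_url("survey",
                       types=("dataset", "file"),
                       facet_filters=('subject_ss:"Social Sciences"',)))
```

Removing a filter from facet_filters corresponds to clicking the ‘x’ beside a facet label at the top of the results list.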

You will only see published Dataverses, Datasets, and Files in the results list. If you are the Admin, Curator, or Contributor of an unpublished draft Dataverse or Dataset, you may also see that draft in the results list.

Browsing a Dataset

Each dataverse contains datasets, sometimes referred to as dataset records, which provide contextual information about the data files, documents, and/or code housed within that record. It is important to review a dataset record completely before accessing the files to ensure you are complying with the terms of use and that you have a full understanding of the contents of the dataset record.  

Breakdown of Metadata

A dataset record within UNC Dataverse is composed of standardized fields that contain information, or metadata, about the contents of the dataset record.

UNC Dataverse requires specific metadata fields to be populated before a dataset record can be published. These metadata fields are part of the citation metadata, as seen below.

Citation metadata is structured so users of the data within a dataset record can cite their source, thus giving credit to the original data producer(s). Within the citation metadata block you will find key information such as Author(s), Title, Date, Repository, Version, and DOI (the persistent identifier that links to the dataset). Users can also export a data citation from the citation metadata block by selecting Cite Dataset and choosing an export format from the dropdown menu.
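A DOI can be turned into a clickable link by placing it after the https://doi.org/ resolver address. The sketch below shows one way to normalize a DOI copied from a citation. The helper name doi_to_url and the example DOI are hypothetical and used for illustration only.

```python
def doi_to_url(doi):
    """Return the https://doi.org/ link for a DOI (hypothetical helper).

    Accepts a bare DOI, a "doi:" prefixed DOI, or an existing doi.org URL.
    """
    doi = doi.strip()
    for prefix in ("https://doi.org/", "http://doi.org/", "doi:"):
        if doi.lower().startswith(prefix):
            doi = doi[len(prefix):]
            break
    return f"https://doi.org/{doi}"


# The DOI below is a made-up placeholder, not a real dataset.
print(doi_to_url("doi:10.1234/EXAMPLE"))
```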

Domain-specific metadata are also provided as part of a dataset record. These fields let data producers further describe the contents of their dataset record. The advanced search feature allows users to narrow their search across these specific fields. In addition, any information entered in domain-specific metadata fields is automatically included in basic search results in UNC Dataverse.

File-level metadata may be included for each individual file. Unlike the domain-specific metadata fields, file-level metadata describes the contents of a specific file instead of the entire dataset record. For example, a user may find information about the study conducted in the domain-specific metadata fields, whereas they may learn what ‘PFHHP2.tab’ contains from the file-level metadata.

Contact Dataset Owner

When exploring a dataset, you may find that you have additional questions about the contents or availability of the data. For all dataset records within UNC Dataverse, users may click the Contact Owner button. A pop-up will appear on the page where you can write a message and send it to the dataset owner. The dataset owner will receive the inquiry through UNC Dataverse as well as through their account email.

Accessing Files

Under the Files tab of the dataset record, users will find all data files provided by the data producer. Files are typically listed in alphabetical order; however, users may click the Sort button to sort by newest, oldest, type, size, or Z-A. UNC Dataverse automatically provides information about each file within the dataset record. This information includes file type, file size, number of downloads, and an MD5 checksum. If a data producer provides file-level metadata for a file, it will also show in the Files section.
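The MD5 checksum listed for each file can be used to confirm that a download arrived intact. The following is a minimal sketch using Python's standard hashlib module. The function names and the file path in the usage note are placeholders, not part of UNC Dataverse.

```python
import hashlib


def md5_of_file(path, chunk_size=1024 * 1024):
    """Compute the MD5 hex digest of a file, reading it in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_listed_checksum(path, listed_md5):
    """Compare a local file against the MD5 value shown in the Files tab."""
    return md5_of_file(path) == listed_md5.strip().lower()
```

For example, calling matches_listed_checksum("PFHHP2.tab", "<value copied from the Files tab>") returns True when the local copy matches the listed checksum. If it returns False, try downloading the file again.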

To download a single file, click the arrow icon to the right of the filename. Select an option from the ‘Download Options’ section of the dropdown menu. We typically recommend downloading the file in its original file format. In some instances, UNC Dataverse will create preservation formats of the data, and alternative file formats will be available to download. This is particularly useful if you do not have access to the statistical software required to use the original file format. Please note that not all data files in UNC Dataverse are available in multiple formats; availability depends on the original file format provided by the data producer.

To download multiple files, or all files, within a dataset record, click the checkbox next to each desired file. Alternatively, click the checkbox at the top of the file list and click ‘select all # files in this dataset’. Next, click the ‘Download’ dropdown menu at the top right of the files list. Choose ‘Original Format’. UNC Dataverse will create a zip that includes all files selected within that record.

There is a size limit for the zip files created by UNC Dataverse. If the total download size is larger than 1 GB, you will be prompted to exclude larger files from the download. In this case you will need to download the larger files individually.
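When planning a large download, it can help to work out in advance which files will fit in a single zip. The sketch below is one way to reason about it, under two loudly stated assumptions: that 1 GB means 1,000,000,000 bytes, and that the largest files are set aside first until the rest fits. The actual selection logic used by UNC Dataverse is not described in this guide, and the file names and sizes in the example are invented.

```python
# Assumption: the zip limit is 1 GB, counted as 1,000,000,000 bytes.
ZIP_LIMIT_BYTES = 1_000_000_000


def plan_download(file_sizes, limit=ZIP_LIMIT_BYTES):
    """Split files into a zip batch and a list to download individually.

    file_sizes -- dict mapping file name to size in bytes
    Returns (zip_batch, individual_downloads).
    """
    remaining = dict(file_sizes)
    individually = []
    # Set aside the largest file until the remaining total fits under the limit.
    while remaining and sum(remaining.values()) > limit:
        largest = max(remaining, key=remaining.get)
        individually.append(largest)
        del remaining[largest]
    return sorted(remaining), individually


# Invented example sizes (bytes), for illustration only.
sizes = {"a.tab": 600_000_000, "b.pdf": 300_000_000, "c.csv": 500_000_000}
zip_batch, single_files = plan_download(sizes)
print("Zip together:", zip_batch)
print("Download individually:", single_files)
```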

Terms of Use & Creative Commons Licensing

Each dataset has its own Terms of Use which can be customized by the dataset owner. As a UNC Dataverse user, you should review the Terms of Use of the dataset record you are interested in downloading files from. These terms will outline any requirements for downloading, citing, and/or re-using the data files within a dataset record.

Creative Commons licenses are assigned to a dataset record before publication and define what users may do with the contents of that record. There are six different Creative Commons licenses and one public domain dedication (CC0). Creative Commons licenses can be used in combination with custom terms of use.

Some dataset records will use the default CC0 license and have no additional terms of use; however, it is still a best practice to cite any data sources you may use in your own research and/or discuss in future publications. To learn more about licensing and data citations, please visit the Dataverse Community Norms.

Versions

A Versions tab exists for every dataset record. This tab lists all published versions of a dataset and describes how each version differs from its predecessor.

Version history is important to track changes to dataset records over time. These changes can range from updates to metadata fields to deletion of obsolete files and upload of new files. The version history also notes the date a new version was published and the entity that published the new version.

Data Explorer

The Data Explorer is a feature of UNC Dataverse that permits users to browse publicly available data files in particular formats within the Dataverse platform. Through Data Explorer, users can see every variable within a data file and review summary statistics, create cross tabulations, and download subsets from that file. Review the video below to learn more about Data Explorer within UNC Dataverse.

[Video]

Document Previewer

UNC Dataverse also has a built-in document previewer. This feature allows users to read TXT and PDF files within the Dataverse platform. Simply navigate to the text document you wish to preview, click the download icon, and choose ‘Read Document’ from the dropdown menu.

Request Access to Restricted Data

Some data will not be completely accessible via the UNC Dataverse platform. These files are considered restricted and will have a red lock icon beside the file image within the dataset record. Instead of a download option, users will be able to ‘request access’ to a restricted file.

You must be logged in with your UNC Dataverse account in order to request access.

Once logged in, you may click ‘Request Access’ and fill in the form to submit a request to the data owner. The data owner will be notified via email by UNC Dataverse and can grant or deny the access request.

Please make sure to read the entirety of the dataset record before requesting access, as some data producers will include a Data Use Agreement and further instructions for requesting access to their data. These instructions may require you to email a completed data use agreement to the data owner outside of UNC Dataverse.

When a data owner has granted you permission to access their data, you will receive an email from UNC Dataverse indicating the file is available for download. Simply log back into UNC Dataverse and navigate to the file via either the link provided in the email, or via the Notifications tab under your User Dashboard.

User permissions may be revoked at any time by the data owner.