A monthly series of infographics on research data
Published by the Library's Research Data Services (RDS) department, the Data Literacy Series (DLS) are visually-compelling one-page handouts that break down complex and important data-related topics. DLS complement the RDS's instructional efforts and mission to promote data education and research data management practices. The handouts are distributed under a CC BY-NC-SA 4.0 license. If you’d like to suggest a topic for an upcoming issue, please submit your feedback to rds@library.ucsb.edu.
Sign up for our Data Literacy Series and receive the infographics straight to your inbox!
Search the DLS handouts:
The Intricacies of Managing Qualitative Data from Human Subject Research
Qualitative data is inherently nuanced and deeply contextual. While it offers rich interpretive possibilities, its analysis and presentation often demand considerable resources. Moreover, due to its sensitivity, managing qualitative research data obtained from human subjects—such as interviews and observations— poses distinct ethical challenges. This handout delineates key considerations, practical tips, and available resources for effectively handling this data type across its lifecycle.
PDF - ALTTAGS: Data Management, Data Sharing, Data Ethics, Qualitative Data, Human Subjects, Sensitive Data
DATE: 03-2024
Crafting Effective Data Availability Statements
Many scientific journals require authors to provide a data availability statement (DAS) to adhere to openness and transparency principles. The DAS specifies how and where the underlying data used in a study can be accessed and the terms under which they can be obtained. We cover best practicves for writing such statements in compliance with policy requirements.
PDF - ALTTAGS: Data Sharing, Data Reuse, Data Access, Data Licensing, Open Data, Restricted Data
DATE: 02-2024
Bundle your Project and Computing Environment with Binder
Binder is an open-source and free service for researchers to create a shareable project version that can be viewed and interacted with within a reproducible computational environment that runs in the cloud via a web browser.
PDF - ALTTAGS: Code Documentation, Dependency Management, Reproducibility
DATE: 01-2024
Virtual Environments in Python with Venv
Venv operates independently, ensuring alterations to installed dependencies within one environment remain isolated from others and system-wide libraries. This isolation allows the creation of multiple virtual environments, each hosting its own Python versions and varying sets of libraries.
PDF - ALTTAGS: Code Documentation, Dependency Management, Reproducibility, Python programming
DATE: 12-2023
Reproducible Environments with RENV
Is your project R-based? The renv package helps you set up R projects and manage dependencies to keep your environment consistent and reproducible.
PDF - ALT
TAGS: Reproducibility, Dependency Management, Code Documentation, R Programming
DATE: 11-2023
Taming the Dependency Hell
Everybody has a "dependency hell" horror story to tell. In the spookiest month of the year, we describe the leading causes of this problem and how it impacts scientific reproducibility.
PDF - ALTTAGS: Reproducibility, Dependency Management, Code Documentation
DATE: 10-2023
The ABCs of Web APIs
APIs or Application Programming Interfaces have become increasingly popular in academic research. They simplify data access, streamline data collection and analysis processes, enable real-time updates, support collaboration, provide access to specialized tools, and more.
PDF - ALTTAGS: API, Data Access
DATE: 09-2023
Watch Out for Predatory Publishers
Predatory publishers disguise themselves as credible open access (OA) publishers. They employ deceitful tactics and operate profit-driven schemes that can harm academics' reputations, undermining their chances of disseminating authentic research through established and credible publishing models. Here are tips for avoiding predatory publishers to help you safeguard your work and maintain scholarly honesty while publishing open access.
PDF - ALTTAGS: Open Access, Open Science, Scholarly Communication
DATE: 08-2023
Roll up your Sleeves for some Data Cleaning
Whether you have collected your own data or will be reusing existing datasets, you probably need to clean them up before you move forward with data analysis. This process includes fixing or removing incorrect, corrupted, unformatted, duplicate, or incomplete data. While the cleaning-up process may look different depending on the dataset you have at hand, this handout covers some essential tips to complete this task more efficiently while making your data more consistent, accurate, and high quality.
PDF - ALTTAGS: Data Cleaning, Data Preparation, Tidy Data
DATE: 07-2023
Dublin Core
The Dublin Core Metadata Initiative (DCMI) was named after its inaugural meeting in 1995 in Dublin, Ohio. The organization maintains the DCMI Metadata Element Set, one of the most straightforward and widely adopted metadata schema. Initially intended for web resources, Dublin Core (DC) has proven its versatility in describing various physical and digital objects, including datasets.
PDF - ALT