Open Data and Open Access Week

As part of our OA Week 2017 celebrations, we want to talk about how open data contributes to the open agenda. Open data are freely available to use, repurpose, and republish. Open Definition defines it as “data that can be freely used, re-used, and redistributed by anyone – subject only, at most, to the requirement to attribute and sharealike”.

Many calls to increase the amount of open data have been based on a desire to improve the transparency and reproducibility of research. In those cases, access to the data isn’t enough. There needs to be associated documentation that provide the user with enough information to be able to reproduce the original experiment or data processing.

The White House issued a memo in 2013 to expand public access to federally funded research. This directed all funding agencies that spend more than $100 million on R&D to develop a plan for how the results of that research would be shared. Many of these agencies now have information available online advising researchers on how to comply with the requirements and make their work more usable.

Journals are also starting to come up with data sharing policies for the articles they publish. Authors submitting articles to those journals should be prepared to also share the data that the article is based on. The Open Access Directory has a list of journal data policies: http://oad.simmons.edu/oadwiki/Journal_open-data_policies.

In order to facilitate data sharing, the Linux Foundation recently announced the launch of Community Data License Agreements. These licenses are designed to enable sharing in the same way that open source software licenses have enabled sharing. Right now there are two available licenses: a CDLA-Sharing license and a CDLA_permissive license.

Universities and libraries have contributed to open data efforts by providing technology infrastructure, expert consultants, handouts and templates to make the process of sharing data easier. The UT Libraries have teamed up with the Texas Advanced Computing Center, Information Technology at UT, and Office of the Vice Provost for Research to provide Research Data Services to the UT campus community. In conjunction with the Texas Digital Library, we also offer the Texas Data Repository as a data archiving and sharing option for our researchers. The Texas Data Repository is just one of many data repositories that exist to help researchers share their data. re3data offers a fairly comprehensive list of possible data repositories: https://www.re3data.org/.

Open data has the potential to move science forward, to improve the integrity of scientific research by allowing for the verification of research results, to improve the return on investment by reducing duplication of effort, and to enhance the visibility of research efforts by making data easier to find, use, and cite.

If you are interested in learning more about open data, here are a few helpful resources:

SPARC: https://sparcopen.org/open-data/
Austin Open Data: https://data.austintexas.gov/
Federal Government Data: https://www.data.gov/open-gov/
Open Data Handbook: http://opendatahandbook.org/
The State of Open Data Report 2017: https://figshare.com/articles/The_State_of_Open_Data_Report_2017/5481187

Donuts are BACK!

Research Data Services is delighted to host another semester of the popular Data & Donuts workshop series. There is plenty of new content as well as some old favorites — all aimed at promoting good practice in managing research data from project planning to long-term archiving.

All workshops will be on Fridays at 3:00pm in the The Perry-Castañeda Library (PCL). Dates and details below:

Managing Research Data: A Guide to Good Practice
Jessica Trelogan
September 8
PCL Learning Lab 4
details

Sweet Technical Details (from ITS)
Angela Newell
September 15
PCL Learning Lab 1
details

Cleaning Data with OpenRefine
Jessica Trelogan
September 22
PCL Learning Lab 1
details

Metadata Basics
Melanie Cofield
September 29
PCL Learning Lab 1
details

Introduction to Data Visualization
Allyssa Guzman and Sarah Morris
October 6
PCL Learning Lab 1
details

Introduction to R and the Tidyverse
Spencer Fox
October 13 (2 hours)
PCL Learning Lab 1
details

Introduction to Tableau
Bonnie Brown Real and Krystal Wyatt-Baxter
October 20
PCL Learning Lab 1
details

Archiving and Publishing Research Data with the Texas Data Repository
Jessica Trelogan
October 27
PCL Learning Lab 1
details

Working with Geospatial Data
Jessica Trelogan
November 3
PCL Learning Lab 1
details

Call for workshop papers — Computational Archival Science: digital records in the age of big data.

The workshop will be held on Wednesday, December 13, 2017 in Boston, MA, USA, in conjunction with the 2017 IEEE International Conference on Big Data (IEEE BigData 2017).

See call for papers at: http://dcicblog.umd.edu/cas/ieee_big_data_2017_cas-workshop/

***** Submission Deadline: October 10, 2017 *****

The large-scale digitization of analog archives, the emerging diverse forms of digital records and systems, and new ways of engaging with archival material using technology, are resulting in disruptions to traditional archival theories and practices. Increasing quantities of ‘big archival data’ present challenges for the practitioners and researchers who work with archival material, but also offer enhanced possibilities for use and scholarly exploration through the application of computational methods and tools.

This workshop will explore this conjunction of emerging methods and technologies around digital records and big data with archival theory and practice, and will examine new forms of records generation and historical, social, scientific, and cultural research engagement with archival institutions. We aim to identify and evaluate current trends, requirements, and potential in these areas, to examine the new questions that they can provoke, and to help determine possible research agendas for the evolution of computational archival science in the coming  years, as well as addressing the questions and concerns scholarship is raising about the interpretation of ‘big data’ and the uses to which it is put.

Full papers, of up to 10 pages, should be submitted via the online submission system at https://wi-lab.com/cyberchair/2017/bigdata17/scripts/ws_submit.php. We also encourage submission of short papers (up to 6 pages) reporting work in progress. The submission deadline is October 10, 2017. All papers accepted will be included in the proceedings published by the IEEE Computer Society Press.

The workshop builds on a number of recent developments in Computational Archival Science (see: http://dcicblog.umd.edu/cas/), and in particular on the 1st Computational Archival Science workshop at IEEE Big Data 2016 (see: http://dcicblog.umd.edu/cas/ieee_big_data_2016_cas-workshop/), which attracted a range of professionals and researchers, both from the DC area and internationally.

For more information, see the full workshop Call for Papers at http://dcicblog.umd.edu/cas/ieee_big_data_2017_cas-workshop/

 

Welcome Erika Boghici!

I’m delighted to announce that UT Libraries has a new GIS and Geospatial Data Coordinator, Erika Boghici, on staff. Erika has a strong background as a GIS professional with long experience in geospatial data collection, analysis, and the development of information products. She will be expanding our Research Data Services to offer campus-wide support for geospatial research, teaching, and learning.

Welcome, Erika!

Featured Project: Digital Rocks Portal

Digital Rocks is a data portal for fast storage and retrieval, sharing, organization and analysis of images of varied porous micro-structures. It has the purpose of enhancing research resources for modeling/prediction of porous material properties in the fields of Petroleum, Civil and Environmental Engineering as well as Geology.

This platform allows managing,  preserving, visualization and basic analysis of available images of porous materials and experiments performed on them, and any accompanying measurements (porosity, capillary pressure, permeability, electrical, NMR and elastic properties, etc.) required for both validation on modeling approaches and the upscaling and building of larger (hydro)geological models.

Read more about the project:  https://www.tacc.utexas.edu/-/digital-rock-physics-helps-scientists-understand-porous-media 

Data and Donuts!

Want to learn how to take better care of your research data? Love donuts? Join us on Friday afternoons this Spring for a bit of both! Research Data Services is hosting a special Workshops @ UT Libraries series aimed at promoting good data management practice throughout the research life cycle.

Fridays at 3pm this Spring. Locations vary.

Managing Research Data: A Guide to Good Practice
Jessica Trelogan
February 3
PCL Learning Lab 2

Finding and Accessing Data Sources
Carolyn Cunningham
February 10
PCL Data Lab

Data and Copyright
Colleen Lyon
February 17
PCL Learning Lab 2

Data Storage and Backups
Porcia Vaughn with Jessica Trelogan
February 24
Life Sciences Library, Main 220E

Working with Data in Spreadsheets
Jessica Trelogan
March 3
PCL Data Lab

Big Data Basics/Intro to HPC
Siva Kulesekaran
March 10
PCL Learning Lab 2

Cleaning Data with OpenRefine
Jessica Trelogan
March 24
PCL Data Lab

Introduction to Data Visualization
Sarah Morris and Allyssa Guzman
March 31
PCL Learning Lab 2

Working with Geospatial Data
Jessica Trelogan
April 7
PCL Learning Lab 2

Sharing and Publishing Research Data
Jessica Trelogan
April 14
PCL Learning Lab 2

Web Mapping with Carto
Katherine Strickland
April 21
PCL Data Lab

Texas Digital Library (TDL) new webinar series

The Texas Digital Library (TDL) community is invited to attend a new 3-part webinar series, Laying the Foundation for Research Data Services. This series addresses a need identified at the TDL Fall 2016 Data Symposium: to equip librarians with information on how to communicate data services to their campus communities. In this series, attendees will be introduced to the Texas Data Repository’s role in the research data infrastructure and the Data Information Literacy (DIL) model as a framework for building a data service program. Registration is required, and free to TDL members.

  • Session 1. Getting Started with the Texas Data Repository and Data Competencies
    • Friday, February 17, 2017 | 11:00 AM – 12:00 PM (CST)
    • This initial webinar will cover the Texas Data Repository’s role as an integral part of the research data infrastructure in Texas and introduce a set of competencies for data training on your campus. This session will preview following sessions and include results from the data symposium and how these webinars address some of these needs.
  • Session 2. Teaching Data: Developing Data Instruction Using a Multi-Level Competency Model
    • Tuesday, February 28, 2017 | 11:00 AM – 12:00 PM (CST)
    • In this webinar, we will discuss how the Data Information Literacy (DIL) model has been expanded at the University of Texas at Arlington as an adaptable frame on which to build an entire data services program. The presenter will describe the data competencies, their levels, and expectations for teaching emerging, intermediate, and expert level audiences.
  • Session 3. Learning By Example: Connecting Data Competencies with the Texas Data Repository
    • Thursday, March 9, 2017 | 11:00 AM – 12:00 PM (CST)
    • This webinar session will connect teaching data competencies explored in Session 2 with core functionality of the Texas Data Repository. It will address data competency learning outcomes, including: using the TDR to add, describe, share, and publish data; managing data, including versioning and de-accessioning data; and downloading and using data.

Presenters:

  • Kristi Park, Executive Director, Texas Digital Library
  • Santi Thompson, Head of Digital Repository Services, University of Houston Libraries
  • Peace Williamson, Director of Research Data Services, University of Texas at Arlington Libraries

Visit the TDL Webinars page for more information and to register.

 

Open Research Funders Group (ORFG) launches today

Heather Joseph, the Executive Director of SPARC, shared with members today the formal launch of the Open Research Funders Group (ORFG).

“The ORFG is a partnership of eight high-profile funding organizations who will work together on efforts to promote open access to research outputs. Together, they represent nearly $5 billion in combined annual grant funding and have committed to foster efforts in sharing research articles and data. The inaugural ORFG members believe that this collaboration will accelerate the pace of discovery, reduce information-sharing gaps, encourage innovation, and promote better reproducibility in the research environment. ”

For more information on the Open Research Funders Group, please visit their website at www.orfg.org.

 

Drop-in Hours in the Scholars Commons

Drowning in data? Need help with a Data Management Plan? Looking for a “forever home” for your research results?

Drop-in for a free consultation!

Wednesdays 2-4 pm in the Scholars Commons

UT Libraries’ Research Data Services support faculty, staff, and students with data needs throughout the life-cycle of research, from the earliest planning stage to long-term archiving. We can also help with most software available in the Data Lab.