0% found this document useful (0 votes)
74 views29 pages

Research Data Management by DR RC Gaur

Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
74 views29 pages

Research Data Management by DR RC Gaur

Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 29

MOOCs on Information Handling Skills for

Teaching , Learning and Research

Research Data Management

Professor (Dr.) Ramesh C. Gaur


Dean and Director (Library & Information)
Indira Gandhi National Centre for Arts (IGNCA), New Delhi
Autonomous body of Ministry of Culture, GOI
ICAR- National Agricultural Higher Education Project on NKMC4AER
Professor Jayashankar Telangana State Agricultural University, Hyderabad
MOOCs on Information Handling Skills for Teaching ,
Learning and Research

Introduction
 What is Data, and Research Data
 Why to Manage Research Data
 How to manage Research Data
 Various stakeholders in RDM
 Research Data Management Planning
 Data Sharing and Reuse by Creating Data Repositories
 Role of Libraries in RDM
MOOCs on Information Handling Skills for Teaching ,
Learning and Research

What is data?
facts and statistics collected together for reference or analysis
(dictionary)
 "[a] reinterpretable representation of information in a formalized
manner suitable for communication, interpretation, or processing
Data is a collection of facts, such as numbers, words,
measurements, observations or just descriptions of things.
MOOCs on Information Handling Skills for Teaching ,
Learning and Research

Types of data
•quantitative and qualitative.
oPrimary or secondary
Discrete or Continuous
observational data
laboratory experimental data
computer simulation
textual analysis
physical artifacts or relics
MOOCs on Information Handling Skills for Teaching ,
Learning and Research

Glossary of data related issues


• Data access management • e-Research
• Data Anonymization • E-Science (or eScience)
• Data audit • E-Scholarship
• Data curation • Linked open data
• Data life cycle • Metadata
• Data privacy • Ontology
• Data publishing • Preservation
• Data repository • Provenance data
• Data standard • Publishing data
• preservation, discovery, use, reuse, and manipulation of • Research life-cycle
scientific data objects supporting published research. • Repository
• Data stewardship • Resource Description Framework (RDF)
• Digital curation • Restricted-use data
• Digital preservation • Semantic web
• Web Ontology Language (OWL)
MOOCs on Information Handling Skills for Teaching ,
Learning and Research

Research Data
• Research data can be defined as “the recorded factual material
commonly accepted in the scientific community as necessary to
validate research findings. (University of Leicester)

• “The data needed to validate the results presented in scientific


publications" or "the evidence used to inform or support
research conclusions" (University of Sheffield).
Types of Research Data

Observational :remote sensing data,


survey data, field recordings, sample data
Experimental: example, gene sequences,
chromatograms, magnetic field data
Models or simulations: For example,
climate models, economic models
Derived or compiled: text and data
mining, compiled databases, 3D models
Research Data Life Cycle
Research Data Management
Sometimes known as research data curation, essentially involves collecting,
organizing, Preserving and dissemination of data so that data can access easily
and re-use.
RDM Lifecycle
Why Should We Manage Research Data
Research data are a valuable resource that often requires a great deal of
time and money to create
• Facilitate Data security
• Minimize the risk of data loss.
• Ensure research integrity and
validation of results
• Ensure wider dissemination and
increased impact
• Enable research continuity
through secondary data use.
• Preservation of Research Data
• Mandate from Publishers
• funding agencies
• Help in Research Collaboration
• Improve Efficiency of Research
The FAIR Data Principles , developed and • Reduce duplication of effort by
enabling others to use your data
endorsed by researchers, publishers, funding
agencies and industry partners in 2016.
MOOCs on Information Handling Skills for Teaching ,
Learning and Research

Stakeholders in RDM with Role and Responsibilities


A number of different stakeholders are involved in the research process and have a role
to play in ensuring good practices.

Stakeholders include
Institutional leaders;
Research supervisor and Research Scholars;
Dean – Research Office/ Research administrators;
Library
University IT team; and
Funding Agencies or external data repositories.
RDM Role and Responsibilities..1
RDM Role and Responsibilities..2

Source: http://www.ukoln.ac.uk/ukoln/staff/e.j.lyon/reports/dealing_with_data_report -final.pdf


MOOCs on Information Handling Skills for Teaching ,
Learning and Research

How to Manage Research Data


•Data Formats
•Metadata and Documentation
•Dataset Licensing
•Data Management Tools
•Choose relevant Open Data Repositories or Develop
Institutional Data Repositories
•Persistent identifiers
MOOCs on Information Handling Skills for Teaching ,
Learning and Research

Data Management Plan (DMP)..1


A researcher needs to make the plan in compliance with funders and Institutional requirements
 Data Collection
 What data will you collect or create?
 What type, format and volume of data?
 Do your chosen formats and software enable sharing and long-term access to the data?
 Are there any existing data that you can reuse?
 How will the data be collected or created?
 What standards or methodologies will you use?
 How will you structure and name your folders and files?
 How will you handle versioning?
 What quality assurance processes will you adopt?
 Documentation and Metadata
 What documentation and metadata will accompany the data?
 What information is needed for the data to be to be read and interpreted in the future?
 How will you capture / create this documentation and metadata?
 What metadata standards will you use and why?

Source: http://www.dcc.ac.uk/sites/default/files/documents/resource/DMP/DMP_Checklist_2013.pdf
MOOCs on Information Handling Skills for Teaching ,
Learning and Research

Data Management Plan (DMP)..2


A researcher needs to make the plan in compliance with funders and Institutional
requirements
 Ethics and legal compliance
 How will you manage any ethical issues?
 Have you gained consent for data preservation and sharing?
 How will you protect the identity of participants if required? e.g. via anonymization
 How will sensitive data be handled to ensure it is stored and transferred securely?
 How will you manage copyright and Intellectual Property Rights (IPR) issues?
 Who owns the data?
 How will the data be licensed for reuse?
 Are there any restrictions on the reuse of third-party data?
 Will data sharing be postponed / restricted e.g. to publish or seek patents?
Source: http://www.dcc.ac.uk/sites/default/files/documents/resource/DMP/DMP_Checklist_2013.pdf
MOOCs on Information Handling Skills for Teaching ,
Learning and Research

Data Management Plan (DMP)…3


A researcher needs to make the plan in compliance with funders and Institutional requirements
 Storage and Backup

 How will the data be stored and backed up during the research?
 Do you have sufficient storage or will you need to include charges for additional services?
 How will the data be backed up?
 Who will be responsible for backup and recovery?
 How will the data be recovered in the event of an incident?
 How will you manage access and security?
 What are the risks to data security and how will these be managed?
 How will you control access to keep the data secure?
 How will you ensure that collaborators can access your data securely?
 Creating or collecting data in the field how will you ensure its safe transfer into your main secured
systems?

Source: http://www.dcc.ac.uk/sites/default/files/documents/resource/DMP/DMP_Checklist_2013.pdf
MOOCs on Information Handling Skills for Teaching ,
Learning and Research

Data Management Plan (DMP)..4


A researcher needs to make the plan in compliance with funders and Institutional requirements

 Selection and Preservation

 Which data should be retained, shared, and/or preserved?


 What data must be retained/destroyed for contractual, legal, or regulatory purposes?
 How will you decide what other data to keep?
 What are the foreseeable research uses for the data?
 How long will the data be retained and preserved?
 What is the long-term preservation plan for the dataset?
 Where e.g. in which repository or archive will the data be held?
 What costs if any will your selected data repository or archive charge?
 Have you costed in time and effort to prepare the data for sharing / preservation?

Source: http://www.dcc.ac.uk/sites/default/files/documents/resource/DMP/DMP_Checklist_2013.pdf
MOOCs on Information Handling Skills for Teaching ,
Learning and Research

Data Management Plan (DMP)..5


 Data Sharing

 How will you share the data?


 How will potential users find out about your data?
 With whom will you share the data, and under what conditions?
 Will you share data via a repository, handle requests directly or use another mechanism?
 When will you make the data available?
 Will you pursue getting a persistent identifier for your data?

 Are any restrictions on data sharing required?


 What action will you take to overcome or minimize restrictions?
 For how long do you need exclusive use of the data and why?
 Will a data sharing agreement (or equivalent) be required?
Source: http://www.dcc.ac.uk/sites/default/files/documents/resource/DMP/DMP_Checklist_2013.pdf
MOOCs on Information Handling Skills for Teaching ,
Learning and Research

Data Management Plan (DMP)..6


 Responsibilities and Resources

 Who will be responsible for data management?


 Who is responsible for implementing the DMP, and ensuring it is reviewed and revised?
 Who will be responsible for each data management activity?
 How will responsibilities be split across partner sites in collaborative research projects?
 Will data ownership and responsibilities for RDM be part of any consortium agreement or
contract agreed between partners?
 What resources will you require to deliver your plan?
 Is additional specialist expertise (or training for existing staff) required?
 Do you require hardware or software which is additional or exceptional to existing institutional
provision?
 Will charges be applied by data repositories?
Source: http://www.dcc.ac.uk/sites/default/files/documents/resource/DMP/DMP_Checklist_2013.pdf
MOOCs on Information Handling Skills for Teaching ,
Learning and Research

Types of Data Management Tools


Cloud Data Management tools - built on the cloud, for the cloud, these tools connect to and integrate multiple
data sources via API’s, webhooks, or direct database connections. Ex. Amazon, Google cloud etc
ETL tools - help organizations load data from multiple sources, define complex, automated transformations of the
data, test the data pipeline, and load data continuously to a target database or data warehouse.
Data Transformation tools - help with the transformation of raw data into clean, aggregated, analyzable data as
it moves from individual data sources to an analytics warehouse--or within the analytics warehouse, at the point of
analysis.
Master Data Management (MDM) tools - help visualize complex sets of master data across the organization,
and facilitate data stewardship by subject matter experts, who oversee creation and maintenance of reference
data.
Reference Data Management (RDM) tools - often provided as part of MDM suites, define business processes
around reference data, and help stakeholders populate reference data and manage it over time.
Data visualization and data analytics tools - explore, analyze and visualize big data sets, and generate reports
and dashboards to extract insights and guide business decision

https://blog.panoply.io/28-data-management-tools-5-ways-of-thinking-about-data-management
MOOCs on Information Handling Skills for Teaching ,
Learning and Research

Open Data Repositories


• Figshare figshare is a repository where users can make all of their research outputs available
in a citable, shareable and discoverable manner
• Dryad Digital RepositoryThe Dryad Digital Repository is a curated resource that makes the
data underlying scientific publications discoverable, freely reusable, and citable. Dryad
provides a general-purpose home for a wide diversity of datatypes.
• DataverseA personal dataverse is easy to set up, allows you to display your data on your
personal website, can be branded uniquely as your research program, makes your data more
discoverable to the research community, and satisfies data management plans.
• Open Science Framework OSF is a free, open platform to support your research and enable
collaboration.
• Zenodo Zenodo research data repository manages data from all over the world, and from
every discipline.
MOOCs on Information Handling Skills for Teaching ,
Learning and Research

Figshare
MOOCs on Information Handling Skills for Teaching ,
Learning and Research

DRYAD Research Data Repository https://datadryad.org/stash/


MOOCs on Information Handling Skills for Teaching ,
Learning and Research

Open Refine – Data Repository https://openrefine.org/

x
MOOCs on Information Handling Skills for Teaching ,
Learning and Research

Data is not always sharable


•preliminary analyses
•drafts of scientific papers
•plans for future research
•peer reviews, or communications with colleagues
•physical objects (e.g., laboratory samples)
•trade secrets
•commercial information
•materials necessary to be held confidential by a
researcher until they are published, or similar
information which is protected under law
MOOCs on Information Handling Skills for Teaching ,
Learning and Research

Conclusion-Role of Libraries in RDM


•Does your library currently provide local repository services
for research data (institutional repository, data repository,
other)?
•Which of the following platforms are you using for your data
repository- Dspace, Fedora, CKAN/DKAN, iRODS,
Dataverse, Digital Commons, Customized etc
•What metadata schema are you primarily using for
discovery of data Dublin, MODS, Datacite, Dataverse, Ddi,
custom etc.
MOOCs on Information Handling Skills for Teaching ,
Learning and Research

Acknowledgements
I would like express my sincere thanks to Authors of
various Internet sources used to prepare this
presentation.
Wherever possible the links have been provided.
However any omission is duly regretted.
The presentation is mainly prepared to create an
awareness amongst Librarians, students and
researchers about the e-Research Literacy.
MOOCs on Information Handling Skills for Teaching ,
Learning and Research

Contact Details
Professor (Dr.) Ramesh C. Gaur, Fulbright Scholar (VT. USA)
Dean & Director (Library & Information)
Indira Gandhi National Centre for Arts (IGNCA), New Delhi
Autonomous body of Ministry of Culture, GOI
Ph.(Off) 011-23388333
Emails: rcgaur66@gmail.com ; gaur@ignca.nic.in,website:
www.ignca.gov.in
Profile:http://ignca.gov.in/PDF_data/profile_of_dr_ramesh_c_g
aur.pdf

You might also like

pFad - Phonifier reborn

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.


Alternative Proxies:

Alternative Proxy

pFad Proxy

pFad v3 Proxy

pFad v4 Proxy