Week Week Title Lecture # Lecture Video Slides Overview Documents Reading Assignments Quizzes
Week 1 Introduction to Data Curation 1 Introduction Syllabus

Course Deadline, Late Policy and Academic Calendar

Meet the staff
Preparing the Workforce for Digital Curation Orientation Quiz
Week 1 Quiz
2 What is data science? IS531_FoundationsofDataCuration_M1V1.pdf
3 What is data curation? IS531_FoundationsofDataCuration_M1V2.pdf
4 Objectives, Activities and Methods IS531_FoundationsofDataCuration_M1V3.pdf
5 Organizations, Conferences and Literature IS531_FoundationsofDataCuration_M1V4.pdf
Week 2 Data Abstraction - Relations 1 Week 2 Introduction Video Week 2 Overview Assignment 1 Information Week 2 Quiz
2 Data Models IS531_FoundationsofDataCuration_M2V1.pdf
3 The Problem IS531_FoundationsofDataCuration_M2V2.pdf
4 The Relational Model IS531_FoundationsofDataCuration_M2V3.pdf
5 How is the Relational Model Implemented? IS531_FoundationsofDataCuration_M2V4.pdf
6 Abstraction, Indirection & Data Independence IS531_FoundationsofDataCuration_M2V5.pdf
Week 3 Data Abstraction - Trees 1 Week 3 Introduction Week 3 Overview https://tei-c.org/release/doc/tei-p5-doc/en/html/SG.html

Optional Readings

https://dl.acm.org/doi/abs/10.1145/872730.806456https://dl.acm.org/doi/10.1145/32206.32209https://companions.digitalhumanities.org/DH/?chapter=content/9781405103213_chapter_17.html
Week 3 Quiz
2 Text and Documents IS531_FoundationsofDataCuration_M3V1.pdf
3 The Problem IS531_FoundationsofDataCuration_M3V2.pdf
4 The Solution: (1) Descriptive Markup IS531_FoundationsofDataCuration_M3V3.pdf
5 The Solution: (2) Trees IS531_FoundationsofDataCuration_M3V4.pdf
6 Why The Solution Works IS531_FoundationsofDataCuration_M3V5.pdf
7 Implementing The Solution: XML IS531_FoundationsofDataCuration_M3V6.pdf
Week 4 Data Abstraction - Ontologies 1 Week 4 Introduction Week 4 Overview https://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.526.369&rep=rep1&type=pdfhttps://www.researchgate.net/publication/2934664_Towards_a_Semantics_for_XML_Markup

Optional Readings

https://www.researchgate.net/publication/3420451_What_Are_Ontologies_and_Why_Do_We_Need_Themhttps://www.sciencedirect.com/science/article/pii/S1532046405001310https://pubmed.ncbi.nlm.nih.gov/18289717/https://www.cidoc-crm.org/Resources/the-cidoc-crm-a-standard-for-the-integration-of-cultural-information-2https://link.springer.com/chapter/10.1007/978-3-642-04590-5_13
Week 4 Quiz
2 The Problem: Connecting Data to Information IS531_FoundationsofDataCuration_M4V1.pdf
3 The Solution: Ontologies IS531_FoundationsofDataCuration_M4V2.pdf
4 An ER/Ontology Example: FRBR IS531_FoundationsofDataCuration_M4V3.pdf
5 Implementing Ontologies in RDF/RDFS IS531_FoundationsofDataCuration_M4V4.pdf
Week 5 Data Integration 1 Week 5 Introduction Week 5 Overview https://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.59.5998&rep=rep1&type=pdf

Optional Readings

https://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.107.7214&rep=rep1&type=pdf
Assignment 2 Information Week 5 Quiz
2 Data Cleaning, Data Integration IS531_FoundationsofDataCuration_M5V1.pdf
3 Managing Heterogeneity IS531_FoundationsofDataCuration_M5V2.pdf
4 Schema Integration IS531_FoundationsofDataCuration_M5V3.pdf
5 Schema Integration: an example IS531_FoundationsofDataCuration_M5V4.pdf
Week 6 Data Concepts 1 Week 6 Introduction Week 6 Overview https://www.semanticscholar.org/paper/Identifying-content-and-levels-of-representation-in-Wickett-Sacchi/28825d6752e6468765622f4bfe8d44f5fef57555?p2df

Optional Readings

https://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.476.8466&rep=rep1&type=pdfhttps://academiccommons.columbia.edu/doi/10.7916/D84Q7S4Ghttps://core.ac.uk/download/pdf/27299254.pdf
Week 6 Quiz
2 What is data? A first attempt IS531_FoundationsofDataCuration_M6V1.pdf
3 The Identity Problem IS531_FoundationsofDataCuration_M6V2.pdf
4 Some Ontological Analysis IS531_FoundationsofDataCuration_M6V3.pdf
5 A Way Forward: Roles and Types IS531_FoundationsofDataCuration_M6V4.pdf
6 An Ontology for Data Concepts IS531_FoundationsofDataCuration_M6V5.pdf
7 What is data? IS531_FoundationsofDataCuration_M6V6.pdf
Week 7 Metadata 1 Week 7 Introduction Week 7 Overview https://www.fidgeo.de/wp-content/uploads/2016/07/2017_01-NISO-understanding-metadata.pdfhttps://dl.acm.org/doi/10.1145/1107499.1107503

Optional Readings:

https://onlinelibrary.wiley.com/doi/10.1002/asi.22683http://www.ijdc.net/index.php/ijdc/article/view/66https://pubs.geoscienceworld.org/books/book/641/chapter-abstract/3806354/Geoscience-metadata-No-pain-no-gain?redirectedFrom=fulltexthttps://www.jstor.org/stable/2269427
Week 7 Quiz
2 What is Metadata? IS531_FoundationsofDataCuration_M7V1.pdf
3 Metadata Schemas IS531_FoundationsofDataCuration_M7V2.pdf
4 Common Metadata Ambiguities IS531_FoundationsofDataCuration_M7V3.pdf
5 How does metadata support data curation? IS531_FoundationsofDataCuration_M7V4.pdf
Week 8 Data Practices 1 Week 8 Introduction Week 8 Overview

Data Anonymization

Data Integrity
https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3126798/ Assignment 3 Information [Graded] Assignment 3 Data Privacy Quiz
2 Data Practices IS531_FoundationsofDataCuration_tempM14V1.pdf
3 What’s going on in the lab? IS531_FoundationsofDataCuration_tempM14V2.pdf
4 Data sharing IS531_FoundationsofDataCuration_tempM14V3.pdf
5 Data Reuse IS531_FoundationsofDataCuration_tempM14V4.pdf
Week 9 Preservation 1 Week 9 Introduction Week 9 Overview Week 9 Quiz
2 Introduction to data preservation challenges IS531_FoundationsofDataCuration_tempM8V1.pdf
3 What is data preservation? IS531_FoundationsofDataCuration_tempM8V2.pdf
4 The preservation<—>integration parallels IS531_FoundationsofDataCuration_tempM8V3.pdf
5 Standard data preservation strategies IS531_FoundationsofDataCuration_tempM8V4.pdf
6 Two data preservation standards IS531_FoundationsofDataCuration_tempM8V5.pdf
Week 10 Identity 1 Week 10 Introduction Week 10 Overview Assignment 3 Report InstructionsReading•. Duration: 10 minutes10 min Week 10 Quiz
2 Why is identification important? IS531_FoundationsofDataCuration_tempM9V1.pdf
3 What are we identifying? IS531_FoundationsofDataCuration_tempM9V2.pdf
4 How do we identify? IS531_FoundationsofDataCuration_tempM9V3.pdf
5 Identifiers and Change Identifiers and Change slides.pdf
6 A practical example: XML canonicalization IS531_FoundationsofDataCuration_tempM9V4.pdf
Week 11 Standards 1 Week 11 Introduction Week 11 Overview Week 11 Quiz
2 Standards and standards organizations IS531_FoundationsofDataCuration_tempM10V1.pdf
3 Some standard standards maneuvers IS531_FoundationsofDataCuration_tempM10V2.pdf
4 Compatibility IS531_FoundationsofDataCuration_tempM10V3.pdf
Week 12 Workflow, Provernance and Reproducibility 1 Week 12 Introduction Week 12 Overview https://dl.acm.org/doi/10.1145/2602649.2602651 Assignment 4 Information Week 12 Quiz
2 Workflow IS531_FoundationsofDataCuration_tempM11V1.pdf
3 Provenance IS531_FoundationsofDataCuration_tempM11V2.pdf
4 Workflow Systems IS531_FoundationsofDataCuration_tempM11V3.pdf
5 Introduction to Docker by Peter Organisciak Docker for Preservation.pdf
6 Provenance Standards LIS-490-guest-lecture-provenance-1.pdf
Week 13 Communication 1 Week 13 Introduction Week 13 Overview https://www.science.org/doi/10.1126/science.1157784https://jbiomedsem.biomedcentral.com/articles/10.1186/2041-1480-3-S1-S1 Week 13 Quiz
2 Communication issues in data curation IS531_FoundationsofDataCuration_tempM13V1.pdf
3 The crisis in data-driven scientific communication IS531_FoundationsofDataCuration_tempM13V2.pdf
4 The solution to the data crisis is... IS531_FoundationsofDataCuration_tempM13V3.pdf
Week 14 - - - - -
Week 15 Data Governance,Secuirty, Policy, Law and Ethics 1 Week 15 Introduction Week 15 Overview https://ieeexplore.ieee.org/document/4720221https://journals.plos.org/plosbiology/article?id=10.1371/journal.pbio.1002235 Week 15 Quiz
2 Definitions, types, scope, issues IS531_FoundationsofDataCuration_tempM15V1.pdf
3 A whirlwind tour of science-oriented data policies IS531_FoundationsofDataCuration_tempM15V2.pdf
4 Privacy IS531_FoundationsofDataCuration_tempM15V3.pdf
5 Panel Discussion
Week 16 Practical Data Curation 1 What is it? What is it.pdf Week 16 Overview
2 Models, Integration & Concepts Models, Integration and Concepts.pdf
3 Metadata Metadata.pdf
4 Workflow and Provenance Workflow and Provenance.pdf