Week 1 |
Introduction to Data Curation |
1 |
Introduction |
|
Syllabus
Course Deadline, Late Policy and Academic Calendar
Meet the staff |
Preparing the Workforce for Digital Curation |
|
Orientation Quiz Week 1 Quiz |
2 |
What is data science? |
IS531_FoundationsofDataCuration_M1V1.pdf |
3 |
What is data curation? |
IS531_FoundationsofDataCuration_M1V2.pdf |
4 |
Objectives, Activities and Methods |
IS531_FoundationsofDataCuration_M1V3.pdf |
5 |
Organizations, Conferences and Literature |
IS531_FoundationsofDataCuration_M1V4.pdf |
Week 2 |
Data Abstraction - Relations |
1 |
Week 2 Introduction Video |
|
Week 2 Overview |
|
Assignment 1 Information |
Week 2 Quiz |
2 |
Data Models |
IS531_FoundationsofDataCuration_M2V1.pdf |
3 |
The Problem |
IS531_FoundationsofDataCuration_M2V2.pdf |
4 |
The Relational Model |
IS531_FoundationsofDataCuration_M2V3.pdf |
5 |
How is the Relational Model Implemented? |
IS531_FoundationsofDataCuration_M2V4.pdf |
6 |
Abstraction, Indirection & Data Independence |
IS531_FoundationsofDataCuration_M2V5.pdf |
Week 3 |
Data Abstraction - Trees |
1 |
Week 3 Introduction |
|
Week 3 Overview |
https://tei-c.org/release/doc/tei-p5-doc/en/html/SG.html
Optional Readings
https://dl.acm.org/doi/abs/10.1145/872730.806456https://dl.acm.org/doi/10.1145/32206.32209https://companions.digitalhumanities.org/DH/?chapter=content/9781405103213_chapter_17.html |
|
Week 3 Quiz |
2 |
Text and Documents |
IS531_FoundationsofDataCuration_M3V1.pdf |
3 |
The Problem |
IS531_FoundationsofDataCuration_M3V2.pdf |
4 |
The Solution: (1) Descriptive Markup |
IS531_FoundationsofDataCuration_M3V3.pdf |
5 |
The Solution: (2) Trees |
IS531_FoundationsofDataCuration_M3V4.pdf |
6 |
Why The Solution Works |
IS531_FoundationsofDataCuration_M3V5.pdf |
7 |
Implementing The Solution: XML |
IS531_FoundationsofDataCuration_M3V6.pdf |
Week 4 |
Data Abstraction - Ontologies |
1 |
Week 4 Introduction |
|
Week 4 Overview |
https://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.526.369&rep=rep1&type=pdfhttps://www.researchgate.net/publication/2934664_Towards_a_Semantics_for_XML_Markup
Optional Readings
https://www.researchgate.net/publication/3420451_What_Are_Ontologies_and_Why_Do_We_Need_Themhttps://www.sciencedirect.com/science/article/pii/S1532046405001310https://pubmed.ncbi.nlm.nih.gov/18289717/https://www.cidoc-crm.org/Resources/the-cidoc-crm-a-standard-for-the-integration-of-cultural-information-2https://link.springer.com/chapter/10.1007/978-3-642-04590-5_13 |
|
Week 4 Quiz |
2 |
The Problem: Connecting Data to Information |
IS531_FoundationsofDataCuration_M4V1.pdf |
3 |
The Solution: Ontologies |
IS531_FoundationsofDataCuration_M4V2.pdf |
4 |
An ER/Ontology Example: FRBR |
IS531_FoundationsofDataCuration_M4V3.pdf |
5 |
Implementing Ontologies in RDF/RDFS |
IS531_FoundationsofDataCuration_M4V4.pdf |
Week 5 |
Data Integration |
1 |
Week 5 Introduction |
|
Week 5 Overview |
https://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.59.5998&rep=rep1&type=pdf
Optional Readings
https://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.107.7214&rep=rep1&type=pdf |
Assignment 2 Information |
Week 5 Quiz |
2 |
Data Cleaning, Data Integration |
IS531_FoundationsofDataCuration_M5V1.pdf |
3 |
Managing Heterogeneity |
IS531_FoundationsofDataCuration_M5V2.pdf |
4 |
Schema Integration |
IS531_FoundationsofDataCuration_M5V3.pdf |
5 |
Schema Integration: an example |
IS531_FoundationsofDataCuration_M5V4.pdf |
Week 6 |
Data Concepts |
1 |
Week 6 Introduction |
|
Week 6 Overview |
https://www.semanticscholar.org/paper/Identifying-content-and-levels-of-representation-in-Wickett-Sacchi/28825d6752e6468765622f4bfe8d44f5fef57555?p2df
Optional Readings
https://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.476.8466&rep=rep1&type=pdfhttps://academiccommons.columbia.edu/doi/10.7916/D84Q7S4Ghttps://core.ac.uk/download/pdf/27299254.pdf |
|
Week 6 Quiz |
2 |
What is data? A first attempt |
IS531_FoundationsofDataCuration_M6V1.pdf |
3 |
The Identity Problem |
IS531_FoundationsofDataCuration_M6V2.pdf |
4 |
Some Ontological Analysis |
IS531_FoundationsofDataCuration_M6V3.pdf |
5 |
A Way Forward: Roles and Types |
IS531_FoundationsofDataCuration_M6V4.pdf |
6 |
An Ontology for Data Concepts |
IS531_FoundationsofDataCuration_M6V5.pdf |
7 |
What is data? |
IS531_FoundationsofDataCuration_M6V6.pdf |
Week 7 |
Metadata |
1 |
Week 7 Introduction |
|
Week 7 Overview |
https://www.fidgeo.de/wp-content/uploads/2016/07/2017_01-NISO-understanding-metadata.pdfhttps://dl.acm.org/doi/10.1145/1107499.1107503
Optional Readings:
https://onlinelibrary.wiley.com/doi/10.1002/asi.22683http://www.ijdc.net/index.php/ijdc/article/view/66https://pubs.geoscienceworld.org/books/book/641/chapter-abstract/3806354/Geoscience-metadata-No-pain-no-gain?redirectedFrom=fulltexthttps://www.jstor.org/stable/2269427 |
|
Week 7 Quiz |
2 |
What is Metadata? |
IS531_FoundationsofDataCuration_M7V1.pdf |
3 |
Metadata Schemas |
IS531_FoundationsofDataCuration_M7V2.pdf |
4 |
Common Metadata Ambiguities |
IS531_FoundationsofDataCuration_M7V3.pdf |
5 |
How does metadata support data curation? |
IS531_FoundationsofDataCuration_M7V4.pdf |
Week 8 |
Data Practices |
1 |
Week 8 Introduction |
|
Week 8 Overview
Data Anonymization
Data Integrity |
https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3126798/ |
Assignment 3 Information |
[Graded] Assignment 3 Data Privacy Quiz |
2 |
Data Practices |
IS531_FoundationsofDataCuration_tempM14V1.pdf |
3 |
What’s going on in the lab? |
IS531_FoundationsofDataCuration_tempM14V2.pdf |
4 |
Data sharing |
IS531_FoundationsofDataCuration_tempM14V3.pdf |
5 |
Data Reuse |
IS531_FoundationsofDataCuration_tempM14V4.pdf |
Week 9 |
Preservation |
1 |
Week 9 Introduction |
|
Week 9 Overview |
|
|
Week 9 Quiz |
2 |
Introduction to data preservation challenges |
IS531_FoundationsofDataCuration_tempM8V1.pdf |
3 |
What is data preservation? |
IS531_FoundationsofDataCuration_tempM8V2.pdf |
4 |
The preservation<—>integration parallels |
IS531_FoundationsofDataCuration_tempM8V3.pdf |
5 |
Standard data preservation strategies |
IS531_FoundationsofDataCuration_tempM8V4.pdf |
6 |
Two data preservation standards |
IS531_FoundationsofDataCuration_tempM8V5.pdf |
Week 10 |
Identity |
1 |
Week 10 Introduction |
|
Week 10 Overview |
|
Assignment 3 Report InstructionsReading•. Duration: 10 minutes10 min |
Week 10 Quiz |
2 |
Why is identification important? |
IS531_FoundationsofDataCuration_tempM9V1.pdf |
3 |
What are we identifying? |
IS531_FoundationsofDataCuration_tempM9V2.pdf |
4 |
How do we identify? |
IS531_FoundationsofDataCuration_tempM9V3.pdf |
5 |
Identifiers and Change |
Identifiers and Change slides.pdf |
6 |
A practical example: XML canonicalization |
IS531_FoundationsofDataCuration_tempM9V4.pdf |
Week 11 |
Standards |
1 |
Week 11 Introduction |
|
Week 11 Overview |
|
|
Week 11 Quiz |
2 |
Standards and standards organizations |
IS531_FoundationsofDataCuration_tempM10V1.pdf |
3 |
Some standard standards maneuvers |
IS531_FoundationsofDataCuration_tempM10V2.pdf |
4 |
Compatibility |
IS531_FoundationsofDataCuration_tempM10V3.pdf |
Week 12 |
Workflow, Provernance and Reproducibility |
1 |
Week 12 Introduction |
|
Week 12 Overview |
https://dl.acm.org/doi/10.1145/2602649.2602651 |
Assignment 4 Information |
Week 12 Quiz |
2 |
Workflow |
IS531_FoundationsofDataCuration_tempM11V1.pdf |
3 |
Provenance |
IS531_FoundationsofDataCuration_tempM11V2.pdf |
4 |
Workflow Systems |
IS531_FoundationsofDataCuration_tempM11V3.pdf |
5 |
Introduction to Docker by Peter Organisciak |
Docker for Preservation.pdf |
6 |
Provenance Standards |
LIS-490-guest-lecture-provenance-1.pdf |
Week 13 |
Communication |
1 |
Week 13 Introduction |
|
Week 13 Overview |
https://www.science.org/doi/10.1126/science.1157784https://jbiomedsem.biomedcentral.com/articles/10.1186/2041-1480-3-S1-S1 |
|
Week 13 Quiz |
2 |
Communication issues in data curation |
IS531_FoundationsofDataCuration_tempM13V1.pdf |
3 |
The crisis in data-driven scientific communication |
IS531_FoundationsofDataCuration_tempM13V2.pdf |
4 |
The solution to the data crisis is... |
IS531_FoundationsofDataCuration_tempM13V3.pdf |
Week 14 |
|
- |
- |
|
- |
|
- |
- |
Week 15 |
Data Governance,Secuirty, Policy, Law and Ethics |
1 |
Week 15 Introduction |
|
Week 15 Overview |
https://ieeexplore.ieee.org/document/4720221https://journals.plos.org/plosbiology/article?id=10.1371/journal.pbio.1002235 |
|
Week 15 Quiz |
2 |
Definitions, types, scope, issues |
IS531_FoundationsofDataCuration_tempM15V1.pdf |
3 |
A whirlwind tour of science-oriented data policies |
IS531_FoundationsofDataCuration_tempM15V2.pdf |
4 |
Privacy |
IS531_FoundationsofDataCuration_tempM15V3.pdf |
5 |
Panel Discussion |
|
Week 16 |
Practical Data Curation |
1 |
What is it? |
What is it.pdf |
Week 16 Overview |
|
|
|
2 |
Models, Integration & Concepts |
Models, Integration and Concepts.pdf |
3 |
Metadata |
Metadata.pdf |
4 |
Workflow and Provenance |
Workflow and Provenance.pdf |