CLARIN-D User Guide

Table of Contents
Introduction and background
About this book
Who should read this book?
How to use this book
Release history
I. Basic concepts
1. Concepts and data categories
Data Categories and Data Category Registries
ISOcat, a Data Category Registry
2. Metadata
Managing and Accessing Data
Objects, Collections, Granularity
Types of Resources and Metadata Components
Lifecycle Management
Existing metadata sets
The Component Metadata Initiative (CMDI)
Aggregation
Recommendations
3. Resource annotations
Aspects of annotations
Exchange and combination of annotations
Recommendations
4. Access to resources and tools – technical and legal issues
Single Sign-on access to the CLARIN-D infrastructure
Legal Issues
5. Quality assurance
Aspects of the quality of resources
Recommendations
II. Linguistic resources and tools
6. Types of resources
General recommendations
Text Corpora
Multimodal corpora
Lexical resources
7. Linguistic tools
Hierarchies of linguistic tools
Automatic and manual analysis tools
Technical issues in linguistic tool management
Automatic segmentation and annotation tools
Manual annotation and analysis tools
Multimedia tools
Recommendations for CLARIN-D tool designers
8. Web services: Accessing and using linguistic tools
Web Services
Service-oriented architectures
WebLicht – A service-oriented architecture for linguistic resources and tools
WebLicht usage scenarios
Integrating existing linguistic tools into WebLicht
Bibliography
List of Figures
List of Examples