In the following sections we will provide a brief overview of the most commonly encountered types of linguistic resources ranging from text corpora and multimodal corpora to lexical resources. In Chapter 7, Linguistic tools a broad range of automatic segmentation, annotation and analysis tools is presented that provide working implementations of the methods already touched upon in Part I, “Basic concepts” and for many tasks beyond. Finally, in Chapter 8, Web services: Accessing and using linguistic tools we demonstrate how resources and tools can be dynamically connected in general and within the CLARIN-D infrastructure in particular.
We assume that readers are already familiar with basic data modeling and representation standards such as the extensible markup language (XML) or different character encodings representing Unicode or older legacy encodings. These standards are of great importance well beyond the linguistic domain and not covered in this user guide.