Linked Data In Linguistics

Author: Christian Chiarcos
Editor: Springer Science & Business Media
ISBN: 3642282490
File Size: 11,13 MB
Format: PDF, ePub, Docs
Read: 340
Download

The explosion of information technology has led to substantial growth of web-accessible linguistic data in terms of quantity, diversity and complexity. These resources become even more useful when interlinked with each other to generate network effects. The general trend of providing data online is thus accompanied by newly developing methodologies to interconnect linguistic data and metadata. This includes linguistic data collections, general-purpose knowledge bases (e.g., the DBpedia, a machine-readable edition of the Wikipedia), and repositories with specific information about languages, linguistic categories and phenomena. The Linked Data paradigm provides a framework for interoperability and access management, and thereby allows to integrate information from such a diverse set of resources. The contributions assembled in this volume illustrate the band-width of applications of the Linked Data paradigm for representative types of language resources. They cover lexical-semantic resources, annotated corpora, typological databases as well as terminology and metadata repositories. The book includes representative applications from diverse fields, ranging from academic linguistics (e.g., typology and corpus linguistics) over applied linguistics (e.g., lexicography and translation studies) to technical applications (in computational linguistics, Natural Language Processing and information technology). This volume accompanies the Workshop on Linked Data in Linguistics 2012 (LDL-2012) in Frankfurt/M., Germany, organized by the Open Linguistics Working Group (OWLG) of the Open Knowledge Foundation (OKFN). It assembles contributions of the workshop participants and, beyond this, it summarizes initial steps in the formation of a Linked Open Data cloud of linguistic resources, the Linguistic Linked Open Data cloud (LLOD).

Linguistic Linked Data

Author: Philipp Cimiano
Editor: Springer Nature
ISBN: 3030302253
File Size: 18,25 MB
Format: PDF, ePub
Read: 4440
Download

This is the first monograph on the emerging area of linguistic linked data. Presenting a combination of background information on linguistic linked data and concrete implementation advice, it introduces and discusses the main benefits of applying linked data (LD) principles to the representation and publication of linguistic resources, arguing that LD does not look at a single resource in isolation but seeks to create a large network of resources that can be used together and uniformly, and so making more of the single resource. The book describes how the LD principles can be applied to modelling language resources. The first part provides the foundation for understanding the remainder of the book, introducing the data models, ontology and query languages used as the basis of the Semantic Web and LD and offering a more detailed overview of the Linguistic Linked Data Cloud. The second part of the book focuses on modelling language resources using LD principles, describing how to model lexical resources using Ontolex-lemon, the lexicon model for ontologies, and how to annotate and address elements of text represented in RDF. It also demonstrates how to model annotations, and how to capture the metadata of language resources. Further, it includes a chapter on representing linguistic categories. In the third part of the book, the authors describe how language resources can be transformed into LD and how links can be inferred and added to the data to increase connectivity and linking between different datasets. They also discuss using LD resources for natural language processing. The last part describes concrete applications of the technologies: representing and linking multilingual wordnets, applications in digital humanities and the discovery of language resources. Given its scope, the book is relevant for researchers and graduate students interested in topics at the crossroads of natural language processing / computational linguistics and the Semantic Web / linked data. It appeals to Semantic Web experts who are not proficient in applying the Semantic Web and LD principles to linguistic data, as well as to computational linguists who are used to working with lexical and linguistic resources wanting to learn about a new paradigm for modelling, publishing and exploiting linguistic resources.

Development Of Linguistic Linked Open Data Resources For Collaborative Data Intensive Research In The Language Sciences

Author: Antonio Pareja-Lora
Editor: MIT Press
ISBN: 0262536250
File Size: 60,60 MB
Format: PDF, Docs
Read: 3850
Download

Making diverse data in linguistics and the language sciences open, distributed, and accessible: perspectives from language/language acquistiion researchers and technical LOD (linked open data) researchers. This volume examines the challenges inherent in making diverse data in linguistics and the language sciences open, distributed, integrated, and accessible, thus fostering wide data sharing and collaboration. It is unique in integrating the perspectives of language researchers and technical LOD (linked open data) researchers. Reporting on both active research needs in the field of language acquisition and technical advances in the development of data interoperability, the book demonstrates the advantages of an international infrastructure for scholarship in the field of language sciences. With contributions by researchers who produce complex data content and scholars involved in both the technology and the conceptual foundations of LLOD (linguistics linked open data), the book focuses on the area of language acquisition because it involves complex and diverse data sets, cross-linguistic analyses, and urgent collaborative research. The contributors discuss a variety of research methods, resources, and infrastructures. Contributors Isabelle Barrière, Nan Bernstein Ratner, Steven Bird, Maria Blume, Ted Caldwell, Christian Chiarcos, Cristina Dye, Suzanne Flynn, Claire Foley, Nancy Ide, Carissa Kang, D. Terence Langendoen, Barbara Lust, Brian MacWhinney, Jonathan Masci, Steven Moran, Antonio Pareja-Lora, Jim Reidy, Oya Y. Rieger, Gary F. Simons, Thorsten Trippel, Kara Warburton, Sue Ellen Wright, Claus Zinn

Linguistic Linked Open Data

Author: Diana Trandabăţ
Editor: Springer
ISBN: 3319329421
File Size: 30,15 MB
Format: PDF, ePub
Read: 1167
Download

This book constitutes the refereed proceedings of the 12th EUROLAN Summer School on Linguistic Linked Open Data and its Satellite Workshop on Social Media and the Web of Linked Data, RUMOUR 2015, held in Sibiu, Romania, in July 2015. The 10 revised full papers presented together with 12 abstracts of tutorials were carefully reviewed and selected from 21 submissions.

Handbook Of Linguistic Annotation

Author: Nancy Ide
Editor: Springer
ISBN: 9402408819
File Size: 47,85 MB
Format: PDF, ePub
Read: 6838
Download

This handbook offers a thorough treatment of the science of linguistic annotation. Leaders in the field guide the reader through the process of modeling, creating an annotation language, building a corpus and evaluating it for correctness. Essential reading for both computer scientists and linguistic researchers.Linguistic annotation is an increasingly important activity in the field of computational linguistics because of its critical role in the development of language models for natural language processing applications. Part one of this book covers all phases of the linguistic annotation process, from annotation scheme design and choice of representation format through both the manual and automatic annotation process, evaluation, and iterative improvement of annotation accuracy. The second part of the book includes case studies of annotation projects across the spectrum of linguistic annotation types, including morpho-syntactic tagging, syntactic analyses, a range of semantic analyses (semantic roles, named entities, sentiment and opinion), time and event and spatial analyses, and discourse level analyses including discourse structure, co-reference, etc. Each case study addresses the various phases and processes discussed in the chapters of part one.

Corpus Linguistics And Linguistically Annotated Corpora

Author: Sandra Kuebler
Editor: Bloomsbury Publishing
ISBN: 1441119809
File Size: 24,95 MB
Format: PDF, ePub, Mobi
Read: 519
Download

Linguistically annotated corpora are becoming a central part of the corpus linguistics field. One of their main strengths is the level of searchability they offer, but with the annotation come problems of the initial complexity of queries and query tools. This book gives a full, pedagogic account of this burgeoning field. Beginning with an overview of corpus linguistics, its prerequisites and goals, the book then introduces linguistically annotated corpora. It explores the different levels of linguistic annotation, including morphological, parts of speech, syntactic, semantic and discourse-level, as well as advantages and challenges for such annotations. It covers the main annotated corpora for English, the Penn Treebank, the International Corpus of English, and OntoNotes, as well as a wide range of corpora for other languages. In its third part, search strategies required for different types of data are explored. All chapters are accompanied by exercises and by sections on further reading.

Automatic Documentation And Mathematical Linguistics

Author:
Editor:
ISBN:
File Size: 45,26 MB
Format: PDF, ePub
Read: 4306
Download


Linguistics

Author:
Editor:
ISBN:
File Size: 54,81 MB
Format: PDF, Mobi
Read: 1994
Download


Library Linked Data In The Cloud

Author: Carol Jean Godby
Editor: Morgan & Claypool Publishers
ISBN: 1627052208
File Size: 43,77 MB
Format: PDF, Mobi
Read: 871
Download

This book describes OCLC’s contributions to the transformation of the Internet from a web of documents to a Web of Data. The new Web is a growing ‘cloud’ of interconnected resources that identify the things people want to know about when they approach the Internet with an information need. The linked data architecture has achieved critical mass just as it has become clear that library standards for resource description are nearing obsolescence. Working for the world’s largest library cooperative, OCLC researchers have been active participants in the development of next generation standards for library resource description. By engaging with an international community of library and Web standards experts, they have published some of the most widely used RDF datasets representing library collections and librarianship. This book focuses on the conceptual and technical challenges involved in publishing linked data derived from traditional library metadata. This transformation is a high priority because most searches for information start not in the library, nor even in a Web-accessible library catalog, but elsewhere on the Internet. Modeling data in a form that the broader Web understands will project the value of libraries into the Digital Information Age. The exposition is aimed at librarians, archivists, computer scientists, and other professionals interested in modeling bibliographic descriptions as linked data. It aims to achieve a balanced treatment of theory, technical detail, and practical application.

Working Papers In Linguistics

Author: Ohio State University. Department of Linguistics
Editor:
ISBN:
File Size: 69,49 MB
Format: PDF, ePub
Read: 1627
Download


Encyclopedia Of Information Systems And Services

Author:
Editor:
ISBN:
File Size: 33,48 MB
Format: PDF, Mobi
Read: 2049
Download


Linguistics In The Netherlands

Author:
Editor:
ISBN:
File Size: 60,53 MB
Format: PDF, ePub
Read: 4190
Download


Cognitive And Linguistic Aspects Of Geographic Space

Author: Martin Raubal
Editor: Springer Science & Business Media
ISBN: 3642343597
File Size: 41,75 MB
Format: PDF, Docs
Read: 5806
Download

20 years ago, from July 8 to 20, 1990, 60 researchers gathered for two weeks at Castillo-Palacio Magalia in Las Navas del Marques (Avila Province, Spain) to discuss cognitive and linguistic aspects of geographic space. This meeting was the start of successful research on cognitive issues in geographic information science, produced an edited book (D. M. Mark and A. U. Frank, Eds., 1991, Cognitive and Linguistic Aspects of Geographic Space. NATO ASI Series D: Behavioural and Social Sciences 63. Kluwer, Dordrecht/Boston/London), and led to a biannual conference (COSIT), a refereed journal (Spatial Cognition and Computation), and a substantial and still growing research community. It appeared worthwhile to assess the achievements and to reconsider the research challenges twenty years later. What has changed in the age of computational ontologies and cyber-infrastructures? Consider that 1990 the web was only about to emerge and the very first laptops had just appeared! The 2010 meeting brought together many of the original participants, but was also open to others, and invited contributions from all who are researching these topics. Early-career scientists, engineers, and humanists working at the intersection of cognitive science and geographic information science were invited to help with the re-assessment of research needs and approaches. The meeting was very successful and compared the research agenda laid out in the 1990 book with achievements over the past twenty years and then turned to the future: What are the challenges today? What are worthwhile goals for basic research? What can be achieved in the next 20 years? What are the lessons learned? This edited book will assess the current state of the field through chapters by participants in the 1990 and 2010 meetings and will also document an interdisciplinary research agenda for the future.

The Computer In Literary And Linguistic Studies

Author: Alan Jones (M.A.)
Editor:
ISBN:
File Size: 12,82 MB
Format: PDF, Mobi
Read: 9597
Download


Proceedings Of The 5th Annual Conference Of The Formal Linguistics Society Of Mid America

Author: Formal Linguistics Society of Midamerica. Meeting
Editor:
ISBN:
File Size: 33,26 MB
Format: PDF, ePub, Mobi
Read: 5926
Download


Mathematical Linguistics And Automatic Translation

Author:
Editor:
ISBN:
File Size: 56,43 MB
Format: PDF, ePub
Read: 2501
Download


Towards The Multilingual Semantic Web

Author: Paul Buitelaar
Editor: Springer
ISBN: 3662435853
File Size: 10,97 MB
Format: PDF, ePub, Docs
Read: 5403
Download

To date, the relation between multilingualism and the Semantic Web has not yet received enough attention in the research community. One major challenge for the Semantic Web community is to develop architectures, frameworks and systems that can help in overcoming national and language barriers, facilitating equal access to information produced in different cultures and languages. As such, this volume aims at documenting the state-of-the-art with regard to the vision of a Multilingual Semantic Web, in which semantic information will be accessible in and across multiple languages. The Multilingual Semantic Web as envisioned in this volume will support the following functionalities: (1) responding to information needs in any language with regard to semantically structured data available on the Semantic Web and Linked Open Data (LOD) cloud, (2) verbalizing and accessing semantically structured data, ontologies or other conceptualizations in multiple languages, (3) harmonizing, integrating, aggregating, comparing and repurposing semantically structured data across languages and (4) aligning and reconciling ontologies or other conceptualizations across languages. The volume is divided into three main sections: Principles, Methods and Applications. The section on “Principles” discusses models, architectures and methodologies that enrich the current Semantic Web architecture with features necessary to handle multiple languages. The section on “Methods” describes algorithms and approaches for solving key issues related to the construction of the Multilingual Semantic Web. The section on “Applications” describes the use of Multilingual Semantic Web based approaches in the context of several application domains. This volume is essential reading for all academic and industrial researchers who want to embark on this new research field at the intersection of various research topics, including the Semantic Web, Linked Data, natural language processing, computational linguistics, terminology and information retrieval. It will also be of great interest to practitioners who are interested in re-examining their existing infrastructure and methodologies for handling multiple languages in Web applications or information retrieval systems.

Documentation Abstracts

Author:
Editor:
ISBN:
File Size: 18,93 MB
Format: PDF, Mobi
Read: 7065
Download


Publishing And Consuming Linked Data

Author: L. Rietveld
Editor: IOS Press
ISBN: 1614996237
File Size: 19,76 MB
Format: PDF, ePub, Docs
Read: 5516
Download

This dissertation addresses several problems in the context of publishing and consuming Linked Data. It describes these problems from the perspectives of three stakeholders: the Linked Data provider, developer and scientist. The Linked Data provider is faced with impractical data re-use and costly Linked Data hosting solutions. Developers face difficulties in finding, navigating and using Linked Datasets. Scientists lack the resources and methods to evaluate their work on Linked Data at large. This dissertation presents a number of novel approaches that address these issues, such as: - The LOD Laundromat: a centralized service that re-publishes cleaned, queryable and structurally annotated Linked Datasets. In 2015 the Laundromat was awarded first prize in the Dutch national Linked Open Data competition, and third prize in the European equivalent; - SampLD: A relevance-based sampling algorithm that enables publishers to decrease Linked Data hosting costs; - YASGUI: A feature-rich query editor for accessing SPARQL endpoints; - LOD Lab: An evaluation paradigm that enables scientists to increase the breadth and scale of their Linked Data evaluations. This work provides a unique overview of problems related to publishing and consuming Linked Data. The novel approaches presented here improve the state-of-the-art for Linked Data publishers, developers and scientists, and are a step towards a web of Linked Data that is more accessible and technically scalable.