Semantic Metadata

What is Metadata

  • Metadata is information about data
  • Metadata is typically
    • Manually provided
    • Often missing
  • Metadata can be automatically captured
    • By a sensor or instrument
    • By a workflow system

Uses of Metadata

  • Facilitate reuse by others
  • Support queries on data repositories
  • Explain the context for the data in terms of how it was collected or generated
  • Enable automated data integration

Types of Metadata

  • Data characteristics: Size, statistical properties
  • Descriptive metadata: Location, collection, frequency
  • Provenance metadata: What workflow was used, what its components were, what the input data was

Metadata Vocabulary

  • A metadata vocabulary is the set of the terms used to describe metadata

  • A metadata standard is a vocabulary that is agreed upon by a community and are adopted for metadata

    • Based on its broad applicability and how well it supports uses of the metadata

Metadata from general to specific

Representing Metadata

  • Metadata captures knowledge about objects in the domain of interest

  • It is important to learn computational concepts for knowledge representation

  • These representations are key to communicate important expertise to collaborating data scientists and computer scientists

Knowledge Representation

  • Knowledge is a set of beliefs held by an agent that determine its behavior
  • Knowledge representation is a field of artificial intelligence devoted to developing and implementing computer languages to capture knowledge

Meta-Knowledge

Descriptive Knowledge


- Note: Properties is about relationship

Knowledge Systems

  • Knowledge systems use a knowledge base of beliefs to generate their behavior
  • Behavior changes when new beliefs are added
    • If they exhibit wrong behaviors/answer, beliefs can be changed to fix them
  • They reason: use logical inference over their beliefs to answer queries/deduce new beliefs
    • uses a logic system to reason over the beliefs/data in the knowledge base
  • Knowledge system uses knowledge base that contains the beliefs, and then uses a logic system (such as knowledge representation system, like a frame system) to reason over beliefs

Reasoning

  • Reasoning is done over symbols that make up the beliefs much like calculations are done over numbers
  • Reasoning uses a logic system to do inference: a system of general logic rules to deduce new beliefs from initial beliefs in a knowledge base
  • Natural deduction is an example of a logic system

Knowledge base

  • Symbol are labels that can be used to refer to entities in the world
  • Several symbols may exists for the same object
  • A knowledge representation language specifies
    • Symbols and a notation ofr how to combine the symbols to represent beliefs
  • A knowledge base is a set of beliefs and can be used by a knowledge system to generate its behavior

Knowledge Representation System(KRS)

  • A knowledge representation system is a logic system that has three components:
    • Knowledge representation language: what symbols can be used an how to combine them
    • Logic rules: how can the system infer new beliefs given initial beliefs
    • Reasoning algorithm: how teh system uses the language and the logic rules

An example of KRS: Frame Systems

  • A frame system is a kind of knowledge representation system for descriptive knowledge
  • Each class is represented as a frame that captures is main characteristics

Ontology

  • An ontology contains descriptive knowledge about entities of interest in a domain
    • A shared conceptualization of the world
  • An ontology is typically a collection of generic frames
  • Vs. a vocabulary, which is just a set of terms but not organizing them within a hierarchy
  • Ontology is a “generic frame only” knowledge representation system
  • Ontologies thus have the 3 components and not associated instances/data

Ontology vs. Instances

Knowledge Representation System: Important Characteristics

  • Decidability
    • Undecidable if it may never return an answer
  • Expressivity
    • What its formal language can represent
  • Soundness
    • Whether the logic works as intended
  • Computational complexity
    • How much computation is required to get an answer
  • Explainability
    • Whether an understandable proof/explanation is generated

OWL:The Web Ontology Language

  • A standard language for the web (like HTML), specifically for authoring ontologies
  • Knowledge representation language used to author ontologies - ‘generic frame only’ knowledge representation system - for the web
    • Represents descriptive knowledge on the web
    • Can result in a more elaborate version of a frame system
  • Provides for:
    • Language
    • Rules
    • Reasoning algorithm

Description Logics

  • OWL as description logics:
    • Knowledge representation language
    • Descriptive knowledge
  • Description logics - language designed for knowledge representation and reasoning for descriptive knowledge