PolyglotDB
  • Introduction
  • Getting started
  • Tutorial
  • Interacting with a local Polyglot database
  • Importing corpora
  • Enrichment
    • Subset enrichment
    • Creating syllable units
    • Creating utterance units
    • Hierarchical enrichment
    • Enrichment via CSV files
    • Enrichment via queries
    • Acoustic measures
    • Subannotation enrichment
  • Querying corpora
  • Developer documentation
  • Changelog
PolyglotDB
  • Enrichment
  • View page source

Enrichment

Following import, the corpus is often fairly bare, with just word and phone annotations. An important step in analyzing corpora is therefore enriching it with other information. Most of the methods here are automatic once a function is called.

Contents:

  • Subset enrichment
  • Creating syllable units
    • Encoding syllabic segments
    • Encoding syllables
    • Encoding syllable properties from syllabics
  • Creating utterance units
    • Encoding non-speech elements
    • Encoding utterances
  • Hierarchical enrichment
    • Encode count
    • Encode rate
    • Encode position
  • Enrichment via CSV files
    • Enriching the lexicon
    • Enriching the phonological inventory
    • Enriching speaker information
    • Enriching discourse information
    • Enriching arbitrary tokens
  • Enrichment via queries
  • Acoustic measures
    • Encoding acoustic measures
    • Querying acoustic measures
  • Subannotation enrichment
Previous Next

© Copyright 2015-2024, Montreal Corpus Tools.

Built with Sphinx using a theme provided by Read the Docs.