brown_metadata {tlda}R Documentation

Text metadata for Brown corpora

Description

This dataset provides metadata for the text files in the Brown family of corpora. It maps standardized file names to the textual categories genre and subgenre.

Usage

brown_metadata

Format

brown_metadata

A data frame with 500 rows and 3 columns:

text_file

Standardized name of the text file (e.g. "A01", "J58", "R07")

macro_genre

4 macro genres ("press", "general_prose", "learned", "fiction")

genre

15 genres (e.g. "press_editorial", "popular_lore", "adventure_western_fiction"))

Source

McEnery, Tony & Andrew Hardie. 2012. Corpus linguistics. Cambridge: Cambridge University Press.


[Package tlda version 0.1.0 Index]