Frequent frames analysis pipeline

Details

For more information about the ACQDIV project, see: http://www.acqdiv.uzh.ch/. The research leading to these results has received funding from the European Union’s Seventh Framework Programme (FP7/2007-2013) under grant agreement no 615988 (PI Sabine Stoll).

Pipeline

Generate the bigrams

Run the script: bigrams.py to generate the bigram frequencies. Note this make take a long time.

Generate the trigram counts for each analysis

For words, morphemes, dyads, etc. This generates counts for the frequency calculation of the frames and for operationalization.

Call: python3 utils.py

Make sure to set the words or morphemes in the script!

For global recall (DB) get the word types:

Call: sh get_types.sh

Run the frames processing and PR analysis

This step requires that the bigrams are calculated and pickled first. Then run the frames analysis with frames.py. The script outputs a TSV file to load into R.

Make sure to set words or morphemes (pos or glosses) in the script!

Statistical analysis with R scripts

The frames-functions.R file contains various functions for processing and plotting the frequent frames analyses. The words, morphemes and dyads analyses are in words.R, morpheme-gloss-pos.R and dyads.R.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Frequent frames analysis pipeline

Details

Pipeline

About

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
bigrams		bigrams
data		data
README.md		README.md
bigrams.py		bigrams.py
frames-functions.R		frames-functions.R
frames.py		frames.py
get_types.sh		get_types.sh
utils.py		utils.py

acqdiv/frequent-frames

Folders and files

Latest commit

History

Repository files navigation

Frequent frames analysis pipeline

Details

Pipeline

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages