Diagnostics to Assess the Effects of Text Preprocessing Decisions


[Up] [Top]

Documentation for package ‘preText’ version 0.6.2

Help Pages

preText-package preText: Diagnostics to Assess The Effects of Text Preprocessing Decisions
calculate_prediction_errors Calculate mean prediction error for preprocessing decisions.
dfm_scaling_test Comparison of dfms using N-dimensional scaling, with a test for difference from the mean dfm scaled position.
document_position_plots Document Position Plots
factorial_preprocessing A function to perform factorial preprocessing of a corpus of texts into quanteda document-frequency matrices.
mantel_comparison Ensemble Mantel Tests
mantel_comparison_to_base Ensemble Mantel Tests
optimal_k_comparison Optimal Topic Model k Comparison
preprocessing_choice_regression Preprocessing Choice Regressions
preText preText: Diagnostics to Assess The Effects of Text Preprocessing Decisions
preText_score_plot preText specification plot
preText_test preText Test
regression_coefficient_plot Regression Coefficient Plot
remove_infrequent_terms Remove infrequently occurring terms from quanteda dfm.
scaling_comparison Scaling Comparison.
topic_key_term_plot Plot Prevalence of Topic Key Terms
topic_novelty_score Topic Top-Terms Novelty Score
UK_Manifestos Full text of 69 UK party manifestos from 1918-2001.
wordfish_comparison Wordfish Comparison.
wordfish_rank_plot Plot of Wordfish rankings of documents