The package rticulate
facilitates UTI data import and plotting with spline data exported from Articulate Assistant Advanced (AAA). To use the package, load it as usual.
library(rticulate)
The function read_aaa()
can quickly import spline data and transform it into the long format (where each observation is a point on a fan line and the coordinates values are two variables, X
and Y
, see ?tongue
for more details).
To correctly import AAA data, it is required that the file exported from AAA does not contain the header. This must be supplied as an argument to the read_aaa()
function. We thus create a character vector with a concatenation of column names as strings.
<- c(
columns "speaker",
"seconds",
"rec_date",
"prompt",
"label",
"TT_displacement",
"TT_velocity",
"TT_abs_velocity",
"TD_displacement",
"TD_velocity",
"TD_abs_velocity"
)
Now we can use read_aaa()
to import the spline data as a tibble. The function requires a string with the file path and name, and a vector with the names of the columns.
# system.file() is needed here because the example files reside in the package.
# You can just include the file path directly in read_aaa, like
# read_aaa("~/Desktop/splines.tsv", columns)
<- system.file("extdata", "it01.tsv", package = "rticulate")
file_path
<- read_aaa(file_path, columns) tongue
##
## ── Column specification ────────────────────────────────────────────────────────
## cols(
## .default = col_double(),
## speaker = col_character(),
## rec_date = col_character(),
## prompt = col_character(),
## label = col_character()
## )
## ℹ Use `spec()` for the full column specifications.
To check the head of the tibble, just do:
tongue
## # A tibble: 462 x 15
## speaker seconds rec_date prompt label TT_displacement TT_velocity
## <chr> <dbl> <chr> <chr> <chr> <dbl> <dbl>
## 1 it01 1.20 29/11/2… Dico … max_… 77.9 -7.73
## 2 it01 1.20 29/11/2… Dico … max_… 77.9 -7.73
## 3 it01 1.20 29/11/2… Dico … max_… 77.9 -7.73
## 4 it01 1.20 29/11/2… Dico … max_… 77.9 -7.73
## 5 it01 1.20 29/11/2… Dico … max_… 77.9 -7.73
## 6 it01 1.20 29/11/2… Dico … max_… 77.9 -7.73
## 7 it01 1.20 29/11/2… Dico … max_… 77.9 -7.73
## 8 it01 1.20 29/11/2… Dico … max_… 77.9 -7.73
## 9 it01 1.20 29/11/2… Dico … max_… 77.9 -7.73
## 10 it01 1.20 29/11/2… Dico … max_… 77.9 -7.73
## # … with 452 more rows, and 8 more variables: TT_abs_velocity <dbl>,
## # TD_displacement <dbl>, TD_velocity <dbl>, TD_abs_velocity <dbl>,
## # fan_line <int>, X <dbl>, Y <dbl>, .index <int>
Sometimes is useful to add extra information for each prompt (like vowel, consonant place, phonation, etc.). We can do so by using functions from the dplyr
package (word()
is from the stringr
package).
<- read_csv(system.file("extdata", "stimuli.csv", package = "rticulate")) stimuli
##
## ── Column specification ────────────────────────────────────────────────────────
## cols(
## item = col_double(),
## word = col_character(),
## ipa = col_character(),
## c1 = col_character(),
## c1_phonation = col_character(),
## vowel = col_character(),
## anteropost = col_character(),
## height = col_character(),
## c2 = col_character(),
## c2_phonation = col_character(),
## c2_place = col_character()
## )
<- mutate(tongue, word = stringr::word(prompt, 2)) %>%
tongue left_join(y = stimuli) %>%
mutate_if(is.character, as.factor)
## Joining, by = "word"
Let’s check tongue
again.
tongue
## # A tibble: 462 x 26
## speaker seconds rec_date prompt label TT_displacement TT_velocity
## <fct> <dbl> <fct> <fct> <fct> <dbl> <dbl>
## 1 it01 1.20 29/11/2… Dico … max_… 77.9 -7.73
## 2 it01 1.20 29/11/2… Dico … max_… 77.9 -7.73
## 3 it01 1.20 29/11/2… Dico … max_… 77.9 -7.73
## 4 it01 1.20 29/11/2… Dico … max_… 77.9 -7.73
## 5 it01 1.20 29/11/2… Dico … max_… 77.9 -7.73
## 6 it01 1.20 29/11/2… Dico … max_… 77.9 -7.73
## 7 it01 1.20 29/11/2… Dico … max_… 77.9 -7.73
## 8 it01 1.20 29/11/2… Dico … max_… 77.9 -7.73
## 9 it01 1.20 29/11/2… Dico … max_… 77.9 -7.73
## 10 it01 1.20 29/11/2… Dico … max_… 77.9 -7.73
## # … with 452 more rows, and 19 more variables: TT_abs_velocity <dbl>,
## # TD_displacement <dbl>, TD_velocity <dbl>, TD_abs_velocity <dbl>,
## # fan_line <int>, X <dbl>, Y <dbl>, .index <int>, word <fct>, item <dbl>,
## # ipa <fct>, c1 <fct>, c1_phonation <fct>, vowel <fct>, anteropost <fct>,
## # height <fct>, c2 <fct>, c2_phonation <fct>, c2_place <fct>
To plot splines from a spline data frame, use plot_tongue()
. This function is a wrapper of a ggplot
call (from the ggplot2
package). The coordinates must be in two variables named X
and Y
. read_aaa()
creates them automatically while importing the raw data.
plot_tongue(tongue)
## `geom_smooth()` using formula 'y ~ x'
You can also easily filter the tibble with the filter()
function from dplyr
.
filter(tongue, label == "max_TD") %>%
plot_tongue()
## `geom_smooth()` using formula 'y ~ x'
You can specify geom
options and aesthetics in the usual ggplot
way (remember to load the package with library(ggplot2)
if you need this). geom
options are arguments of plot_tongue()
, while aesthetics can be called with aes()
.
plot_tongue(tongue, alpha = 0.5) +
aes(group = rec_date, colour = c2_place) +
theme(legend.position = "bottom")
## `geom_smooth()` using formula 'y ~ x'
To plot points instead of splines, use plot_tongue(geom = "point")
.
plot_tongue(tongue, geom = "point", alpha = 0.5) +
aes(group = rec_date, colour = c2_place) +
theme(legend.position = "bottom")
Finally, if you want to plot the palate profile, you can do so by specifying a data frame with the coordinates for the palate spline.
<- read_aaa(system.file("extdata", "it01-palate.tsv", package = "rticulate"), columns) palate
##
## ── Column specification ────────────────────────────────────────────────────────
## cols(
## .default = col_double(),
## speaker = col_character(),
## rec_date = col_character(),
## prompt = col_character(),
## label = col_logical(),
## X_1 = col_logical(),
## Y_1 = col_logical(),
## X_2 = col_logical(),
## Y_2 = col_logical(),
## X_3 = col_logical(),
## Y_3 = col_logical(),
## X_4 = col_logical(),
## Y_4 = col_logical(),
## X_5 = col_logical(),
## Y_5 = col_logical(),
## X_6 = col_logical(),
## Y_6 = col_logical(),
## X_7 = col_logical(),
## Y_7 = col_logical(),
## X_8 = col_logical(),
## Y_8 = col_logical()
## # ... with 20 more columns
## )
## ℹ Use `spec()` for the full column specifications.
filter(tongue, label == "max_TD") %>%
plot_tongue(palate = palate, alpha = 0.5) + aes(group = rec_date)
## `geom_smooth()` using formula 'y ~ x'
## `geom_smooth()` using formula 'y ~ x'
## Warning: Removed 18 rows containing non-finite values (stat_smooth).
To import multiple files with AAA data, simply use a list of paths with read_aaa
, for example using list.files
.
<- list.files(
tongue2 path = system.file("extdata", package = "rticulate"),
pattern = "*\\d.tsv",
full.names = TRUE
%>%
) read_aaa(., columns)
##
## ── Column specification ────────────────────────────────────────────────────────
## cols(
## .default = col_double(),
## speaker = col_character(),
## rec_date = col_character(),
## prompt = col_character(),
## label = col_character()
## )
## ℹ Use `spec()` for the full column specifications.
##
##
## ── Column specification ────────────────────────────────────────────────────────
## cols(
## .default = col_double(),
## speaker = col_character(),
## rec_date = col_character(),
## prompt = col_character(),
## label = col_character()
## )
## ℹ Use `spec()` for the full column specifications.
We can now plot splines for both speakers.
plot_tongue(tongue2, alpha = 0.5) +
aes(group = rec_date) +
facet_grid(. ~ speaker)
## `geom_smooth()` using formula 'y ~ x'