Skip to main content

Table 4 Speech corpus used to evaluate the automatic phonetic aligners

From: Free resources for forced phonetic alignment in Brazilian Portuguese based on Kaldi toolkit

Dataset

Duration

# Files

# Words

# Tokens

Male

7 m:58 s (7 m:40 s)

200 (193)

1260 (665)

5275

Female

7 m:34 s (7 m:18 s)

199 (192)

1258 (664)

5262

Total

15 m:32 s (14 m:58 s)

399 (385)

2518 (686)

10,537

  1. Actual duration and number of files after discard are shown between parentheses, as well as the number of unique words