Tool for the Automatic Analysis of Lexical Sophistication (TAALES)

  • TAALES is a tool that measures over 400 classic and new indices of lexical sophistication, and includes indices related to a wide range of sub-constructs. TAALES indices have been used to inform models of second language (L2) speaking proficiency, first language (L1) and L2 writing proficiency, spoken and written lexical proficiency, genre differences, and satirical language. The tool was developed by Kristopher Kyle.
  • This page was contributed by Karla Csuros, Jacob Dirkx, Zeinab Rahimi, and Hakyung Sung (listed in alphabetical order) from the LCR-ADS lab at the University of Oregon. When constructing this documentation page, we carefully followed the original index description spreadsheet, which you can find on the official webpage.

How to read the documentation

Map
│
├── User manual
│   ├── Getting started
│   ├── Input
|   ├── Output
|   ├── Indices and options
|   ├── Diagnostics
│   │   ├── Index coverage diagnostics
│   │   ├── Individual item diagnostics   
│   └── Citations
│
├── Sub-constructs
│   ├── [Sub-construct 1]
│   │   ├── Definition
│   │   ├── Methodology
│   │   ├── Corpus used
│   │   ├── Calculated indices
│   │   │   ├── Index 1
│   │   │   ├── Index 2
│   │   │   └── ...
│   │   └── References
│   │
│   ├── [Sub-construct name 2]
│   │   ├── Definition
│   │   ├── Methodology
│   │   ├── Corpus Used
│   │   ├── Calculated indices
│   │   └── References
│   │
│   └── ...
│
└── Appendix
    ├── Full list of indices
    └── Other resources

Quick overview of the sub-constructs