cordial
All words
All lemmas
This action may take several minutes for large corpora. Please wait.
Word list options
Subcorpus:
Search attribute:
word
lemma
pos
doc.tipologia
doc.id
use n-grams. Value of n: from (2, 3, 4, 5, 6) to (2, 3, 4, 5, 6)
hide/nest sub-n-grams
Filter options:
Filter word list by:
Regular expression:
Minimum frequency:
Maximum frequency:
(0 = no maximum frequency)
Whitelist:
Blacklist:
format
Word list whitelists and blacklists must be plain text (.txt) files, encoded in UTF-8, with one item per line. The items must correspond to the selected attribute: for example, if 'lemma' is selected from the attribute menu, the list should be a list of lemmas. File input uses exact matching, not regular-expression matching.
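The file format described above can be illustrated with a minimal Python sketch. It writes a list as UTF-8 plain text with one item per line, reads it back, and applies it to a frequency table using exact string comparison. The function names (`write_item_list`, `read_item_list`, `apply_lists`) and the `counts` dictionary are hypothetical and exist only for this illustration; this is not Sketch Engine's own code, and how the tool treats blank lines or surrounding whitespace is an assumption here.

```python
from pathlib import Path


def write_item_list(path, items):
    """Write items as UTF-8 plain text, one item per line."""
    Path(path).write_text("\n".join(items) + "\n", encoding="utf-8")


def read_item_list(path):
    """Read a list file into a set. Skipping blank lines is an assumption."""
    text = Path(path).read_text(encoding="utf-8")
    return {line.strip() for line in text.splitlines() if line.strip()}


def apply_lists(counts, whitelist=None, blacklist=None):
    """Filter an item -> frequency mapping by exact match only.

    An entry such as "c.*" is compared as a literal string,
    never interpreted as a regular expression.
    """
    kept = {}
    for item, freq in counts.items():
        if whitelist is not None and item not in whitelist:
            continue
        if blacklist is not None and item in blacklist:
            continue
        kept[item] = freq
    return kept
```

For instance, with 'lemma' selected as the attribute, a whitelist file containing the lines `casa` and `ser` would keep only those two lemmas; an entry like `casas` in the frequency table would be dropped because it is not an exact match.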
Include non-words
Output options:
Frequency figures:
Hit counts
Document counts
ARF
Output type:
Simple
Keywords
Reference (sub)corpus
C-Or-DiAL
(whole corpus)
Prefer:
rare words
common words
Change output attribute(s)
Three selectors, each with the options: ---, word, lemma, pos
You can select one or more output attributes. Please note that this option can be time-consuming.