 

 
Advanced Options: Collocation

The Advanced Interface provides several options for analysing the collocates of your search term, that is, the words that appear most frequently in its vicinity in texts:

[Figure: WebCorp Collocation Options]

  • External Collocates: When selected, this option outputs a table of frequencies for words in the four positions to the left and to the right of the search term.
     
  • Internal Collocates: If this option is selected and your search term is a pattern containing wildcards, a table is returned containing the most frequent words occupying those wildcard positions. The wildcard positions are numbered sequentially from left to right as they appear in the search term.
     
  • Exclude Stopwords: This option filters out high-frequency words (e.g. the, of) from the collocate tables.
     
  • One concordance line per website: When this option is selected, WebCorp will only display the first concordance line it finds on any given web page. This is useful when the analysis involves pages containing repeated text that may skew collocation frequencies (e.g. song lyrics, or discussion boards where replies quote the message to which they are replying).
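
The options above can be illustrated with a minimal Python sketch. This is not WebCorp's implementation: the function names, the tiny stopword list, the tokenizer and the assumption that each `*` wildcard stands for exactly one word are all hypothetical, chosen only to show how the tables might be built and how the two filtering options change the counts.

```python
import re
from collections import Counter

# Hypothetical stopword list for illustration; WebCorp's actual list is not given here.
STOPWORDS = {"the", "of", "a", "an", "and", "in", "to", "is"}


def tokenize(text):
    """Lowercase the text and split it into simple word tokens."""
    return re.findall(r"[a-z0-9']+", text.lower())


def external_collocates(pages, term, span=4, exclude_stopwords=False, one_per_page=False):
    """Count words at each position up to `span` to the left and right of `term`.

    `pages` maps a page URL to its text. The result maps a position label
    ("L4".."L1", "R1".."R4") to a Counter of the words seen in that position.
    """
    table = {f"L{i}": Counter() for i in range(span, 0, -1)}
    table.update({f"R{i}": Counter() for i in range(1, span + 1)})
    for text in pages.values():
        tokens = tokenize(text)
        for idx, tok in enumerate(tokens):
            if tok != term:
                continue
            for offset in range(1, span + 1):
                for label, pos in ((f"L{offset}", idx - offset), (f"R{offset}", idx + offset)):
                    if 0 <= pos < len(tokens):
                        word = tokens[pos]
                        if exclude_stopwords and word in STOPWORDS:
                            continue
                        table[label][word] += 1
            if one_per_page:
                break  # keep only the first hit found on this page
    return table


def internal_collocates(pages, pattern):
    """Count the words filling each `*` slot in `pattern`.

    Slots are numbered from 1, left to right. Assumes each `*` matches one word.
    """
    parts = pattern.lower().split()
    slots = [i for i, part in enumerate(parts) if part == "*"]
    table = {n: Counter() for n in range(1, len(slots) + 1)}
    for text in pages.values():
        tokens = tokenize(text)
        for start in range(len(tokens) - len(parts) + 1):
            window = tokens[start:start + len(parts)]
            if all(p == "*" or p == w for p, w in zip(parts, window)):
                for n, i in enumerate(slots, start=1):
                    table[n][window[i]] += 1
    return table


# Made-up sample pages: the first repeats its text, as song lyrics might.
pages = {
    "http://example.org/a": "The cat sat on the mat. The cat sat on the mat.",
    "http://example.org/b": "A black cat sat quietly.",
}
```

With these sample pages, `external_collocates(pages, "cat")` counts the repeated sentence on the first page twice, while passing `one_per_page=True` counts only the first hit per page, which is the skew the last option is meant to avoid. Calling `internal_collocates(pages, "the * sat on the *")` returns one frequency table per wildcard position.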


 

 © 1999-2008 Research and Development Unit for English Studies