WebCorp: linguistic search engine to treat the web as corpus

WebCorp Advanced Wordlist Generator Guide Publications   Feedback
   


NEWS: Significant improvements in the speed of WebCorp and range of processing options available, as it becomes part of a fully-tailored linguistic search engine... [more]

Search term:

Enter a word, phrase (no quotes necessary) or pattern

See the Guide for an explanation of the options
Search Engine:
Case Options:
Output Format:
Web Addresses (URLs):
Concordance Span:
word(s) to left and right (max 50)
OR
Full sentences?
Number of Concordance Lines:
Site Domain:

(Works with Google and AltaVista only)
Leave blank to search the whole web.

For a specific domain search enter a URL (without the http://) - e.g. www.nytimes.com
or part of a URL - e.g. ac.uk for all UK academic institutions.
Use OR to specify multiple domains (Google only).


Newspaper Domains:

This will override any domain set above
Textual Domain:

Select Open Directory category
Word Filter:

Include extra words which must or must not appear on the same web page as the search term.
Use the minus sign (-) to exclude words;
e.g. for the search term 'plant' you may specify leaf -nuclear as a filter, to restrict the range of senses retrieved.

Pages Last Modified:

OR
Between and (dd/mm/yy)

Collocation:
External Collocates    Internal Collocates (for phrase internal search)    Exclude Stopwords
One concordance line per web site
Exclude link text
Exclude wildcard match to e-mail address
Send Results by Email: 
Option temporarily unavailable



Note : To avoid wrap-around text on large span searches, set your display for small fonts.

By using the WebCorp tools you are agreeing to be bound by the Terms of Use.

   

 © 1999-2008 Research and Development Unit for English Studies   Privacy Policy