|
|
| WebCorp | Advanced | Wordlist Generator | Guide | Publications | Feedback |
|
|
How is WebCorp different from search engines? WebCorp actually visits each one of these pages, extracting concordance
lines from them. Although some search engines, such as Google, do give Key
Word in Context style output for some of the URLs in the results list,
this is not true for all of the URLs and not all instances of the search
term on each page are given in these short extracts. It may be the case
that the search term occurs many times on a given page, but a Google-user
could not know this without clicking on each of the links manually. Google
is an excellent search engine but it is not designed as a corpus
linguistics tool and is not ideal for this purpose. WebCorp contains
options (customisable concordance span, output format, etc) specifically
designed for linguistic research.
Why is WebCorp slow to return results? In order to conduct a full linguistic analysis of how a particular word
or phrase is used on the Web, the alternative to using WebCorp would be to
use a search engine to find a list of pages containing the word or phrase,
and then to access each of the URLs in this list manually, locate each of
the examples of the word/phrase on the page and copy these into a file.
WebCorp automates this whole process, which is why it is slower
than a standard search engine. It is still a vast time-saver over the
equivalent manual process.
|
|
© 1999-2008 Research and Development Unit for English Studies Privacy Policy |