The many electronic text corpora available nowadays present ever fewer obstacles to a wide range of corpus linguistic study. However, corpora are expensive resources to create and to update, and problems remain for linguists seeking access to very large, very recent, or changing language. The World Wide Web, whilst intended as an information source, is an obvious resource for the retrieval of linguistic information, being the largest store of texts in existence, freely available, covering a range of domains, and constantly added to and updated. Individual linguistic researchers have been trying to retrieve instances of rare or neologistic language use from the web by manipulating existing web search engines. Whilst this strategy is possible, in particular via Google, the output is rather haphazard and not linguist-friendly. The Research and Development Unit for English Studies has been seeking to remedy the situation through the creation of 'WebCorp', a tool designed to search the Internet and provide on-line tailored access to linguists. A demonstration tool is available at
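The core of the web concordancing described above is keyword-in-context (KWIC) output. The sketch below is illustrative only: the `kwic` helper is a hypothetical name, and it operates on an in-memory string rather than on live web pages, which a tool like WebCorp would fetch and clean first.

```python
import re

def kwic(text, keyword, window=40):
    """Produce keyword-in-context (KWIC) lines for every match of `keyword`."""
    lines = []
    for m in re.finditer(r'\b' + re.escape(keyword) + r'\b', text, re.IGNORECASE):
        left = text[max(0, m.start() - window):m.start()]
        right = text[m.end():m.end() + window]
        # Right-align the left context so the keywords line up in a column.
        lines.append(f"{left:>{window}} [{m.group(0)}] {right}")
    return lines

sample = ("The Web is the largest store of texts in existence. "
          "Linguists mine the Web for rare or neologistic language use.")
for line in kwic(sample, "web"):
    print(line)
```

A real web concordancer would add HTML stripping, sentence boundary detection, and result pagination on top of this basic alignment step.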
This paper revisits the question of whether it is possible to use commercial web search tools such as the Google interface for meaningful corpus research, given recent advances in search technology. It is argued that, with the proper methodology, these web tools can and should be used for corpus research, since they offer considerable advantages over both closed corpora and web-based linguistic search tools.
Language Resources and Evaluation, 2010
In 2003, the special issue of Computational Linguistics (September, 29, 3) dedicated to the Web as corpus, edited by Adam Kilgarriff and Gregory Grefenstette, was a landmark event for a promising field of study. Today, this book makes for a fine update, even if it is more limited in scope than its predecessor and less recent in its content than its date of publication would lead one to believe. The articles included are in fact partially "based on papers presented at the symposium Corpus linguistics-Perspectives for the Future held (…) in Heidelberg in October 2004" (p. 4). However, the editors state that some of the articles were commissioned later, and many of the texts have in fact been brought up to date to take recent developments into account. As for the book itself, it is divided into two parts of nearly equal length. The first includes articles with a "methodological" bent, while the second is made up of seven papers dealing with different corpus-linguistic approaches to English. In this regard, it is worth pointing out that, in spite of the wealth of corpus linguistics work on other languages, and the fact that the book's contributions come mostly from scholars based in continental Europe, no language other than English is dealt with at any length in a book named simply "Corpus Linguistics". This second half of the collection is therefore mainly of interest to students of English, though in a sense it nicely complements some of the state-of-the-art summaries in the first section, discussing practical applications of the tools described there. One good example concerns using Google as a research tool. After several years of trials and testing, it is now clear that Google search results can vary by up to an order of magnitude, even for frequent words or features (see for example the contribution by William H. Fletcher in this volume, pp. 25-45, "Concordancing the web: promise and problems, tools and techniques", and in particular p. 37, drawing
Computational linguistics, 2003
The Web, teeming as it is with language data, of all manner of varieties and languages, in vast quantity and freely available, is a fabulous linguists' playground. This special issue of Computational Linguistics explores ways in which this dream is being realised.
In this paper, we provide an overview of the new GloWbE corpus – the Corpus of Global Web-Based English. GloWbE is based on 1.9 billion words in 1.8 million web pages from 20 different English-speaking countries. Approximately 60% of the corpus comes from informal blogs, and the rest from a wide range of other genres and text types. Because of its large size, as well as because of its architecture and interface, the corpus can be used to examine many types of variation among dialects, which might not be possible with other corpora – including variation in lexis, morphology, (medium- and low-frequency) syntactic constructions, variation in meaning, as well as discourse and its relationship to culture.
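The dialect comparisons a corpus like GloWbE enables amount, in the simplest case, to comparing normalised frequencies of lexical variants across national sub-corpora. The sketch below uses invented toy data standing in for GloWbE's national sections; the `per_million` helper is a hypothetical name.

```python
# Toy sub-corpora standing in for GloWbE's national sections
# (illustrative data, not real GloWbE counts).
corpora = {
    "GB": "I put the shopping in the boot of the car near the lorry".split(),
    "US": "I put the groceries in the trunk of the car near the truck".split(),
}

def per_million(tokens, word):
    """Frequency of `word`, normalised per million tokens so that
    sub-corpora of different sizes are comparable."""
    return tokens.count(word) / len(tokens) * 1_000_000

for variant in ("boot", "trunk"):
    for dialect, tokens in corpora.items():
        print(f"{variant:6s} {dialect}: {per_million(tokens, variant):10.0f} pmw")
```

With 1.9 billion words split across 20 countries, the same normalisation step is what makes cross-dialect counts directly comparable in the real corpus interface.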
2009
Abstract: The paper systematically compares the utility, for linguists and language learners, of purpose-built text corpora and the textual resources of the World Wide Web. Different modes of access are discussed, including dedicated concordancing software, web concordancers, and universal search engines.
Literary and Linguistic Computing, 2008
Language Resources and …, 2009
This article introduces ukWaC, deWaC and itWaC, three very large corpora of English, German, and Italian built by web crawling, and describes the methodology and tools used in their construction. The corpora contain more than a billion words each, and are thus among the largest resources for the respective languages. The paper also provides an evaluation of their suitability for linguistic research, focusing on ukWaC and itWaC. A comparison in terms of lexical coverage with existing resources for the languages of interest produces encouraging results. Qualitative evaluation of ukWaC versus the British National Corpus was also conducted, so as to highlight differences in corpus composition (text types and subject matters). The article concludes with practical information about format and availability of corpora and tools.
2003
Language scientists and technologists are increasingly turning to the web as a source of language data, because other resources are not large enough, because they do not contain the types of language the researcher is interested in, or simply because it is free and instantly available. The default means of access to the web is through a search engine such as Google. While the web search engines are dazzlingly efficient pieces of technology and excellent at the task they set themselves, for the linguist they are frustrating.
Transactions of the Philological Society, 1983
… del lenguaje Natural, 2008
Georg Rehm, Oliver Schonefeld, Andreas Witt. SFB 441 Linguistic Data Structures, University of Tübingen, Nauklerstrasse 35, 72074 Tübingen, Germany
2010
This report describes the development of the information-analytical system "Manuscript", designed for preparing electronic publications of medieval documents on the Internet (project portal: http://manuscripts.ru/index_en.html), together with a technique for applying the electronic corpus to historical-linguistic research. Special system modules interacting with the full-text database support the entire cycle of work on preparing an Internet edition, including its annotation and linguistic markup. Particular attention is paid to the system's facilities for formulating search queries and visualising retrieval results. The query criteria, the various ways of ordering retrieval units based on the meta-markup of manuscripts and texts, the annotation of their fragments, and the word-by-word parallel analysis of contexts help the user to obtain material for linguistic and linguistic-text...
International Journal of Arts Humanities and Social Sciences Studies, 2021
The introduction of the Internet and the emergence of freely available web resources have facilitated endless efforts by language teachers to do research on language patterns. Web resources in general, and corpus tools in particular, have enabled large samples of language to be explored for better insights into the nature of language in use, in all its forms and uses. While corpora are normally assumed to be in the hands of lexicographers, whose job is to inform dictionaries or grammar books, arguments have arisen around why end-users such as language teachers and learners cannot make use of these innovative tools. This paper adds to this ongoing debate by discussing approaches to using corpora as a reference point for language teaching and research. It shows the potential of common data-driven web tools for research on language patterns. It also explores pathways for language teachers to examine aspects of language in use through authentic texts accessed via corpus tools. Results from corpus searches in COCA and the BNC, and generalisations drawn from the data (in this case on noun and verb patterns and grammatical metaphor), further showcase the inexhaustible implications of web resources for enhanced language teaching and research.
Literary and Linguistic Computing, 2010
Corpus linguistics has revolutionised our way of working in historical linguistics. The painstaking job of collecting data and manually analysing them has been made less arduous by the machine processing of corpora, which allows for quick and efficient searches. The aim of the present study is two-fold: to show how corpus linguistics has contributed to the ways in which researchers approach the study of the history of English, and to provide an overview of selected corpora available in the field. Setting aside the theoretical debate as to whether corpus linguistics should be considered merely a methodology, a branch of linguistics, or both (Taylor, 2008), it is widely acknowledged that corpus linguistics is of considerable help in any branch of linguistics, be it theoretical or applied. The use of corpora makes it possible to test hypotheses established within a specific linguistic area through the fast and reliable analysis of vast pools of material. As a result, the objective measurement of data is available to scholars, who can thus verify their hypotheses and intuitions, and can quickly amend or qualify their research claims if previous ones are shown to be false. There is, then, a continuous interaction between theory, as expressed in linguistic postulates, concepts and hypotheses, and the application and validation of these theoretical principles through linguistic corpus analysis. The use of corpora is perhaps a more powerful instrument in the field of historical linguistics than in other fields, since the absence of living informants makes judgements based on intuition unreliable, and claims have to be empirically attested using data. These data can be extracted from systematically compiled collections of machine-readable texts, called corpora. However, in considering these undeniably advantageous working tools, some caveats should be borne in mind, as will be discussed in what follows.
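As a small illustration of how machine processing eases the kind of historical searching described above: historical spelling varies, so a literal search misses variants, while a short regular expression over attested alternations recovers them. The text and the pattern below are invented for the sketch, not drawn from any real corpus.

```python
import re

# A tiny invented sample imitating Early Modern English spelling variation.
text = ("He doth knowe that the kyng dothe not know the kynge's mynde, "
        "yet every man doth know his owne minde.")

# A literal search for "know" would miss "knowe"; a pattern over the
# attested alternation (optional final -e) recovers both spellings.
pattern = re.compile(r'\bknow[e]?\b', re.IGNORECASE)
hits = pattern.findall(text)
print(hits)
```

Real historical corpora often sidestep this problem with normalised or lemmatised annotation layers, but regex search over raw text remains a common first step.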
A. Renouf & A. Kehoe (eds.) The Changing Face of Corpus Linguistics, Amsterdam: Rodopi., 2006
The WebCorp project has demonstrated how the Web may be used as a source of linguistic data. One feature of standard corpus analysis tools hitherto missing in WebCorp is the ability to filter and sort results by date. This paper discusses the dating mechanisms available on the Web and the date query facilities offered by standard Web search engines. The new date heuristics built into WebCorp are then discussed and illustrated with a case study.
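A minimal sketch of the kind of date heuristics discussed above, assuming only two common cues: a /YYYY/MM/DD/ path segment in the URL, and an HTML meta date tag. The `guess_date` helper is a hypothetical name, and WebCorp's actual heuristics are more elaborate than this.

```python
import re
from datetime import date

def guess_date(url, html=""):
    """Heuristically date a web page: try a /YYYY/MM/DD/ URL path first,
    then a <meta name="date"> tag in the HTML. Returns None if neither
    cue is present."""
    m = re.search(r'/(\d{4})/(\d{2})/(\d{2})/', url)
    if m:
        return date(*map(int, m.groups()))
    m = re.search(
        r'<meta[^>]*name=["\']date["\'][^>]*content=["\'](\d{4})-(\d{2})-(\d{2})',
        html, re.IGNORECASE)
    if m:
        return date(*map(int, m.groups()))
    return None

print(guess_date("https://example.com/blog/2004/10/15/corpus-news"))
```

In practice such cues conflict (server headers, last-modified dates, dates mentioned in the text itself), which is why a dating mechanism needs an ordered set of heuristics rather than a single rule.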
Using Corpora in …, 2010
2000
In this paper we present the Corpógrafo, an integrated web-based environment for corpus linguistics and knowledge engineering being developed at the Porto node of Linguateca. The Corpógrafo aims to provide an integrated corpus research environment by making freely available on the web a comprehensive set of text and language tools (http://www.linguateca.pt/corpografo/). We describe the current stage of development
2002
Seven Tones ([13]) is a search engine specialized in linguistics and languages. Its current database, stored on a single machine, contains approximately 240,000 indexed web pages about linguistics and languages. Nevertheless, the search engine is designed for a much larger capacity. It has been used in several other systems and can be transplanted into a distributed computing environment. In terms of relevance and web-page quality, Seven Tones performs better than Google in the area of linguistics and languages.