Contact

Send us your feedback

Thank you for your feedback. An email has been sent to the ESRC support team.

An error occured whilst sending your feedback. Please review the problems below.

DEVELOPMENT OF AN EFFECTIVE GRAMMATICAL QUERY METHODOLOGY IN THE CONTEXT OF A PARSED CORPUS

  • Start date: 01 March 1998
  • End date: 31 January 1999

In recent years, corpus linguistics has developed dramatically, due to increased computing power and improvements in annotation software. This has precipitated a growth in the scale and complexity of corpora, including the new grammatically annotated ICE-GB corpus. Text corpora have been used both to improve software tools, such as grammatical parsers, and to improve our understanding of language. The research is to develop a linguistically plausible and transparent method of forming queries for grammatical corpora. The proposal is to use fragments of grammatical trees as the main representation for queries. These fuzzy tree fragments appeal because of the obvious parallel with familiar grammatical structure. The difference is that a query must capture both what is known and what is unknown: some components and relations may be ommitted or fuzzy. Developing this notion of fuzziness is a major part of the research. Complex queries may then be constructed by combining these tree fragments with sociolinguistic variables using a logical language. This project will run concurrently with the first release of the ICE-GB corpus, and early prototype of the system will be provided at this point. Feedback from end users to aid further development.

  • Outputs (9)