Author: Sean A Wallis Date: 08 August 2003 Book chapter
Our Research Catalogue contains grants and outputs data up to the end of April 2014. Records will no longer be updated after this date.
DEVELOPMENT OF AN EFFECTIVE GRAMMATICAL QUERY METHODOLOGY IN THE CONTEXT OF A PARSED CORPUS
In recent years, corpus linguistics has developed dramatically, due to increased computing power and improvements in annotation software. This has precipitated a growth in the scale and complexity of corpora, including the new grammatically annotated ICE-GB corpus. Text corpora have been used both to improve software tools, such as grammatical parsers, and to improve our understanding of language. The research is to develop a linguistically plausible and transparent method of forming queries for grammatical corpora. The proposal is to use fragments of grammatical trees as the main representation for queries. These fuzzy tree fragments appeal because of the obvious parallel with familiar grammatical structure. The difference is that a query must capture both what is known and what is unknown: some components and relations may be ommitted or fuzzy. Developing this notion of fuzziness is a major part of the research. Complex queries may then be constructed by combining these tree fragments with sociolinguistic variables using a logical language. This project will run concurrently with the first release of the ICE-GB corpus, and early prototype of the system will be provided at this point. Feedback from end users to aid further development.
- Outputs (9)
Author: G Nelson Date: 17 June 2002 Book
Author: Sean A Wallis Date: 23 October 2001 Journal article
Author: Sean A Wallis Date: 30 January 2001 Journal article
Creator: G Nelson Date: 07 November 2000 Software/multimedia package
Author: Bas Aarts Date: 07 November 2000 Journal article
Author: G Nelson Date: 07 November 2000 Conference paper/presentation
Author: G Nelson Date: 07 November 2000 Book chapter
Author: Sean A Wallis Date: 07 November 2000 Book chapter