Projects

This page lists some of the projects that our lab members are/were involved in.

Language Data Commons of Australia (LDaCA)

The LDaCA project (led by the University of Queensland) makes nationally significant language data available for academic and non-academic use and provides a model for ensuring continued access with appropriate community control. It also connects these data to an improved analysis infrastructure for text analytics. This project received investment (https://doi.org/10.47486/HIR001) from the Australian Research Data Commons (ARDC). The ARDC is funded by the National Collaborative Research Infrastructure Strategy (NCRIS). Further information about the lab’s involvement in this project is available here.

Australian Text Analytics Platform (ATAP)

The Sydney Corpus Lab is collaborating with the University of Queensland (project lead), Sydney Informatics Hub, and AArNet on the Australian Text Analytics Platform (ATAP). ATAP received investment (https://doi.org/10.47486/PL074) from the Australian Research Data Commons (ARDC). The ARDC is funded by the National Collaborative Research Infrastructure Strategy (NCRIS).

Health in the Media (Monika Bednarek)

We use corpus linguistics and discourse analysis to better understand how journalists write about health, including diabetes, obesity, disability, and other health-related conditions or issues. This research project is associated with the University’s Charles Perkins Centre. For more information go to Health in the Media. As part of this project, we designed and built the Diabetes News Corpus (see corpus Guide). The project on diabetes has now been completed (see summary). We have also completed a study on disability in Australian newspapers. We are now working on Australian news coverage of obesity as part of an international collaboration, and have built a new 16-million-word corpus of Australian news coverage of obesity, supported by the Sydney Informatics Hub. Information about this corpus can be found in the corpus manual. Project publications include ‘Weight stigma: Towards a language-informed analytical framework‘ and ‘Trialling corpus search techniques for identifying person-first and identity-first language‘, with others in progress.

Constructing Teacher Identities: Representations of Teachers in the Print Media (Nicole Mockler)

This project, funded by a University of Sydney Research Accelerator (SOAR) prize, used innovative research methods (including corpus-assisted discourse analysis) to map print media representations of teachers in Australia (with some comparison to other Anglophone countries) over the past two decades. This research is the focus of a research monograph, published by Bloomsbury in 2022.

News Reporting of Conflict (Alex Garcia)

Motivated by the obvious misconception of the Colombian conflict among her undergraduate students, Alexandra Garcia’s PhD thesis investigated the representation of the conflict in the press. She has blogged about the conflict and her research at https://laperorata.wordpress.com/. Work-in-progress focusses on corpus linguistic analysis of transgender people in the Australian press.

Aboriginal English(es) in Australian film and television

A project investigating the representation of Aboriginal English(es) and other Indigenous languages in the media – project description available here.

Other Projects

Some of our other past and current projects – including by students – are described in our series of blog posts.