New features available in the ATAP Quotation Tool

Written by Monika Bednarek We have just updated the ATAP Quotation Tool (Jufri & Sun 2022), which is an Australian Text Analytics Platform tool that allows users to identify and extract quotes from English-language newspaper texts. In addition to extracting the quotes, the tool also provides information about who the speakers are, identifies the reporting…

Using constructed week sampling to compile a newspaper corpus

Written by Kelvin Lee This blog post introduces the Honi Soit corpus, a training dataset that we compiled using a variation of what is called constructed week sampling (as explained below). We hope that the description of this sampling method will be of interest to others who want to compile newspaper corpora. The Honi Soit…

2023: The year in review for the Sydney Corpus Lab

(written by Monika Bednarek) 2023 was a fairly busy year for the Sydney Corpus Lab, as we continued working on various projects, including our international collaboration with the Centre for Corpus Approaches to Social Science (CASS) on obesity in the news (see associated articles here and here), and our national collaboration on the ARDC-funded Australian…

Interview with Monika Bednarek

In 2023, the Sydney Corpus Lab featured edited extracts from 12 episodes of Dr Robbie Love’s CorpusCast podcast about corpus linguistics, covering the first year of the podcast (2022). In each blog post published from January to November, we presented the answers of leading corpus linguists to three questions. Specifically, all blog posts presented answers…

Interview with Ute Römer

In 2023, the Sydney Corpus Lab is pleased to be featuring edited extracts from Dr Robbie Love’s CorpusCast podcast about corpus linguistics. In each blog post published throughout the year, we present the answers of leading corpus linguists to three questions. Specifically, all blog posts present answers to the following two questions: Posts from episodes…

Interview with Michaela Mahlberg

In 2023, the Sydney Corpus Lab is pleased to be featuring edited extracts from Dr Robbie Love’s CorpusCast podcast about corpus linguistics. In each blog post published throughout the year, we present the answers of leading corpus linguists to three questions. Specifically, all blog posts present answers to the following two questions: Posts from episodes…

Exploring Newspaper Headlines with Corpus Linguistic Tools

written by Martin Luginbühl In June 2023, I had the opportunity to work for three weeks as a Visiting Scholar at the Sydney Corpus Lab. I first met Monika Bednarek some years ago when she was a Fellow at the Freiburg Institute for Advanced Studies, which is only a stone’s throw away from my University,…

Interview with Tony McEnery

In 2023, the Sydney Corpus Lab is pleased to be featuring edited extracts from Dr Robbie Love’s CorpusCast podcast about corpus linguistics. In each blog post published throughout the year, we present the answers of leading corpus linguists to three questions. Specifically, all blog posts present answers to the following two questions: Posts from episodes…

Interview with Mark McGlashan

In 2023, the Sydney Corpus Lab is pleased to be featuring edited extracts from Dr Robbie Love’s CorpusCast podcast about corpus linguistics. In each blog post published throughout the year, we present the answers of leading corpus linguists to three questions. Specifically, all blog posts present answers to the following two questions: Posts from episodes…

Interview with Stephanie Evert

In 2023, the Sydney Corpus Lab is pleased to be featuring edited extracts from Dr Robbie Love’s CorpusCast podcast about corpus linguistics. In each blog post published throughout the year, we present the answers of leading corpus linguists to three questions. Specifically, all blog posts present answers to the following two questions: Posts from episodes…