Publication Date: 2024-11-04
The official launch of the LiRI Corpus Platform was held on November 1, 2024, at the first-ever LCP Day. The event took place in a hybrid format, allowing those who could not attend in person to participate online. The recordings can be found here (many thanks to Clemens Lutz for producing the clean recordings):
Congratulations to the speakers of the day: Noah Bubenhofer (Introduction), Jeremy Zehr (presentation of catchphrase), Jonathan Schaber (presentation of soundscript) and Teodora Vuković (presentation of videoscope).
The LiRI Corpus Platform (LCP) is a web-based tool for handling and analyzing linguistic data. It allows users to carry out various tasks on corpora, such as querying and performing analyses across multiple modalities. The LCP is designed to support a range of linguistic research needs, from corpus creation to complex analysis, offering both user-friendly interfaces and advanced query options for researchers. Users can query corpora directly from their browser and import their own corpora using a command-line interface.
The LCP includes three interfaces: Catchphrase (for text corpora), Soundscript (for spoken corpora) and Videoscope (for audiovisual/video corpora), which can all be accessed from the browser on the following website:
To learn more about the design features of the LiRI Corpus platform, read the following paper published in the CLARIN proceedings: The LiRI Corpus Platform
The LiRI Corpus Platform is currently available as a beta version. The LiRI team is therefore seeking user feedback to improve and further develop the platform in line with users' experience and needs.
If you are interested in using the LCP, please fill in the following survey: