National Corpus of Irish

National Corpus of Irish

An initiative to develop a large national corpus of contemporary Irish encompassing both written and spoken sources. Various subcorpora and related specialised corpora will also be hosted here in what will become a hub for corpus-based research on the Irish language.

The Gaois research group, Fiontar & Scoil na Gaeilge, DCU will begin compiling the National Corpus of Irish in January, 2022. The project is being funded by the Department of Tourism, Culture, Arts, Gaeltacht, Sport and Media for the period 2022-2024 with support from the National Lottery.

Written and spoken data will be compiled during the project, and the contents of the corpora will be presented in a way that facilitates use by researchers and members of the public. The corpus contents will be accessible on this website, and users will be able to search the corpora and download data generated during the project. Comprehensive documentation pertaining to the data will also be available.

The following are the projected sizes of the corpora: Major Corpus: c.155 million words; the Corpus of Written Irish: c.100 million words; the Corpus of Spoken Irish: c.6.5 million words; the Monitor Corpus of Irish: one million words per annum from the year 2000 on.

Word frequency and n-gram lists will be published at regular intervals, and language models will be published here and on other appropriate platforms during the course of the project.

Members of the Advisory Committee:

  • Dr Anne O’Keefe (Mary Immaculate Collegel)
  • Dr Katie Ní Loingsigh (NUI Galway)
  • Eoin McEvoy (UCD)
  • Aoife Ní Ghloinn (Maynooth University)
  • Dr Neasa Ní Chiaráin (TCD)
  • Professor Ailbhe Ní Chasaide (TCD)
  • Dr Charles Dillon (Foclóir Stairiúil na Gaeilge, Royal Irish Academy)
  • Professor Kevin Scannell (Saint Louis University)
  • Dr Elaine Uí Dhonnchadha (TCD)
  • Cormac Breathnach (Foras na Gaeilge)
  • Dr Pádraig Ó Mianáin (Foras na Gaeilge)
  • Dr Jennifer Foster (ADAPT & DCU)
  • Donncha Ó Cróinín (Foras na Gaeilge)
  • Colm Ó Ciardubháin (Fiontar & Scoil na Gaeilge, DCU)

Contact

Dr Úna Bhreathnach

Gaois, Fiontar & Scoil na Gaeilge, DCU