The Parsed Corpus of
Middle English Poetry (PCMEP)


Welcome to the Homepage of the Parsed Corpus of Middle English Poetry.

The PCMEP is a fully parsed and annotated corpus of Middle English verse texts. It currently includes 46 Middle English poems with a total of 190751 parsed words.
Its main goal is to help close the substantial gap in English prose texts between c. 1250 and 1350 with available poetic records from the same period. However, in order to be able to assess the genre difference between prose and poetry, the corpus covers a slightly greater time span than that, namely c. 1150 to 1420 (Helsinki periods M1, M2, M3).
The PCMEP is parsed according to the same guidelines as its much larger sister corpus, the Penn-Parsed Corpus of Middle English, second edition (PPCME2). Thus, researchers familiar with the PPCME2 do not have to learn any new annotation schemes and can use their PPCME2 search queries without modification on the PCMEP text files as well.
If you have any questions about the PCMEP, please feel free to send me a message. My e-mail address is I am constructing the corpus as a part of my PhD thesis. (To learn more about my research, go to my research page.) The goal is to parse about 100,000 words in total. However, the PCMEP is not the primary focus of my work. Hence, it may take some time before this project is finished. The corpus will be released once the target size has been reached.
The PCMEP was funded by a Doc.Mobility grant from the Swiss National Science Foundation (P1GEP1_148611).

The header image shows three manuscripts containing Middle English poetry. From left to right,
  • the Vernon Manuscript (c. 1390), f. 105v,
  • Cambridge, Trinity College Manuscript R.3.2. (c. 1420), f. 1v
  • and the Auchinleck Manuscript (c. 1330), f. 167r.