Extensively cited research output featuring more than 250 publications on various topics in Natural Language Processing. Research output highlighted as internationally leading in UK Research Evaluations. Author of the monograph Anaphora Resolution (Longman) and sole Editor of The Oxford Handbook of Computational Linguistics (Oxford University Press). Executive Editor of the Journal of Natural Language Engineering (Cambridge University Press) and Editor-in-Chief of the Natural Language Processing book series of John Benjamins publishers. Programme Chair and keynote speaker of major international conferences. Director of the Research Institute in Information and Language Processing (University of Wolverhampton). Recipient of many professional distinctions and awarded the title of Doctor Honoris Causa at Plovdiv University in 2011 and Professor Honoris Causa at Veliko Tarnovo University in recognition of his outstanding scholarly and research achievements.
What matters more – the quality of the corpora or their size? The case of automatic translation of multiword expressions using comparable corpora
This study investigates and compares the impact of the size and the similarity/quality of comparable corpora on the specific task of extracting translation equivalents of verb-noun collocations from such corpora. A comprehensive evaluation of different configurations of English and Spanish corpora sheds light on the more general and perennial question: what matters more – quantity or quality?