English   Italiano

Publications

Abel, A. / Anstein, S. (2011): "Korpus Südtirol – Varietätenlinguistische Untersuchungen." In: Abel, A. / Zanin, R. (eds.): Korpora in Lehre und Forschung. Bozen-Bolzano: University Press. pp. 29-54. [book as pdf]

Baroni, M. (2010): "Corpora di italiano." In: Simone, R. (ed.): Enciclopedia dell'italiano, vol. 1. Roma: Istituto della Enciclopedia Italiana. pp. 300-303.

Baroni, M. / Bernardini, S. (to appear): "Corpus query tools for lexicography." In: Heid, U. (ed.): Lexicography: An International Handbook. Berlin: Mouton de Gruyter.

Borghetti, C. / Castagnoli, S. / Brunello, M. (2011): "I testi del web: una proposta di classificazione sulla base del corpus PAISÀ." In: Cerruti, M. / Corino, E. / Onesti, C. (eds.): Scritto e parlato, formale e informale: La comunicazione mediata dalla rete., Roma: Carocci, pp. 147-170.

Bosco, C. / Montemagni, S. / Mazzei, A. / Lombardo, V. / Dell'Orletta, F. / Lenci, A. (2009): "Evalita'09 Parsing Task: comparing dependency parsers and treebanks". In: Proceedings of Evalita'09 - Evaluation of NLP and Speech Tools for Italian. Reggio Emilia, Italy, December 2009. [paper as pdf]

Bosco, C. / Montemagni, S. / Mazzei, A. / Lombardo, V. / Dell'Orletta, F. / Lenci, A. / Lesmo, L., Attardi, G. / Simi, M. / Lavelli, A. / Hall, J. / Nilsson, J. / Nivre, J. (2010): "Comparing the Influence of Different Treebank Annotations on Dependency Parsing." In: LREC 2010: 7th International Conference on Language Resources and Evaluation. Valletta, Malta, 17-23 May 2010. [paper as pdf]

Culy, C. / Lyding, V. (2010): "Visualizations for exploratory corpus and text analysis." In: Proceedings of the 2nd International Conference on Corpus Linguistics (CILC-10), A Coruña, Spain, 13-15 May 2010. pp. 257-268. [prefinal paper as pdf]

Culy, C. / Lyding, V. (2011): "Corpus Clouds - Facilitating Text Analysis by Means of Visualizations." In: Vetulani, Z. (ed.): Human Language Technology: Challenges for Computer Science and Linguistics. Berlin: Springer. pp. 351-360. [link to article]

Culy, C. / Lyding, V. / Dittmann, H. (2011a): "Structured Parallel Coordinates: a visualization for analyzing structured language data." In: Proceedings of the 3rd International Conference on Corpus Linguistics (CILC-11), Valencia, Spain, 6-9 April 2011. pp. 525-533. [prefinal paper as pdf]

Culy, C. / Lyding, V. / Dittmann, H. (2011b): "xLDD: Extended Linguistic Dependency Diagrams." In: Proceedings of the 15th International Conference on Information Visualisation (IV2011), London, United Kingdom, 13-15 July 2011. pp. 164-169. [link to article]

Culy, C. / Lyding, V. / Dittmann, H. (2011c): "Visualizing Dependency Structures." In: Proceedings of the Conference of the German Society for Computational Linguistics and Language Technology (GSCL) 2011, Hamburg, Germany, 28-30 September 2011. pp. 81-86. [prefinal paper as pdf]

Dell'Orletta, F. (2009): "Ensemble system for Part-of-Speech tagging." In: Proceedings of Evalita'09 - Evaluation of NLP and Speech Tools for Italian, Reggio Emilia, Italy, December 2009. [paper as pdf]

Dell'Orletta, F. / Marchi, S. / Montemagni, S. / Venturi, G. / Agnoloni, T. / Francesconi, E. (2012): "Domain Adaptation for Dependency Parsing at Evalita 2011." In: Proceedings of Evalita'11 - Evaluation of NLP and Speech Tools for Italian, Rome, Italy, 24-25 January 2012. [paper as pdf]

Dell'Orletta, F. / Venturi, G. / Montemagni, S. (2011): "ULISSE: an Unsupervised Algorithm for Detecting Reliable Dependency Parses." In: Proceedings of the 15th Conference on Computational Natural Language Learning (CoNLL-2011), Portland, Oregon, USA, 23-24 June 2011. [paper as pdf]

Dittmann, H. / Ďurčo, M. / Geyken, A. / Roth, T. / Zimmer, K. (2012): "Korpus C4 – a distributed corpus of German varieties." In: Schmidt, T. / Wörner, K. (eds.): Multilingual Corpora and Multilingual Corpus Analysis. Amsterdam: John Benjamins. pp. 339–346. [link to article]

Lyding, V. / Lapshinova-Koltunski, E. / Degaetano-Ortlieb, S. / Dittmann, H. / Culy, C. (2012): "Visualising Linguistic Evolution in Academic Discourse." In: Proceedings of the EACL 2012 Joint Workshop of LINGVIS & UNCLH, Avignon, France, 23-24 April 2012. pp. 44-48. [link to article]

Lyding, V. / Stemle, E. / Borghetti, C. / Brunello, M. / Castagnoli, S. / Dell'Orletta, F. / Dittmann, H. / Lenci, A. / Pirrelli, V. (2014): "The PAISÀ Corpus of Italian Web Texts" In: Proceedings of the 9th Web as Corpus Workshop (WaC-9), Association for Computational Linguistics, Gothenburg, Sweden, April 2014. pp. 36-43. [link to article]


Presentations

Baroni, M. (2010): "The Paisà project", presented at Human Language Technologies for Italian - 2010, Workshop of the Natural Language Processing Group of the AI*IA in conjunction with the 11th AI*IA Symposium on Artificial Intelligence, Brescia, Italy, 1 December 2010. [slides as PDF]

Baroni, M. (2010): "Web 2.0 as corpus: One decade of textual analysis with Web data", invited keynote talk at JADT 2010, Sapienza University, Rome, Italy. [slides as PDF]

Borghetti, C. / Castagnoli, S. / Brunello, M. (2010): "I generi del web tra tradizione e innovazione: un'analisi linguistica sulla base del corpus PAISÀ", invited presentation at the conference Scritto e parlato, formale e informale: La comunicazione mediata dalla rete, Università di Torino, Torino, Italy, 29-30 October 2010. [slides as PDF]

Brunello, M. (2011): "PAISÀ - A Creative Commons corpus", presented at NLP group meeting, University of Leeds, Leeds, United Kingdom, 20 January 2011. [slides as PDF]