J. Monti, F. Dell'Orletta, F. Tamburini, Francesca Masini, M. Silvia Micheli, Andrea Zaninello, Sara Castagnoli, Malvina Nissim, Monti, J., Dell'Orletta, F., Tamburini, F., Masini, F, Micheli, M, Zaninello, A, Castagnoli, S, Nissim, M
The paper describes the creation of a manually validated dataset of Italian multiword expressions, building on candidates automatically extracted from corpora of written Italian. The main features of the resource, such as POS-pattern and lemma distribution, are also discussed, together with possible applications.