Manually Annotated Data

In [1,2,3] two annotated Dataset has been produced:

Italian Lexical Unit dataset

297 Lexical Unit has been manually translated for 11 Frames.

The involved Frames are:

  • BUILDINGS
  • CLOTHING
  • KILLING
  • KINSHIP
  • MAKE_NOISE
  • MEDICAL_CONDITIONS
  • NATURAL_FEATURES
  • POSSESSION
  • SELF_MOTION
  • TEXT
  • WEALTHINESS

The resource is available as a tab separated file in which each row correspond to a tuple <Frame, LU, POS>

E.g.:

Frame Lexical Unit POS
Buildingsalloggion
Buildingsaziendan
Buildingsbaraccan

Lexical Unit to Wordnet Sysnset mapping

This resource is derived trough the Paradigmatic Model[1] for four Frames:
  1. PEOPLE_BY_AGE
  2. KILLING
  3. STATEMENT
  4. CLOTHING.

306 Lexical Units has been validated and 786 pairs <LU , Synset> are available. The WordNet version is the 1.7.

This resource is available as Excel file in which for each Frame is reported:

  • Frame
  • LU
  • Pos
  • WordNet Synset Terms
  • WordNet Synset Gloss
  • Valid Flag (1 if the term is synset is a valid interpretation of the Lexical Unit, 0 otherwise)

E.g.:

Frame Lexical Unit Pos Synset Terms Gloss Valid Flag
People_by_age adolescent n adolescent stripling teenager a juvenile between the onset of puberty and maturity 1
People_by_age adult n adult any mature animal 0
People_by_age adult n adult grownup a fully developed person from maturity onward 1

To obtain these resources please fill the form

References

  1. Marco Pennacchiotti, Diego De Cao, Roberto Basili, Danilo Croce, Michael Roth. Automatic induction of FrameNet lexical units. In Proceedings of the Int. Conference on EMNLP, Hawaii, USA, October, 2008.
  2. Marco Pennacchiotti, Diego De Cao, Paolo Marocco, Roberto Basili. Towards a Vector Space Model for FrameNet-like Resources. In Proceedings of the LREC Conference 2008, May 2008, Marrakesh, Morocco.
  3. Diego De Cao, Danilo Croce, Marco Pennacchiotti, Roberto Basili. Combining word sense and usage for modeling frame semantics.In Proceedings of the Symposium On Semantics In Systems For Text Processing (STEP 08), September 22-24, 2008 - Venice, Italy
 
resources/rtv/annotated_data.txt · Last modified: 2014/11/07 09:55 (external edit)
 
ART - Artificial Intelligence @ Rome Tor Vergata
c/o Dept. of Computer Science, Systems and Production - University of Roma, Tor Vergata
Via del Politecnico 1, 00133 Roma