FrameNet

FrameNet
Mission statementBuilding a lexical database based on a theory of meaning called Frame Semantics.
Commercial?No (freely available for download)
Type of projectLexical database (containing: frames, frame elements(FE), lexical units (LU), examples sentences, and frame relations)
LocationInternational Computer Science Institute in Berkeley, California
OwnerCollin Baker (current project manager)
FounderCharles J. Fillmore
Established1997; 25 years ago (1997)
Websiteframenet.icsi.berkeley.edu

FrameNet is a research and resource development project based at the International Computer Science Institute (ICSI) in Berkeley, California, which has produced an electronic resource based on a theory of meaning called frame semantics. The data that FrameNet has analyzed show that the sentence "John sold a car to Mary" essentially describes the same basic situation (semantic frame) as "Mary bought a car from John", just from a different perspective. A semantic frame is a conceptual structure describing an event, relation, or object along with its participants. The FrameNet lexical database contains over 1,200 semantic frames, 13,000 lexical units (a pairing of a word with a meaning; polysemous words are represented by several lexical units) and 202,000 example sentences. Charles J. Fillmore, who developed the theory of frame semantics which serves as the theoretical the basis of FrameNet, founded the project in 1997 and continued to lead the effort until he died in 2014. Frame Semantic theory and FrameNet have been influential in linguistics and natural language processing, where it led to the task of automatic Semantic Role Labeling.

Concepts

Frames

A frame is a schematic representation of a situation involving various participants, props, and other conceptual roles. Examples of frame names are Being_born and Locative_relation. A frame in FrameNet contains a textual description of what it represents (a frame definition), associated frame elements, lexical units, example sentences, and frame-to-frame relations.

Frame elements

Frame elements (FE) provide additional information to the semantic structure of a sentence. Each frame has a number of core and non-core FEs which can be thought of as semantic roles. Core FEs are essential to the meaning of the frame while non-core FEs are generally descriptive (such as time, place, manner, etc.).[1]

Some examples include:

  • The only core FE of the Being_born frame is called Child; non-core FEs being Time, Place, Relatives, etc.[2]
  • Core FEs of the Commerce_goods-transfer include the Seller, Buyer, Goods, among other things, while non-core FEs include a Place, Purpose, etc.[3]

FrameNet includes shallow data on syntactic roles that frame elements play in the example sentences. For example, for a sentence like "She was born about AD 460", FrameNet would mark "She" as a noun phrase referring to the Child FE, and "about AD 460" as a noun phrase corresponding to the Time frame element. Details of how frame elements can be realized in a sentence are important because this reveals important information about the subcategorization frames as well as possible diathesis alternations (e.g. "John broke the window" vs. "The window broke") of a verb.

Lexical units

Lexical units (LU) are lemmas, with their part of speech, that evoke a specific frame. In other words, when an LU is identified in a sentence, that specific LU can be associated with its specific frame(s). For each frame, there may be many LUs associated to that frame, and also there may be many frames that share a specific LU, this is typically the case with LUs that have multiple word senses.[4] Alongside the frame, each lexical unit is associated with specific frame elements by means of the annotated example sentences.

Example:

Lexical units that evoke the Complaining frame (or more specific perspectivized versions of it, to be precise), include the verbs "complain", "grouse", "lament", and others.[5]

Example sentences

Frames are associated with example sentences and frame elements are marked within the sentences. Thus, the sentence

She was born about AD 460

is associated with the frame Being_born, while "She" is marked as the frame element Child and "about AD 460" is marked as Time. (See the FrameNet Annotation Report for born.v.) From the start, the FrameNet project has been committed to looking at evidence from actual language use as found in text collections like the British National Corpus. Based on such example sentences, automatic semantic role labeling tools are able to determine frames and mark frame elements in new sentences.

Valences

FrameNet also exposes the statistics on the valences of the frames, that is the number and the position of the frame elements within example sentences. The sentence

She was born about AD 460

falls in the valence pattern

NP Ext, INI --, NP Dep

which occurs two times in the example sentences in FrameNet, namely in:

She was born about AD 460, daughter and granddaughter of Roman and Byzantine emperors, whose family had been prominent in Roman politics for over 700 years.
He was soon posted to north Africa, and never met their only child, a daughter born 8 June 1941.

Frame relations

FrameNet additionally captures relationships between different frames using relations. These include the following:

  • Inheritance: When one frame is a more specific version of another, more abstract parent frame. Anything that is true about the parent frame must also be true about the child frame, and a mapping is specified between the frame elements of the parent and the frame elements of the child.
  • Perspectivized_in: A neutral frame (like Commerce_transfer-goods) is connected to a frame with a specific perspective of the same scenario (e.g. the Commerce_sell frame, which assumes the perspective of the seller or the Commerce_buy frame, which assumes the perspective of the buyer)
  • Subframe: Some frames like the Criminal_process frame refer to complex scenarios that consist of several individual states or events that can be described by separate frames like Arrest, Trial, and so on.
  • Precedes: The Precedes relation captures a temporal order that holds between subframes of a complex scenario.
  • Causative_of and Inchoative_of: There is a fairly systematic relationship between stative descriptions (like Position_on_a_scale frame, e.g. "She had a high salary") and causative descriptions (like Cause_change_of_scalar_position frame, e.g. "She raised his salary") or inchoative descriptions (like Change_position_on_a_scale frame, e.g. "Her salary increased").
  • Using: A relationship that holds between a frame that in some way involves another frame. For instance, the Judgment_communication frame uses both the Judgment frame and the Statement frame, but does not inherit from either of them because there is no clear correspondence of the frame elements.
  • See_also: Connects frames that bear some resemblance but need to be distinguished carefully.

Applications

FrameNet has proven to be useful in a number of computational applications, because computers need additional knowledge in order to recognize that "John sold a car to Mary" and "Mary bought a car from John" describe essentially the same situation, despite using two quite different verbs, different prepositions and a different word order. FrameNet has been used in applications like question answering, paraphrasing, recognizing textual entailment, and information extraction, either directly or by means of Semantic Role Labeling tools. The first automatic system for Semantic Role Labeling (SRL, sometimes also referred to as "shallow semantic parsing") was developed by Daniel Gildea and Daniel Jurafsky based on FrameNet in 2002.[6] Semantic Role Labeling has since become one of the standard tasks in natural language processing, with the latest version (1.7) of FrameNet now fully supported in the Natural Language Toolkit.[7]

Since frames are essentially semantic descriptions, they are similar across languages, and several projects have arisen over the years that have relied on the original FrameNet as the basis for additional non-English FrameNets, for Spanish, Japanese, German, and Polish, among others.

See also

References

  1. ^ "Glossary | fndrupal".
  2. ^ "FrameNet Data | fndrupal".
  3. ^ "FrameNet Data | fndrupal".
  4. ^ "Glossary | fndrupal".
  5. ^ https://framenet2.icsi.berkeley.edu/fnReports/data/frameIndex.xml?frame=Complaining
  6. ^ Gildea, Daniel; Jurafsky, Daniel (2002). "Automatic Labeling of Semantic Roles" (PDF). Computational Linguistics. 28 (3): 245–288. doi:10.1162/089120102760275983. S2CID 207747200.
  7. ^ Schneider, Nathan; Wooters, Chuck (2017). "The NLTK FrameNet API: Designing for Discoverability with a Rich Linguistic Resource". EMNLP 2017: Conference on Empirical Methods in Natural Language Processing. arXiv:1703.07438. Bibcode:2017arXiv170307438S.

Further reading

  • Ruppenhofer, Josef; Ellsworth, Michael; Petruck, Miriam R. L.; Johnson, Christopher R.; Baker, Collin F.; Scheffczyk, Jan (November 1, 2016). FrameNet II: Extended Theory and Practice (revised ed.). Berkeley, CA: International Computer Science Institute.

External links

  • FrameNet home page
  • Chinese FrameNet
  • Danish FrameNet
  • German FrameNet
  • Japanese FrameNet
  • Korean FrameNet
  • Polish FrameNet
  • Portuguese FrameNet (Brazil)
  • Spanish FrameNet
  • Swedish FrameNet
  • v
  • t
  • e
General termsText analysis
Text segmentation
Automatic summarizationMachine translationDistributional semantics modelsLanguage resources,
datasets and corpora
Types and
standards
Data
Automatic identification
and data captureTopic modelComputer-assisted
reviewingNatural language
user interfaceOther software