The city council refused the women a permit because they advocated violence.
But in this case the refers to the women
Corpuses suggest that in English corpora, 60-70% of pronouns occur in subject position,and of these, 70% have an antecedent insubject position. It's been called s
These features help us to quickly assign theantecedent correctly with high probability
It says that the more recently introducedentities are more likely antecedents.Hobbs (1978) reported that in his corpus,
90% of all pronoun antecedents werein the current sentence,
98% in the current or the previoussentence,Given (1992) found that
25% of definite antecedents were inthe current clause,
60% in the current or previous 20clauses but
40% were further apartSo it appears that we can just refer to the firstantecedent as the correct one with highprobability.There is a lot of evidence for a
a preference to refer to firstmentioned entities in a sentence combined;these results provide support for a searchstrategy like that proposed by Hobbs (1978
):going back one sentence at a time, then left-to-right.
The earliest, best-known syntax-basedalgorithm, often used as a baseline, t
raversesthe surface parse tree breadth-first, left-to-right, and then going backwards onesentence at a time
preference for first mentioned entities.
It goes up to the dominating NP and finds anantecedent in the left tree & if not found goesup the tree to an S or NP node, finding alongthe left of the path and then along the right of the path if the node is S node and continueabove till it hits an S node where uponcontinue the same with previous sentencesSteps 2, 3 ensure that no NP within the samebinding domain as a pronoun will be chosenas antecedent, in that step 3 requires anotherNP or S node to occur in between the topnode (node X) and any candidate: thus forexample [NP John] will not be chosen as acandidate antecedent of pronoun him in theexample
1: Begin at the NP node immediatelydominating the pronoun.
2: Go up the tree to the first NP or Snode encountered. Call this node X,and call the path used to reach it p.
3: Traverse all branches below node Xto the left of path p in a left-to-right,breadth first fashion. Propose as theantecedent any NP node that isencountered which has an NP or Snode between it and X.
4: if node X is the highest node in thesentence then
5: traverse the surface parse trees of previous sentences in the text inorder of recency, the most recentfirst; each tree is traversed in a left-to-right, breadth-first manner, andwhen an NP is encountered, it isproposed as antecedent
7: (X is not the highest node in thesentence) continue to step 9.
8: end if
9: From node X, go up the tree to thefirst NP or S node encountered. Callthis new node X, and call the pathtraversed to each it p.