Part of Speech Tags for Common Collocations and Phrases
Collocations are specific arrangements of words or words habitually
found together. They tend to function as a semantic unit (together
they have a meaning in the same way a single word has a meaning). The
tagger often guesses incorrectly for words in collocation because it
doesn't recognize the context; for example, it might guess 'due/JJ to
recent advances' for 'due/IN to recent advances' because due is
usually a JJ.
This list was compiled by Jo Wright from Santorini's tagging guide
and our biomedical addenda but it also contains new entries. Please
note that most of the Santorini entries can also be found in the
addendum.
- a lot
- a/DT lot/NN (addendum,
Santorini 26)
- a priori
- a/FW priori/FW (addendum)
- all but
- all/RB but/RB (Santorini 22)
- all right
- all/RB right/JJ (Santorini 22)
- another one
- another/DT one/NN (Santorini 22)
- apart from
- apart/RB from/IN
- as well as
- as/RB well/RB as/IN (Santorini 23)
- at all
- at/IN all/DT (Santorini 23)
- because of
- because/IN of/IN
- close to
- close/RB to/TO (Santorini 24)
- closer to
- closer/RBR to/TO (Santorini 24)
- downstream of
- downstream/IN of/IN
Please note that downstream is only /IN when in collocation with of ; in all other cases it should be /RB. See Santorini 9, 'IN or RB.'
- due to
- due/IN to/TO (addendum, Santorini 24)
- each other
- each/DT other/JJ (Santorini 24)
- further from
- further/RBR from/IN (Santorini 25)
Please note that further is often not comparative,
unlike farther:
- For further/JJ details see...
- Further/RB , we found that...
See Santorini
12.
- in common
- in/IN common/NN
- in general
- in/IN general/NN
- in order that
- in/IN order/NN that/IN
- in particular
- in/IN particular/NN
- in respect to *
- in/IN respect/NN to/TO
- in situ
- in/FW situ/FW
- in spite of
- in/IN spite/NN of/IN
- in vivo
- in/FW vivo/FW
- in vitro
- in/FW vitro/FW (addendum)
- insofar as
- insofar/IN as/IN
- insomuch as, inasmuch as
- insomuch/IN as/IN, inasmuch/IN as/IN
Also as single word: inasmuchas/IN
- near to
- near/RB to/TO (Santorini 27)
- nearer to
- nearer/RBR to/TO (Santorini 27)
*
- no one
- no/DT one/NN (the
addendum, Santorini 28)
- on par with
- on/IN par/NN with/IN
(Merriam Webster Online "par")
- par for the course
- par/JJ for/IN the/DT course/NN
(Merriam Webster Online "par")
- rather than
- rather/IN than/IN
(addendum,
Santorini 29)
- such as
- such/JJ as/IN (Santorini 30)
- together with
- together/RB with/IN
- up to
- up/IN to/TO
up can be /RB or /RP but it is /IN in collocation with
to, as seen in such phrases as
- up/IN to/TO thirty Altairian dollars a day
- up/IN to/TO five minutes
[Santorini's example, 'look up to someone' (which will probably not
come up in a biomedical context), has an entirely different meaning
but the same tagging, "up/IN to/TO".]
(Santorini 9,
Merriam Webster Online "up to")
- upstream of
- upstream/IN of/IN
Please note that upstream is only /IN when in collocation with
of ; in all other cases it should be /RB. See Santorini 9, 'IN or RB.'
- with respect to
- with/IN respect/NN to/TO
CHANGE NOTES
(Generally excluding additions dated with change tags.)
2004-09-20: "in respect to",
"IN IN TO" ==> "IN NN TO" [Jo Wright says it was a typo.]
2004-09-21:
"nearer/RB to" ==> "nearer/RBR to"
2004-09-21