I have previously discussed the limitations of using the count of forward citations and some other parameters to predict patent quality. But some new evidence is suggesting that these limitations are stronger than we first thought.
Many analysts in the patent landscaping field use forward citation count, either alone or in combination with othe fields such as the size of the patent family, number of claims etc, to predict patent quality. But can forward citation count be trusted?
The argument for forward citation count as an important is that a higher number of forward citations suggest that the patent being analysed is important, because
- applicants (or even the applicant for the patent being analysed) are also filing patents in the same area,
- the applicant or examiner thought that the patent being reviewed is similar to these later patents.
In fact, these are arguments for citation analysis in general, which is at the heart of Network Patent Analysis. But at Ambercite, we believe that you need to consider all of the citations in the patent network to understand the importance of individual patents, and not just the direct citations. By only relying on forward citations (or even only forward and backward citations) you may be missing out on a lot of valuable information in the rest of the network.
The evidence for this is coming from the area of Alzheimer's treatments. We have previously reported on a white paper we have published on the patents in this area. 48,000 patents in the area of Alzheimer's treatment were whittled down to a final dataset of 2153 patents, which formed into 23 clusters. The very highest rated patent of all was US7189819, which protects the pahse III trial Alzheimer's treatment bapineuzumab, being marketed by Pfizer . In 14th position was US7195761, which protects the Eli Lilly phase III trial drug solanezumab. What we have not reported to date is that another promising alzheimers drug, which is now in phase II trials, ended up with its main patent in around the 200th position (but for commercial reasons we are not able to say any more than this about this drug or patent).
When we wrote this white paper, we decided that one of the internal tests of whether NPA worked was whether it was able to rank highly the drugs being tested in phase I, II or III Alzheimer's trials. So these results were pleasing. But would we have got the same results if we had simply counted forward citations? Table 1 below helps to answer this. And in this table, I have also added data for these three patents for backward citation count, total citation count, and the number of members in the INPADOC patent family, which is also used by some analysts as predictor of patent quality. I have also 'normalised' forward citation count by the how long the patent application has been published for, in order to determine the number of forward citations per year, which is thought to be an important parameter by some.
Table 1: NPA and other rankings for known Alzheimer treatment candidate drugs
| Drug | Patent protecting drug | NPA ranking | Count | Ranking based on number of (48,000 patents) | ||||||||
| Forward citations | Forward citations per year | Backward citations | Total citations | INPADOC family members | Forward citations | Forward citations per year | Backward citations | Total citations | INPADOC family members | |||
| Bapineuzumab | US7189819 | 1 | 14 | 1.6 | 304 | 318 | 395 | ~5100 | 4942 | 93 | 95 | 308 |
| Solanezumab | US7195761 | 14 | 15 | 1.9 | 35 | 50 | 52 | ~4800 | 3949 | ~2750 | ~3170 | ~4400 |
| Phase II trial Alzheimer's drug | US7xxxxxx | ~around 200th | 18 | 0 | 0 | 18 | ~30 | ~37,000 | ~21,400 | ~4750 | ~9830 | ~12,000 |
As you can see, relying on forward or backward citation count, or the number of family members, would have given a very different ranking of these obviously important patents. The importance of these patents would have been missed. However the special algorihms used by NPA was able to rank these Alzheimer's drug very highly due to where they sat within the total patent network.
But what would you have ended up with if you had used these more conventional measures? I have reversed the question, and in Table 2 below asked which patents would have come up top in using the above measures:
Table 2: Highest ranked patents in Alzheimer's treatment data set based on non-NPA ranking measures
| Criteria | Patent (value) | Title | Applicant | NPA patent ranking |
| Highest forward citation count |
US5223409 (892 forward citations)
|
Directed evolution of novel binding proteins
|
Dyax Corporation | 894 |
| 2nd highest forward citation count |
US5580859 (465 forward citations)
|
Delivery of exogenous DNA sequences in a mammal |
Vical Incorporated/Wisconsin Alumni Research Foundation |
907 |
| 3rd highest forward citaton count |
US5399346 (451 forward citations) |
Gene therapy
|
US Dept of health and human services |
Not in leading 2153 patents |
|
Highest rate of forward citations
|
WO2001075067 (50 forward citations per year) | Novel nucleic acids and peptides | Hyseq, Inc. | 753 |
|
Highest backward citation count
|
US7678808 (900 backward citations) |
5 HT receptor mediated neurogenesis |
Braincells, Inc | 848 |
| Highest total citation count |
US5223409 (926 total citations)
|
Directed evolution of novel binding proteins |
Dyax Corporation | 894 |
| Highest family member count |
WO1999046281 (2115 INPADOC family members)
|
Novel Polypeptides and nucleic acids encoding the same | Genentech, Inc | 309 |
This of course is a very different set of patents to what NPA showed was important for Alzheimer's treatments.
Which raises the questions:
- if you are relying on metrics that rely on forward citation count to identify high value patents, what key patents are you (or your clients) missing?
- What impact could this missing information make on your business decisions?
- What business risks are you taking due to this missing information?
Contact us for further information about how we provide an unique NPA patent quality ranking for your area of business.




