J Am Med Inform Assoc. 2011 Sep-Oct;18(5):660-7.

doi: 10.1136/amiajnl-2010-000055. Epub 2011 May 25.

Recommending MeSH terms for annotating biomedical articles

Minlie Huang¹, Aurélie Névéol, Zhiyong Lu

Affiliation

¹ State Key Laboratory of Intelligent Technology and Systems, Tsinghua National Laboratory for Information Science and Technology, Department of Computer Science and Technology, Tsinghua University, Beijing, PR China.

PMID: 21613640
PMCID: PMC3168302
DOI: 10.1136/amiajnl-2010-000055

Minlie Huang et al. J Am Med Inform Assoc. 2011 Sep-Oct.

Abstract

Background: Due to the high cost of manual curation of key aspects from the scientific literature, automated methods for assisting this process are greatly desired. Here, we report a novel approach to facilitate MeSH indexing, a challenging task of assigning MeSH terms to MEDLINE citations for their archiving and retrieval.

Methods: Unlike previous methods for automatic MeSH term assignment, we reformulate the indexing task as a ranking problem in which relevant MeSH headings are ranked higher than irrelevant ones. Specifically, for each document we retrieve 20 neighbor documents, obtain a list of MeSH main headings from the neighbors, and rank those main headings using ListNet, a learning-to-rank algorithm. We trained our algorithm on 200 documents and tested it on a previously used benchmark set of 200 documents and a larger dataset of 1000 documents.
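The pipeline described in the Methods can be illustrated with a minimal Python sketch. It is not the authors' implementation. The two features (neighbor vote count and summed neighbor similarity), the fixed weights, and all function names are assumptions made for illustration; the abstract does not list the actual feature set or the trained model. The loss function shows the top-one cross-entropy objective that ListNet is generally described as minimizing.

```python
import math
from collections import defaultdict


def candidate_headings(neighbors):
    """Pool MeSH main headings from neighbor documents.

    neighbors: list of (similarity, headings) pairs, one per neighbor.
    Returns {heading: [vote_count, summed_similarity]}.
    Both features are illustrative assumptions, not the paper's feature set.
    """
    feats = defaultdict(lambda: [0.0, 0.0])
    for sim, headings in neighbors:
        for mh in set(headings):
            feats[mh][0] += 1.0   # how many neighbors carry this heading
            feats[mh][1] += sim   # how similar those neighbors are
    return dict(feats)


def softmax(xs):
    """Numerically stable softmax over a list of floats."""
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]


def listnet_top1_loss(scores, labels):
    """Cross entropy between the top-one distributions of labels and scores."""
    p_true = softmax(labels)
    p_pred = softmax(scores)
    return -sum(t * math.log(p) for t, p in zip(p_true, p_pred))


def rank(feats, weights):
    """Score each candidate with a linear model and sort best-first."""
    scored = {mh: sum(w * f for w, f in zip(weights, fv))
              for mh, fv in feats.items()}
    return sorted(scored, key=lambda mh: (-scored[mh], mh))


# Toy data: three neighbors instead of the 20 used in the paper.
neighbors = [
    (0.9, ["Humans", "MEDLINE", "Algorithms"]),
    (0.7, ["Humans", "Algorithms"]),
    (0.4, ["Humans", "Mice"]),
]
feats = candidate_headings(neighbors)
ranking = rank(feats, weights=[1.0, 0.5])  # hypothetical weights, not learned
print(ranking)
```

In a trained system, the weights would be fitted by minimizing `listnet_top1_loss` over the candidate lists of the training documents, with the human-assigned headings as labels.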

Results: Tested on the benchmark dataset, our method achieved a precision of 0.390, recall of 0.712, and mean average precision (MAP) of 0.626. In comparison to the state of the art, we observe statistically significant improvements as large as 39% in MAP (p-value <0.001). Similar significant improvements were also obtained on the larger document set.
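For readers unfamiliar with the reported metrics, the following sketch shows one common way to compute precision, recall, and mean average precision for ranked heading lists. The cutoff `k`, the function names, and the toy data are assumptions. The abstract does not state the cutoff or the exact average-precision variant used in the paper, so these numbers are not comparable to the reported results.

```python
def precision_recall(ranked, relevant, k):
    """Precision and recall over the top-k predicted headings."""
    hits = sum(1 for mh in ranked[:k] if mh in relevant)
    return hits / k, hits / len(relevant)


def average_precision(ranked, relevant):
    """Mean of the precision values at each rank where a relevant heading occurs."""
    hits, total = 0, 0.0
    for i, mh in enumerate(ranked, start=1):
        if mh in relevant:
            hits += 1
            total += hits / i
    return total / len(relevant) if relevant else 0.0


def mean_average_precision(runs):
    """runs: list of (ranked_list, relevant_set) pairs, one per document."""
    return sum(average_precision(r, g) for r, g in runs) / len(runs)


# Toy example: one document with two gold-standard main headings.
ranked = ["Humans", "Mice", "Algorithms", "MEDLINE"]
gold = {"Humans", "Algorithms"}
p, r = precision_recall(ranked, gold, k=2)
ap = average_precision(ranked, gold)
print(p, r, ap)
```

Here the relevant headings sit at ranks 1 and 3, so the average precision is (1/1 + 2/3) / 2.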

Conclusion: Experimental results show that our approach makes the most accurate MeSH predictions to date, which suggests strong potential for practical impact on MeSH indexing. Furthermore, as discussed, the proposed learning framework is robust and can be adapted to many similar tasks beyond MeSH indexing in the biomedical domain. All data sets are available at: http://www.ncbi.nlm.nih.gov/CBBresearch/Lu/indexing.

Conflict of interest statement

Competing interests: None.

Figures

**Figure 1**
An overview of our approach. MH, main heading.

**Figure 2**
Sample MeSH terms assigned to a MEDLINE article. The terms inside the blue box are main headings, and those outside the blue box are subheadings.

**Figure 3**
Ranking performance (y-axis) as a function of the number of neighbor documents (x-axis). MAP, mean average precision.

Grants and funding

ZIA LM091711/ImNIH/Intramural NIH HHS/United States
