INFORMATION AND RESOURCES

MetaMapped MEDLINE Baseline Results

MetaMap is a highly configurable program that maps biomedical text to concepts in the UMLS® Metathesaurus®. MetaMap also forms the core of the Indexing Initiative Project suite of programs, and is the basis for one of the indexing methods in the NLM Indexing Initiative's Medical Text Indexer (MTI). For more information on MetaMap and the Indexing Initiative Project, please see the Indexing Initiative Homepage and MetaMap Homepage web pages.

If you would prefer Human Readable or XML formatting for your results, we do have a program called MM_Print which is available for download so you can convert the MMO results into the format you want. The download is available from the following web page: https://metamap.nlm.nih.gov/MMPrint.shtml


We recently completed processing the entire 2019 Medline®/PubMed® Baseline through the MetaMap program generating MetaMap Machine Output (MMO) formatted results for each of the citations. The results are now available via the following link. You can download the 972 individual result files.

           2019 MetaMapped Medline Baseline Results

An easy way to download all of the files is to use wget -
   wget -r -np --span-hosts https://lhncbc.nlm.nih.gov/ii/information/MBR/Download/MetaMapped_Medline/2019/MMO/
   wget -r -np --span-hosts https://lhncbc.nlm.nih.gov/ii/information/MBR/Download/MetaMapped_Medline/2019/MEDLINE/


Between March 25, 2015 and April 5, 2015 we processed all 779 files with 23,343,329 citations from the 2015 Medline®/PubMed® Baseline through the MetaMap program generating MetaMap Machine Output (MMO) formatted results for each of the citations. The results are now available via the following link. You have the option of downloading the 779 individual result files, or there is a single tar file called 2015.tar which contains all of the result files.

           2015 MetaMapped Medline Baseline Results

Please note that the large size of these results requires care when downloading. The MMO results have a compressed size of 132GB.

Details:

Data Used 2015 Medline/PubMed Baseline
Data Characteristics * Created November 24 2014
* Consists of 779 files of various counts
* Total of 23,343,329 citations
Command Used * metamap.FML -Z 2014AB -qE
Please Note:
Composite Phrases (-Q 4) is now the default since MetaMap 2013 so that option was not specified on the command line.




Between February 24, 2014 and March 13, 2014 we processed all 746 files with 22,376,811 citations from the 2014 Medline®/PubMed® Baseline through the MetaMap program generating MetaMap Machine Output (MMO) formatted results for each of the citations. The results are now available via the following link. You have the option of downloading the 746 individual result files, or there is a single tar file called 2014.tar which contains all of the result files.

           2014 MetaMapped Medline Baseline Results

Please note that the large size of these results requires care when downloading. The MMO results have a compressed size of 165GB.

Details:

Data Used 2014 Medline/PubMed Baseline
Data Characteristics * Created November 21 2013
* Consists of 746 files of various counts
* Total of 22,376,811 citations
Command Used * metamap13 -Z 2013AB -qE
Please Note:
Composite Phrases (-Q 4) is now the default in MetaMap 2013 so that option was not specified on the command line.