MTI Parameter Information
Close Window

Number of MeSH Terms to Display
Parameter Default Value Option Notes
Number of MeSH Terms to Display 25 -topn <int> Max Number of Recommendations to Display - roughly. This number dictates the number of terms returned and then we add some terms depending on the options you select (addGEOs, addCTs, etc.). The options that add terms can push the number of terms over this number. The valid range here is again 0 to 32,767. 0 would not be very helpful though.
 
Filtering
Parameter Default Value Option Notes
Pre-Packaged Filtering:
Default DCMS Processing OFF -default_DCMS This is the option that we use internally to process the list of new citations going into PubMed/Medline. We process approximately 5,000 citations each night through MTI. This option incorporates the following options: Default Filtering, Add CheckTags, Add Geographics, Add USA, Remove "Do Not Index With" Terms, Limit Results for Title Only Citations, Limit Results via Publication Types, Star MHs that come from the Title, Rank Score Filtering for Title Only Citations, Show Term Unique Identifers, Use Latest Supplemental Concepts, Perform Aged/Human Review, and we use the "List" Output option.
Option-1 DCMS Processing ON -opt1_DCMS This is the same as the above Default DCMS Processing, except we use the "Detailed" Output option instead of "List".
Default Gateway Processing OFF -default_NewGateway This is the option that we use internally to process the Gateway Collection of Conference Abstracts. We annually updated the collection of 98,990 abstracts with new results based on the current MeSH Vocabulary. This option incorporates the following options: Medium Filtering, Add CheckTags, Add Geographics, Add USA, Remove "Do Not Index With" Terms, Use Latest Supplemental Concepts, Perform Aged/Human Review, and we use the "Show "NO_TERMS" List" Output option so we know when an abstract doesn't produce a result.
Option-1 Gateway Processing OFF -opt1_NewGateway This is the same as the above Default Gateway Processing, except we use the "Detailed" Output option instead of "Show "NO_TERMS" List".
Special Output (raw results) OFF -Special_Output This is the option that we use internally to create a static data set from which to work from with MTI. This option overrides everything else that you might set. The reason is that this simply takes the input from the various input paths and performs a simple filtering of the data (removing excluded terms, etc.). The remaining terms are provided as a result with all of the relevant data.
Basic Filtering:
Default Filtering OFF   Basic MTI processing based on the default values for all options.
Medium Filtering OFF -medFilter Remove items from the list of recommendations based on specific heuristics.
Strict Filtering OFF -strictFilter Remove all items from the list of recommendations that are not recommended by both MetaMap and PubMed Related Citations.
 
Post Processing
Parameter Default Value Option Notes
Star MHs that come from the Title ON -starMHTI Add "*" to each MeSH Term that was identified from the Title. This is similar to IM term identification.
Add CheckTags ON -addCTs Add from a list of CheckTags based on review of actual text and the list of CheckTags.
Add Geographics ON -addGEOs Add from a list of Geographic Locations based on review of actual text and the list of Geographics.
Add USA if Triggered ON -addUSA If the MeSH Heading United States is not already assigned and we any of the items from the USA triggers list in the article or list of results, we will add Unites States.
Remove "Do Not Index With" Terms ON -remMHs Remove MeSH Terms which have been indicated as "Do Not Index With" from our list of recommendations and prior to scoring.
Show Headings Mapped to (HM) ON -showHMs Display MeSH Headings that are in fact "Headings Mapped to" with a "HM" notation versus normal MeSH Headings "MH" notation.
Show Entry Terms (ET) ON -showETs Replace MeSH Headings with their corresponding Entry Term where applicable.
Show Treecodes OFF -showTreecodes In the detailed outputs, add in the treecodes for each result if we have them (everything above the line will have the information).
Show Term Unique Identifiers ON -showTUIs Normally only used in our overnight DCMS processing. The TUIs map to a specific term in the MeSH vocabulary and allow the DCMS personnel to map to the appropriate MeSH term even when the spelling of a term changes.
Perform Aged/Human Review ON -doAgedReview Make sure we don't add age related checktags if we already have the CheckTag Animals set and Humans not set. If Animals is not set, and we have age related CheckTags recommended, we need to add Humans. Age related CheckTags include: "Adolescent", "Adult", "Aged", "Child", "Infant", and "Infant, Newborn". If Animals is set, we will remove any of these age related CheckTags.
Bypass Related Citations Results Exclusion/td> ON -nocheckRC Do not process the results obtained via the PubMed Related Citations through our MH_exclude list.
Limit Recommendations via Publication Types ON -limitPTs Reduce the number of recommendations from the default when a citation is identified by specific Publication Types. Currently this is set as follows: PT equals "Review" or "News", we limit the number of returned terms to 14. If the PT equals "Editorial", we limit to 9, and if the PT equals "Letter", we limit to 8.
Limit Recommendations for Title Only Citations ON -limitTitleOnly Reduce the number of recommendations from the default when a citation only has a title field and no abstract. This is currently calculated based on the number of words in the title: 0-2 words limits the number to 7, 3 or 4 limit to 8, 5 or 6 limit to 10, 7 - 10 limit to 11, 11-14 limit to 12, 15 - 18 limit to 13, 19 - 21 limit to 14, and anything larger then 21 words in the title is limited to 13 items.
Rank Score Filtering for Title Only Citations ON -RSfilterTO If this is a title only abstract/citation AND the term is ranked 11 or below on the list of recommendations AND if the score is less then 190, we will stop the list.
Rank Score Filtering for Title & Abstract Citations OFF -RSfilterALL If this is a title AND abstract citation AND the term is ranked 14 or below on the list of recommendations AND if the score is less then 203, we will stop the list.
Use Latest Supplemental Concepts ON -doSuppChemUpdate Every Monday morning the MeSH Vocabulary is updated. This usually only involves the Supplement Concepts. Each week we create a lookup list telling how to handle the changes (replace or remove). This option says to use this updated lookup list and apply any relevant changes. If you don't use this option, the results will be from the static MeSH Vocabulary created at the beginning of each MeSH Indexing year.
Show MeSH DUIs OFF -showDUIs In the detailed output, add in the MeSH Unique Identifier for each result if we have it.
 
Output
Please Note: There is now a new help page just for the display options. This new page also shows examples for each of the display types.
MTI Output Help Information page
Parameter Default Value Option Notes
Simple/Default ON -display_simple2 Simple display with only the names of the MeSH Headings, CheckTags, and SubHeadings being displayed in scoring order and with annotations. This was the default output for the old Interactive MTI web page. The current options better reflect the actual program.
Detailed OFF <blank> or
-detail
Detailed display showing all relevant information about all of the topN recommendations. This includes: name of the item, CUI, final score, type, where item was found in the text, and who recommended the term. In the case of CheckTags and SubHeadings, the field after the type (CT/SH) contains the triggering information - who caused this item to be included in the recommendations. Recommendations are displayed in scoring order.
 
Debugging
Parameter Default Value Option Notes
Detailed Display of Path Inputs OFF -debug1 Detailed information for applying Medium Filtering. Usually used for
Show Decisions during Medium Filtering OFF -medFilterR Detailed information for applying Medium Filtering. Usually used for debugging purposes. Identifies how items are saved and removed. NOTE: This option turns on Medium Filtering.
Show Decisions during Aged/Human Review OFF -showAgedReview Detailed information for applying Aged/Human Review. Usually used for debugging purposes. Identifies how items are saved and removed. NOTE: This option turns on the Aged/Human Review.
Show Decisions during ET Substitution OFF -showETsD Detailed information for applying ET Substitution. Usually used for debugging purposes. Identifies how items are identified. NOTE: This option turns on the Show Entry Terms.
Show Decisions during HM Substitution OFF -showHMsD Detailed information for applying HM Substitution. Usually used for debugging purposes. Identifies how items are identified. NOTE: This option turns on the Show Headings Mapped to.
Show Decisions during Restrict to MeSH OFF -RTM_Debug Detailed information for we convert UMLS Concepts to MeSH Terms via the Restrict to MeSH process. Usually used for debugging purposes.
Show Timing during Processing OFF -doTiming Shows how long various sections of the MTI processing take during the processing of an item.
Show Interim Weight Breakout by Path OFF -showInterimBreakout Shows the raw list of results from the various paths before any scoring, clustering, and filtering has been done. The display breaks out how the various paths contribute to the overall weight for each list item.
Show Decision Point in Rank Score Filtering OFF -showRSfilter Shows the list item that triggers the Rank Score Filtering to kick-in. This works for both RSfilterTO and RSfilterALL.
 
Advanced Options
Path Weight
Parameter Default Value Option Notes
MetaMap 7 -mmi <float> Path Weight for MetaMap Range: 0 to 32,767. 0 turns off
Related Citations 2 -pub <float> Path Weight for Related Citations Range: 0 to 32,767. 0 turns off
Trigram 0 -trg <float> Path Weight for Trigram Range: 0 to 32,767. 0 turns off
 
Related Citation
Parameter Default Value Option Notes
Number of Citations 10 -cit <int> Number of related citations to use from PubMed. Range: 0 to 100. 0 effectively turns off the RC path
MeSH major topic 1.00 -im <float> Relevance scoring for MeSH major topic items returned from the Related Citations method. Range: 0.00 to 1.00
MeSH term 0.80 -nim <float> Relevance scoring for normal MeSH items returned from the Related Citations method. Range: 0.00 to 1.00
 
Nav Weight
Used in calculating the NavScore part of the Clustering Algorithm. The NavScore is the confidence in navigating from a UMLS term to a MeSH Heading.
Parameter Default Value Option Notes
Direct Match 1.00 -dir <float> Relevance scoring for term identified as having a "Direct Match" to a MeSH Heading via lookup in the UMLS MRREL file. Range: 0.00 to 1.00
ATX 1.00 -atx <float> Relevance scoring for term identified as having an "Associated Expression" relationship to a MeSH Heading via lookup in the UMLS MRREL file. Range: 0.00 to 1.00
Parent/Broader 0.90 -par <float> Relevance scoring for term identified as having an "Parent" relationship (term is the parent of the MeSH Heading), or "Broader" relationship (term is a broader term then the MeSH Heading) via lookup in the UMLS MRREL file. Range: 0.00 to 1.00
Child/Narrower 0.75 -chd <float> Relevance scoring for term identified as having a "Child" relationship (term is the child of the MeSH Heading), or "Narrower" relationship (term is a narrower then the MeSH Heading) via lookup in the UMLS MRREL file. Range: 0.00 to 1.00
Sibling 0.70 -sib <float> Relevance scoring for term identified as having a "Sibling" relationship to the MeSH Heading via lookup in the UMLS MRREL file. Range: 0.00 to 1.00
Other Related 0.50 -oth <float> Relevance scoring for term identified as having an "Other Related" relationship (not synonymous, narrower, or broader) to the MeSH Heading via lookup in the UMLS MRREL file. Range: 0.00 to 1.00
 
Ranking
Parameter Default Value Option Notes
Co-occurrences Factor 10000 -cot <int> Relevance scoring for terms identified as co-occurring with another term. Co-occurrences are concepts that occur together in the same "entries" in some information source. Co-occurrence is identified using the UMLS MRCOC file. Valid range is 0 to 32,767 where 0 turns it off.
Title Factor 20 -ti <int> Booster for MeSH Headings identified as coming from the Title field of the citation being processed. Value is a percentage of the current score to add. For example, the default is 20. We would multiply the current score by 120%. NOTE: Not currently used! This parameter has been superceded by the "Emphasize Titles" factor which is a defined doubling of the score for items found in the Title field of the citation. This emphasis is done after ranking and clustering.
Related Term Factor 100 -rel <int> Relevance scoring for terms identified as being related via the MeSH tree structure. This is used during the Clustering phase and figures into the overall RankScore for an item. Valid range is 0 to 32,767 where 0 turns it off.
 
Misc.
Parameter Default Value Option Notes
Emphasize HSTAR 0.00 -emp_hstar <float> Boost the scoring for MeSH terms found in the following MeSH tree hierarchies: N01 - N05, G02 - G03, L01.
score = current_score + (current_score * HSTAR_FACTOR)
We have had limited success with a HSTAR_FACTOR of 20. Valid range is 0.00 and up. 0.00 turns it off.


Close Window